|国家预印本平台
首页|IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification

IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification

IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification

来源:Arxiv_logoArxiv
英文摘要

Hand gestures are a primary output of the human motor system, yet the decoding of their neuromuscular signatures remains a bottleneck for basic neuroscience and assistive technologies such as prosthetics. Traditional human-machine interface pipelines rely on a single biosignal modality, but multimodal fusion can exploit complementary information from sensors. We systematically compare linear and attention-based fusion strategies across three architectures: a Multimodal MLP, a Multimodal Transformer, and a Hierarchical Transformer, evaluating performance on scenarios with unimodal and multimodal inputs. Experiments use two publicly available datasets: NinaPro DB2 (sEMG and accelerometer) and HD-sEMG 65-Gesture (high-density sEMG and force). Across both datasets, the Hierarchical Transformer with attention-based fusion consistently achieved the highest accuracy, surpassing the multimodal and best single-modality linear-fusion MLP baseline by over 10% on NinaPro DB2 and 3.7% on HD-sEMG. To investigate how modalities interact, we introduce an Isolation Network that selectively silences unimodal or cross-modal attention pathways, quantifying each group of token interactions' contribution to downstream decisions. Ablations reveal that cross-modal interactions contribute approximately 30% of the decision signal across transformer layers, highlighting the importance of attention-driven fusion in harnessing complementary modality information. Together, these findings reveal when and how multimodal fusion would enhance biosignal classification and also provides mechanistic insights of human muscle activities. The study would be beneficial in the design of sensor arrays for neurorobotic systems.

Eion Tyacke、Kunal Gupta、Jay Patel、Rui Li

生物科学研究方法、生物科学研究技术计算技术、计算机技术

Eion Tyacke,Kunal Gupta,Jay Patel,Rui Li.IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification[EB/OL].(2025-06-20)[2025-06-29].https://arxiv.org/abs/2506.16744.点此复制

评论