|国家预印本平台
首页|Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches

Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches

Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches

来源:Arxiv_logoArxiv
英文摘要

The analysis of speech in individuals with amyotrophic lateral sclerosis is a powerful tool to support clinicians in the assessment of bulbar dysfunction. However, current methods used in clinical practice consist of subjective evaluations or expensive instrumentation. This study investigates different approaches combining audio-visual analysis and machine learning to predict the speech impairment evaluation performed by clinicians. Using a small dataset of acoustic and kinematic features extracted from audio and video recordings of speech tasks, we trained and tested some regression models. The best performance was achieved using the extreme boosting machine regressor with multimodal features, which resulted in a root mean squared error of 0.93 on a scale ranging from 5 to 25. Results suggest that integrating audio-video analysis enhances speech impairment assessment, providing an objective tool for early detection and monitoring of bulbar dysfunction, also in home settings.

Francesco Pierotti、Andrea Bandini

医学研究方法神经病学、精神病学

Francesco Pierotti,Andrea Bandini.Multimodal Assessment of Speech Impairment in ALS Using Audio-Visual and Machine Learning Approaches[EB/OL].(2025-05-27)[2025-06-15].https://arxiv.org/abs/2505.21093.点此复制

评论