|国家预印本平台
首页|Towards Temporally Explainable Dysarthric Speech Clarity Assessment

Towards Temporally Explainable Dysarthric Speech Clarity Assessment

Towards Temporally Explainable Dysarthric Speech Clarity Assessment

来源:Arxiv_logoArxiv
英文摘要

Dysarthria, a motor speech disorder, affects intelligibility and requires targeted interventions for effective communication. In this work, we investigate automated mispronunciation feedback by collecting a dysarthric speech dataset from six speakers reading two passages, annotated by a speech therapist with temporal markers and mispronunciation descriptions. We design a three-stage framework for explainable mispronunciation evaluation: (1) overall clarity scoring, (2) mispronunciation localization, and (3) mispronunciation type classification. We systematically analyze pretrained Automatic Speech Recognition (ASR) models in each stage, assessing their effectiveness in dysarthric speech evaluation (Code available at: https://github.com/augmented-human-lab/interspeech25_speechtherapy, Supplementary webpage: https://apps.ahlab.org/interspeech25_speechtherapy/). Our findings offer clinically relevant insights for automating actionable feedback for pronunciation assessment, which could enable independent practice for patients and help therapists deliver more effective interventions.

Seohyun Park、Chitralekha Gupta、Michelle Kah Yian Kwan、Xinhui Fung、Alexander Wenjun Yip、Suranga Nanayakkara

医学研究方法神经病学、精神病学

Seohyun Park,Chitralekha Gupta,Michelle Kah Yian Kwan,Xinhui Fung,Alexander Wenjun Yip,Suranga Nanayakkara.Towards Temporally Explainable Dysarthric Speech Clarity Assessment[EB/OL].(2025-05-31)[2025-07-22].https://arxiv.org/abs/2506.00454.点此复制

评论