首页|Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech

Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech

来源：

英文摘要

Dysarthric speech poses significant challenges for automatic speech recognition (ASR) systems due to its high variability and reduced intelligibility. In this work we explore the use of diffusion models for dysarthric speech enhancement, which is based on the hypothesis that using diffusion-based speech enhancement moves the distribution of dysarthric speech closer to that of typical speech, which could potentially improve dysarthric speech recognition performance. We assess the effect of two diffusion-based and one signal-processing-based speech enhancement algorithms on intelligibility and speech quality of two English dysarthric speech corpora. We applied speech enhancement to both typical and dysarthric speech and evaluate the ASR performance using Whisper-Turbo, and the subjective and objective speech quality of the original and enhanced dysarthric speech. We also fine-tuned Whisper-Turbo on the enhanced speech to assess its impact on recognition performance.

作者：Dimme de Groot、Tanvina Patel、Devendra Kayande、Odette Scharenborg、Zhengjun Yue

作者单位：

DOI：10.21437/Interspeech.2025-2768

学科分类：通信无线通信

推荐引用：Dimme de Groot,Tanvina Patel,Devendra Kayande,Odette Scharenborg,Zhengjun Yue.Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech[EB/OL].(2025-08-25)[2025-09-06].https://arxiv.org/abs/2508.17980.点此复制

Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech

Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech

评论