ClaritySpeech: Dementia Obfuscation in Speech

Source: arXiv
Abstract

Dementia, a neurodegenerative disease, alters speech patterns, creating communication barriers and raising privacy concerns. Current speech technologies, such as automatic speech recognition (ASR), struggle with dementia-affected and other atypical speech, which further limits accessibility. This paper presents ClaritySpeech, a novel framework for dementia obfuscation in speech that integrates ASR, text obfuscation, and zero-shot text-to-speech (TTS) to correct dementia-affected speech while preserving speaker identity in low-data environments without fine-tuning. Results show a 16% and 10% drop in mean F1 score across various adversarial settings and modalities (audio, text, fusion) for ADReSS and ADReSSo, respectively, while maintaining 50% speaker similarity. We also find that our system improves word error rate (WER) from 0.73 to 0.08 for ADReSS and 0.15 for ADReSSo, and speech quality from 1.65 to ~2.15, enhancing privacy and accessibility.
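
The WER figures quoted in the abstract refer to word error rate, which is conventionally the word-level edit distance between a reference transcript and a hypothesis, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the authors' evaluation code; the function name `word_error_rate` and the example sentences are illustrative assumptions, and real evaluations usually normalize case and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name); text normalization is left
    to the caller.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substitution ("sat" -> "sit") and one deletion ("the")
# against a six-word reference.
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER = {score:.2f}")
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.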

Dominika Woszczyk, Ranya Aloufi, Soteris Demetriou

Linguistics

Dominika Woszczyk, Ranya Aloufi, Soteris Demetriou. ClaritySpeech: Dementia Obfuscation in Speech[EB/OL]. (2025-07-12)[2025-07-25]. https://arxiv.org/abs/2507.09282.
