首页|Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems

Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems

来源：

英文摘要

The temporal dynamics of speech, encompassing variations in rhythm, intonation, and speaking rate, contain important and unique information about speaker identity. This paper proposes a new method for representing speaker characteristics by extracting context-dependent duration embeddings from speech temporal dynamics. We develop novel attack models using these representations and analyze the potential vulnerabilities in speaker verification and voice anonymization systems.The experimental results show that the developed attack models provide a significant improvement in speaker verification performance for both original and anonymized data in comparison with simpler representations of speech temporal dynamics reported in the literature.

作者：Natalia Tomashenko、Emmanuel Vincent、Marc Tommasi

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Natalia Tomashenko,Emmanuel Vincent,Marc Tommasi.Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems[EB/OL].(2025-07-21)[2025-08-10].https://arxiv.org/abs/2507.15214.点此复制

Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems

Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems

评论