Quantifying Source Speaker Leakage in One-to-One Voice Conversion

Abstract

Using a multi-accented corpus of parallel utterances for use with commercial speech devices, we present a case study to show that it is possible to quantify a degree of confidence about a source speaker's identity in the case of one-to-one voice conversion. Following voice conversion using a HiFi-GAN vocoder, we compare information leakage for a range of speaker characteristics; assuming a "worst-case" white-box scenario, we quantify our confidence to perform inference and narrow the pool of likely source speakers, reinforcing the regulatory obligation and moral duty that providers of synthetic voices have to ensure the privacy of their speakers' data.

Authors: Scott Wellington, Xuechen Liu, Junichi Yamagishi

DOI: 10.1109/BIOSIG61931.2024.10786731

Subject classification: Computing technology, computer technology

Recommended citation: Scott Wellington, Xuechen Liu, Junichi Yamagishi. Quantifying Source Speaker Leakage in One-to-One Voice Conversion [EB/OL]. (2025-04-22) [2025-07-01]. https://arxiv.org/abs/2504.15822.
