基于深度学习的说话人匿名化方法
Speaker anonymization method based on deep learning
随着隐私立法的增加,个人身份信息的保护变得愈发重要,特别是语音等生物识别数据。语音中可能包含性别、身份等敏感信息,面临数据泄露风险。用户不愿将这些信息暴露给不受信任的第三方,因此对语音数据的隐私保护显得尤为重要。本文探讨了当前语音数据隐私保护的研究方法,如基于密码学的保护、分布式学习以及匿名化方法等。提出了一种基于深度学习的说话人匿名化方案,结合了TF-IDF与TextRank敏感词提取算法,以提高语音内容的安全性。同时,优化了x-vectorr模型,通过引入改进的WGAN算法,增强了声纹特征的匿名化处理能力。实验结果表明,本文提出的方法在关键词提取和说话人特征提取任务中均表现优异,显著优于传统方法,具有良好的应用前景。
With the increase of privacy legislation, the protection of personal identity information has become increasingly important, especially biometric data such as voice. Speech may contain sensitive information such as gender and identity, posing a risk of data leakage. Users are unwilling to expose this information to untrusted third parties, so privacy protection of voice data is particularly important. This article explores current research methods for voice data privacy protection, such as cryptographic based protection, distributed learning, and anonymization methods. A speaker anonymization scheme based on deep learning is proposed, which combines TF-IDF and TextRank sensitive word extraction algorithms to improve the security of speech content. At the same time, the x-vector model was optimized, and the improved WGAN algorithm was introduced to enhance the anonymity processing capability of voiceprint features. The experimental results show that the method proposed in this paper performs well in both keyword extraction and speaker feature extraction tasks, significantly outperforming traditional methods and having good application prospects.
张珂雯、彭海朋
计算技术、计算机技术电子技术应用
语音隐私说话人匿名化声纹识别生成对抗网络
Voice PrivacySpeaker SnonymizationVoiceprint RecognitionGenerate Adversarial Networks
张珂雯,彭海朋.基于深度学习的说话人匿名化方法[EB/OL].(2025-02-25)[2025-08-16].http://www.paper.edu.cn/releasepaper/content/202502-104.点此复制
评论