Open-Set Source Tracing of Audio Deepfake Systems
Open-Set Source Tracing of Audio Deepfake Systems
Existing research on source tracing of audio deepfake systems has focused primarily on the closed-set scenario, while studies that evaluate open-set performance are limited to a small number of unseen systems. Due to the large number of emerging audio deepfake systems, robust open-set source tracing is critical. We leverage the protocol of the Interspeech 2025 special session on source tracing to evaluate methods for improving open-set source tracing performance. We introduce a novel adaptation to the energy score for out-of-distribution (OOD) detection, softmax energy (SME). We find that replacing the typical temperature-scaled energy score with SME provides a relative average improvement of 31% in the standard FPR95 (false positive rate at true positive rate of 95%) measure. We further explore SME-guided training as well as copy synthesis, codec, and reverberation augmentations, yielding an FPR95 of 8.3%.
Nicholas Klein、Hemlata Tak、Elie Khoury
计算技术、计算机技术
Nicholas Klein,Hemlata Tak,Elie Khoury.Open-Set Source Tracing of Audio Deepfake Systems[EB/OL].(2025-07-09)[2025-07-20].https://arxiv.org/abs/2507.06470.点此复制
评论