Generalizable Detection of Audio Deepfakes

Abstract

In this paper, we present our comprehensive study aimed at enhancing the generalization capabilities of audio deepfake detection models. We investigate the performance of various pre-trained backbones, including Wav2Vec2, WavLM, and Whisper, across a diverse set of datasets, including those from the ASVspoof challenges and additional sources. Our experiments focus on the effects of different data augmentation strategies and loss functions on model performance. The results of our research demonstrate substantial enhancements in the generalization capabilities of audio deepfake detection models, surpassing the performance of the top-ranked single system in the ASVspoof 5 Challenge. This study contributes valuable insights into the optimization of audio models for more robust deepfake detection and facilitates future research in this critical area.

Authors: Jose A. Lopez, Georg Stemmer, Héctor Cordourier Maruri

Subject classification: Computing technology, computer technology

Recommended citation: Jose A. Lopez, Georg Stemmer, Héctor Cordourier Maruri. Generalizable Detection of Audio Deepfakes [EB/OL]. (2025-07-02) [2025-07-16]. https://arxiv.org/abs/2507.01750
