A Fourier Explanation of AI-music Artifacts
A Fourier Explanation of AI-music Artifacts
The rapid rise of generative AI has transformed music creation, with millions of users engaging in AI-generated music. Despite its popularity, concerns regarding copyright infringement, job displacement, and ethical implications have led to growing scrutiny and legal challenges. In parallel, AI-detection services have emerged, yet these systems remain largely opaque and privately controlled, mirroring the very issues they aim to address. This paper explores the fundamental properties of synthetic content and how it can be detected. Specifically, we analyze deconvolution modules commonly used in generative models and mathematically prove that their outputs exhibit systematic frequency artifacts -- manifesting as small yet distinctive spectral peaks. This phenomenon, related to the well-known checkerboard artifact, is shown to be inherent to a chosen model architecture rather than a consequence of training data or model weights. We validate our theoretical findings through extensive experiments on open-source models, as well as commercial AI-music generators such as Suno and Udio. We use these insights to propose a simple and interpretable detection criterion for AI-generated music. Despite its simplicity, our method achieves detection accuracy on par with deep learning-based approaches, surpassing 99% accuracy on several scenarios.
Darius Afchar、Gabriel Meseguer-Brocal、Kamil Akesbi、Romain Hennequin
数学信息科学、信息技术计算技术、计算机技术
Darius Afchar,Gabriel Meseguer-Brocal,Kamil Akesbi,Romain Hennequin.A Fourier Explanation of AI-music Artifacts[EB/OL].(2025-06-23)[2025-07-16].https://arxiv.org/abs/2506.19108.点此复制
评论