Multi-Target Backdoor Attacks Against Speaker Recognition
Multi-Target Backdoor Attacks Against Speaker Recognition
In this work, we propose a multi-target backdoor attack against speaker identification using position-independent clicking sounds as triggers. Unlike previous single-target approaches, our method targets up to 50 speakers simultaneously, achieving success rates of up to 95.04%. To simulate more realistic attack conditions, we vary the signal-to-noise ratio between speech and trigger, demonstrating a trade-off between stealth and effectiveness. We further extend the attack to the speaker verification task by selecting the most similar training speaker - based on cosine similarity - as a proxy target. The attack is most effective when target and enrolled speaker pairs are highly similar, reaching success rates of up to 90% in such cases.
Alexandrine Fortier、Sonal Joshi、Thomas Thebaud、Jesus Villalba Lopez、Najim Dehak、Patrick Cardinal
计算技术、计算机技术
Alexandrine Fortier,Sonal Joshi,Thomas Thebaud,Jesus Villalba Lopez,Najim Dehak,Patrick Cardinal.Multi-Target Backdoor Attacks Against Speaker Recognition[EB/OL].(2025-08-13)[2025-08-24].https://arxiv.org/abs/2508.08559.点此复制
评论