首页|Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

来源：

英文摘要

Deep learning models are widely employed in safety-critical applications yet remain susceptible to adversarial attacks -- imperceptible perturbations that can significantly degrade model performance. Conventional defense mechanisms predominantly focus on either enhancing model robustness or detecting adversarial inputs independently. In this work, we propose an Unsupervised adversarial detection via Contrastive Auxiliary Networks (U-CAN) to uncover adversarial behavior within auxiliary feature representations, without the need for adversarial examples. U-CAN is embedded within selected intermediate layers of the target model. These auxiliary networks, comprising projection layers and ArcFace-based linear layers, refine feature representations to more effectively distinguish between benign and adversarial inputs. Comprehensive experiments across multiple datasets (CIFAR-10, Mammals, and a subset of ImageNet) and architectures (ResNet-50, VGG-16, and ViT) demonstrate that our method surpasses existing unsupervised adversarial detection techniques, achieving superior F1 scores against four distinct attack methods. The proposed framework provides a scalable and effective solution for enhancing the security and reliability of deep learning systems.

作者：Eylon Mizrahi、Raz Lapid、Moshe Sipper

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Eylon Mizrahi,Raz Lapid,Moshe Sipper.Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks[EB/OL].(2025-07-22)[2025-08-16].https://arxiv.org/abs/2502.09110.点此复制

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

评论