|国家预印本平台
首页|Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

来源:Arxiv_logoArxiv
英文摘要

Deep learning models are widely employed in safety-critical applications yet remain susceptible to adversarial attacks -- imperceptible perturbations that can significantly degrade model performance. Conventional defense mechanisms predominantly focus on either enhancing model robustness or detecting adversarial inputs independently. In this work, we propose an Unsupervised adversarial detection via Contrastive Auxiliary Networks (U-CAN) to uncover adversarial behavior within auxiliary feature representations, without the need for adversarial examples. U-CAN is embedded within selected intermediate layers of the target model. These auxiliary networks, comprising projection layers and ArcFace-based linear layers, refine feature representations to more effectively distinguish between benign and adversarial inputs. Comprehensive experiments across multiple datasets (CIFAR-10, Mammals, and a subset of ImageNet) and architectures (ResNet-50, VGG-16, and ViT) demonstrate that our method surpasses existing unsupervised adversarial detection techniques, achieving superior F1 scores against four distinct attack methods. The proposed framework provides a scalable and effective solution for enhancing the security and reliability of deep learning systems.

Eylon Mizrahi、Raz Lapid、Moshe Sipper

计算技术、计算机技术

Eylon Mizrahi,Raz Lapid,Moshe Sipper.Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks[EB/OL].(2025-07-22)[2025-08-16].https://arxiv.org/abs/2502.09110.点此复制

评论