首页|LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

来源：

英文摘要

Visual encoders have become fundamental components in modern computer vision pipelines. However, ensuring robustness against adversarial perturbations remains a critical challenge. Recent efforts have explored both supervised and unsupervised adversarial fine-tuning strategies. We identify two key limitations in these approaches: (i) they often suffer from instability, especially during the early stages of fine-tuning, resulting in suboptimal convergence and degraded performance on clean data, and (ii) they exhibit a suboptimal trade-off between robustness and clean data accuracy, hindering the simultaneous optimization of both objectives. To overcome these challenges, we propose Lagrangian-Optimized Robust Embeddings (LORE), a novel unsupervised adversarial fine-tuning framework. LORE utilizes constrained optimization, which offers a principled approach to balancing competing goals, such as improving robustness while preserving nominal performance. By enforcing embedding-space proximity constraints, LORE effectively maintains clean data performance throughout adversarial fine-tuning. Extensive experiments show that LORE significantly improves zero-shot adversarial robustness with minimal degradation in clean data accuracy. Furthermore, we demonstrate the effectiveness of the adversarially fine-tuned CLIP image encoder in out-of-distribution generalization and enhancing the interpretability of image embeddings.

作者：Borna Khodabandeh、Amirabbas Afzali、Amirhossein Afsharrad、Seyed Shahabeddin Mousavi、Sanjay Lall、Sajjad Amini、Seyed-Mohsen Moosavi-Dezfooli

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Borna Khodabandeh,Amirabbas Afzali,Amirhossein Afsharrad,Seyed Shahabeddin Mousavi,Sanjay Lall,Sajjad Amini,Seyed-Mohsen Moosavi-Dezfooli.LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders[EB/OL].(2025-05-24)[2025-06-21].https://arxiv.org/abs/2505.18884.点此复制

LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders

评论