|国家预印本平台
首页|基于边界混淆的决策对抗攻击防御方法

基于边界混淆的决策对抗攻击防御方法

efense Method of Decision-based Adversarial Attack Based on Obfuscation Near the Boundary

中文摘要英文摘要

黑盒决策攻击是针对深度神经网络模型的一种高效的攻击方法,且近年来出现了一批查询成本低且攻击效果优秀的算法,为解决缺少针对性防御方法的问题,提出了一种基于边界混淆的决策对抗攻击防御方法--ONBD (Obfuscation Near the Boundary Defence)。该防御方法利用检验输出一致性的方法判别样本是否位于决策边界附近,之后对位于决策边界附近的样本预测结果进行基于规则的随机变换,以此混淆攻击方获得的决策边界信息,从而妨碍决策攻击算法的正常运行。在ImageNet数据集上的仿真实验结果表明,ONBD的干净样本准确率以及在Boundary Attack、QEBA、SurFree三种决策攻击下的防御效果均优于SND防御方法。实验结果表明,基于边界混淆的防御方法可以在几乎不损失分类准确率的条件下对黑盒决策攻击进行有效防御。

ecision-based black box attack is an efficient attack method against the deep neural network model, and in recent years, a number of algorithms with low query cost and excellent attack effect have emerged. To solve the problem of lack of targeted defense methods, a defense method based on boundary confusion, namely ONBD (Obstruction Near the Boundary Defense), was proposed. This defense method used the method of testing the consistency of output to determine whether the samples are located near the decision boundary, and then randomly transformed the prediction results of the samples located near the decision boundary based on rules, so as to confuse the decision boundary information obtained by the attack algorithms, thus hindering the normal operation of the decision-based attack algorithm. The simulation results on ImageNet dataset demonstrated that ONBD is superior to SND in terms of the classification accuracy of clean samples and the defense effect under the three decision-based attacks of Boundary Attack, QEBA and SurFree. The experimental results showed that the defense method based on boundary confusion can effectively defend against black box decision attacks with almost no loss of classification accuracy on clean samples.

伍淳华、潘立昊

计算技术、计算机技术

深度神经网络对抗攻击对抗攻击防御对抗样本决策边界

deep neural networkadversarial attackadversarial defenseadversarial examplesdecision boundary

伍淳华,潘立昊.基于边界混淆的决策对抗攻击防御方法[EB/OL].(2023-02-09)[2025-08-11].http://www.paper.edu.cn/releasepaper/content/202302-46.点此复制

评论