Improving transferability of adversarial examples via statistical attribution-based attacks.
Journal:
Neural Networks: the official journal of the International Neural Network Society
PMID:
40086136
Abstract
Adversarial attacks are significant in uncovering vulnerabilities and assessing the robustness of deep neural networks (DNNs), offering profound insights into their internal mechanisms. Feature-level attacks, a potent approach, craft adversarial examples by extensively corrupting the intermediate-layer features of the source model at each iteration. However, such attacks often rely on imprecise metrics to assess the significance of features, which can limit the transferability of the resulting adversarial examples. To address these issues, this paper introduces the Statistical Attribution-based Attack (SAA), which focuses on finding better feature-importance representations and refining the optimization objective, thereby achieving stronger attack performance. To compute a Comprehensive Gradient that represents feature importance more accurately, we introduce Region-wise Feature Disturbance and Gradient Information Aggregation, which effectively disrupt the model's attention focus areas. Subsequently, a statistical attribution-based approach is employed that leverages the average feature information across layers to provide a more advantageous optimization objective. Experiments validate the superiority of the method. Specifically, SAA improves the attack success rate by 9.3% compared with the second-best method, and when combined with input transformation methods it achieves an average success rate of 79.2% against eight leading defense models.
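To make the general idea concrete, the sketch below illustrates a feature-importance-weighted transfer attack in the spirit described by the abstract: gradients of an intermediate layer are aggregated over region-wise disturbed (randomly masked) inputs to form importance weights, and the adversarial example is then optimized to suppress the weighted features. This is a minimal sketch under assumed details (PyTorch, a hypothetical `layer` handle, the masking scheme, and all hyperparameters); it is not the authors' released implementation.

```python
# Hedged sketch of a feature-level transfer attack with aggregated gradients as
# importance weights. Helper names, mask granularity, and step sizes are assumptions.
import torch
import torch.nn.functional as F

def aggregated_feature_weights(model, layer, x, y, n_masks=30, keep_prob=0.7):
    """Average gradients of the true-class score w.r.t. an intermediate layer
    over randomly masked (region-wise disturbed) copies of the input."""
    feats = {}
    handle = layer.register_forward_hook(lambda m, i, o: feats.update(out=o))
    grad_sum = None
    for _ in range(n_masks):
        # Coarse region-wise binary mask, upsampled to the input resolution.
        coarse = (torch.rand(x.size(0), 1, 7, 7, device=x.device) < keep_prob).float()
        mask = F.interpolate(coarse, size=x.shape[-2:], mode="nearest")
        x_masked = (x * mask).clone().requires_grad_(True)
        logits = model(x_masked)
        score = logits.gather(1, y.unsqueeze(1)).sum()  # true-class score
        g = torch.autograd.grad(score, feats["out"])[0]
        grad_sum = g if grad_sum is None else grad_sum + g
    handle.remove()
    return grad_sum / n_masks  # aggregated gradient acts as an importance map

def feature_attack(model, layer, x, y, eps=16 / 255, alpha=1.6 / 255, steps=10):
    """Iteratively perturb x (within an L-infinity ball) so that
    importance-weighted intermediate features are suppressed."""
    weights = aggregated_feature_weights(model, layer, x, y).detach()
    feats = {}
    handle = layer.register_forward_hook(lambda m, i, o: feats.update(out=o))
    x_adv = x.clone()
    for _ in range(steps):
        x_adv = x_adv.detach().requires_grad_(True)
        model(x_adv)
        # Decreasing this weighted activation sum pushes features away from the class.
        loss = (weights * feats["out"]).sum()
        grad = torch.autograd.grad(loss, x_adv)[0]
        delta = (x_adv.detach() - alpha * grad.sign() - x).clamp(-eps, eps)
        x_adv = (x + delta).clamp(0, 1)
    handle.remove()
    return x_adv.detach()
```

In this reading, the aggregation over many masked inputs plays the role of the "Comprehensive Gradient" and yields a smoother importance estimate than a single backward pass; the paper's statistical attribution over average cross-layer features would replace the simple weighted-sum objective used here.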