Improving the transferability of adversarial attacks via self-ensemble

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Abstract: Deep neural networks have been used extensively for diverse visual tasks, including object detection, face recognition, and image classification. However, they face several security threats, such as adversarial attacks. To improve the resistance of neural networks to adversarial attacks, researchers have investigated the security issues of models from the perspectives of both attacks and defenses. Recently, the transferability of adversarial attacks has received extensive attention, which promotes the application of adversarial attacks in practical scenarios. However, existing transferable attacks tend to trap into a poor local optimum and significantly degrade the transferability because the production of adversarial samples lacks randomness. Therefore, we propose a self-ensemble-based feature-level adversarial attack (SEFA) to boost transferability by randomly disrupting salient features. We provide theoretical analysis to demonstrate the superiority of the proposed method. In particular, perturbing the refined feature importance weighted intermediate features suppresses positive features and encourages negative features to realize adversarial attacks. Subsequently, self-ensemble is introduced to solve the optimization problem, thus enhancing the diversity from an optimization perspective. The diverse orthogonal initial perturbations disrupt these features stochastically, searching the space of transferable perturbations exhaustively to avoid poor local optima and improve transferability effectively. Extensive experiments show the effectiveness and superiority of the proposed SEFA, i.e., the success rates against undefended models and defense models are improved by 7.7% and 13.4%, respectively, compared with existing transferable attacks. Our code is available at https://github.com/chengshuyan/SEFA. Graphical abstract: (Figure presented.)

Original languageEnglish
Pages (from-to)10608-10626
Number of pages19
JournalApplied Intelligence
Volume54
Issue number21
DOIs
StatePublished - Nov 2024

Keywords

  • Adversarial examples
  • Black-box attacks
  • Feature importance
  • Self-ensemble
  • Transferability

Fingerprint

Dive into the research topics of 'Improving the transferability of adversarial attacks via self-ensemble'. Together they form a unique fingerprint.

Cite this