Improving Black-box Adversarial Attacks with a Transfer-based Prior

Shuyu Cheng; Yinpeng Dong; Tianyu Pang; Hang Su; Jun Zhu

2019 NIPS NeurIPS 2019

Improving Black-box Adversarial Attacks with a Transfer-based Prior

Abstract

We consider the black-box adversarial setting, where the adversary has to generate adversarial perturbations without access to the target models to compute gradients. Previous methods tried to approximate the gradient either by using a transfer gradient of a surrogate white-box model, or based on the query feedback. However, these methods often suffer from low attack success rates or poor query efficiency since it is non-trivial to estimate the gradient in a high-dimensional space with limited information. To address these problems, we propose a prior-guided random gradient-free (P-RGF) method to improve black-box adversarial attacks, which takes the advantage of a transfer-based prior and the query information simultaneously. The transfer-based prior given by the gradient of a surrogate model is appropriately integrated into our algorithm by an optimal coefficient derived by a theoretical analysis. Extensive experiments demonstrate that our method requires much fewer queries to attack black-box models with higher success rates compared with the alternative state-of-the-art methods.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Science and Deep Learning and Machine Learning

🧭 Keyword Pioneer — black-box adversarial attack

🐣 Hot Topic Early Bird — gradient estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shuyu Cheng , Yinpeng Dong , Tianyu Pang , Hang Su , Jun Zhu

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Optimization Computer Science > Applications > Cybersecurity Artificial Intelligence > Core AI > Adversarial Learning Deep Learning > Learning Types > Adversarial Learning

Keywords

transfer learning query efficiency gradient estimation black-box attack adversarial perturbation surrogate model gradient-free optimization black-box adversarial attack transfer-based attack transfer-based prior

Download PDF

Related papers

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test 2019

Metalearned Neural Memory 2019

Model Similarity Mitigates Test Set Overuse 2019

Continual Unsupervised Representation Learning 2019

Reinforcement Learning with Convex Constraints 2019