Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack

Francesco Croce; Matthias Hein

2020 ICML ICML 2020

Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack

Abstract

The evaluation of robustness against adversarial manipulation of neural networks-based classifiers is mainly tested with empirical attacks as methods for the exact computation, even when available, do not scale to large networks. We propose in this paper a new white-box adversarial attack wrt the $l_p$-norms for $p \in \{1,2,\infty\}$ aiming at finding the minimal perturbation necessary to change the class of a given input. It has an intuitive geometric meaning, yields quickly high quality results, minimizes the size of the perturbation (so that it returns the robust accuracy at every threshold with a single run). It performs better or similar to state-of-the-art attacks which are partially specialized to one $l_p$-norm, and is robust to the phenomenon of gradient obfuscation.

🧭 Keyword Pioneer — perturbation minimization

🐣 Hot Topic Early Bird — adversarial robustness

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Francesco Croce , Matthias Hein

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Optimization

Keywords

adversarial robustness adversarial attack adversarial example neural network classifier white-box attack perturbation minimization

Download PDF

Related papers

Correlation Clustering with Asymmetric Classification Errors 2020

Learning Portable Representations for High-Level Planning 2020

Proving the Lottery Ticket Hypothesis: Pruning is All You Need 2020

Minimax Pareto Fairness: A Multi Objective Perspective 2020

DeepMatch: Balancing Deep Covariate Representations for Causal Inference Using Adversarial Training 2020