Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality

Yi Zhang; Orestis Plevrakis; Simon S Du; Xingguo Li; Zhao Song; Sanjeev Arora

2020 NIPS NeurIPS 2020

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality

Abstract

Adversarial training is a popular method to give neural nets robustness against adversarial perturbations. In practice adversarial training leads to low robust training loss. However, a rigorous explanation for why this happens under natural conditions is still missing. Recently a convergence theory of standard (non-adversarial) supervised training was developed by various groups for {\em very overparametrized} nets. It is unclear how to extend these results to adversarial training because of the min-max objective. Recently, a first step towards this direction was made by Gao et al. using tools from online learning, but they require the width of the net to be \emph{exponential} in input dimension $d$, and with an unnatural activation function. Our work proves convergence to low robust training loss for \emph{polynomial} width instead of exponential, under natural assumptions and with ReLU activations. A key element of our proof is showing that ReLU networks near initialization can approximate the step function, which may be of independent interest.

🧭 Keyword Pioneer — robust training loss

🐣 Hot Topic Early Bird — convergence guarantee

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yi Zhang , Orestis Plevrakis , Simon S Du , Xingguo Li , Zhao Song , Sanjeev Arora

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Neural Network Optimization Machine Learning > Optimization & Theory > Theory

Keywords

robust optimization convergence analysis adversarial training neural network optimization convergence guarantee relu activation over-parameterized network relu network neural network robust training loss

Download PDF

Related papers

Higher-Order Spectral Clustering of Directed Graphs 2020

Self-Supervised MultiModal Versatile Networks 2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation 2020

Taming Discrete Integration via the Boon of Dimensionality 2020