Gradient Methods Provably Converge to Non-Robust Networks

Gal Vardi; Gilad Yehudai; Ohad Shamir

2022 NIPS NeurIPS 2022

Gradient Methods Provably Converge to Non-Robust Networks

Abstract

Despite a great deal of research, it is still unclear why neural networks are so susceptible to adversarial examples. In this work, we identify natural settings where depth-$2$ ReLU networks trained with gradient flow are provably non-robust (susceptible to small adversarial $\ell_2$-perturbations), even when robust networks that classify the training dataset correctly exist.Perhaps surprisingly, we show that the well-known implicit bias towards margin maximization induces bias towards non-robust networks, by proving that every network which satisfies the KKT conditions of the max-margin problem is non-robust.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Gal Vardi , Gilad Yehudai , Ohad Shamir

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Learning Theory Artificial Intelligence > Core AI > Adversarial Learning Deep Learning > Optimization & Theory > Theory Machine Learning > Learning Types > Robustness

Keywords

adversarial robustness gradient descent margin maximization gradient flow implicit bia adversarial example relu network neural network

Download PDF

Related papers

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching 2022

A Theoretical View on Sparsely Activated Networks 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks 2022

Matryoshka Representation Learning 2022

Off-Policy Evaluation with Deficient Support Using Side Information 2022