2021
NIPS
NeurIPS 2021
Interpolation can hurt robust generalization even when there is no noise
Abstract
Numerous recent works show that overparameterization implicitly reduces variance for min-norm interpolators and max-margin classifiers. These findings suggest that ridge regularization has vanishing benefits in high dimensions. We challenge this narrative by showing that, even in the absence of noise, avoiding interpolation through ridge regularization can significantly improve generalization. We prove this phenomenon for the robust risk of both linear regression and classification, and hence provide the first theoretical result on \emph{robust overfitting}.
🧭
Keyword Pioneer
— robust overfitting
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing
Authors
Topics
Machine Learning > Core Methods > Classification
Machine Learning > Core Methods > Regression
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Learning Types > Regularization
Machine Learning > Learning Types > Robustness
Machine Learning > Optimization & Theory > Generalization