Interpolation can hurt robust generalization even when there is no noise

Konstantin Donhauser; Alexandru Tifrea; Michael Aerni; Reinhard Heckel; Fanny Yang

2021 NIPS NeurIPS 2021

Interpolation can hurt robust generalization even when there is no noise

Abstract

Numerous recent works show that overparameterization implicitly reduces variance for min-norm interpolators and max-margin classifiers. These findings suggest that ridge regularization has vanishing benefits in high dimensions. We challenge this narrative by showing that, even in the absence of noise, avoiding interpolation through ridge regularization can significantly improve generalization. We prove this phenomenon for the robust risk of both linear regression and classification, and hence provide the first theoretical result on \emph{robust overfitting}.

🧭 Keyword Pioneer — robust overfitting

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Natural Language Processing

Authors

Konstantin Donhauser , Alexandru Tifrea , Michael Aerni , Reinhard Heckel , Fanny Yang

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Regression Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Learning Types > Regularization Machine Learning > Learning Types > Robustness Machine Learning > Optimization & Theory > Generalization

Keywords

robust overfitting robust generalization ridge regularization max-margin classifier min-norm interpolator

Download PDF

Related papers

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data 2021

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation 2021

Test-Time Personalization with a Transformer for Human Pose Estimation 2021

NTopo: Mesh-free Topology Optimization using Implicit Neural Representations 2021

Scalable Intervention Target Estimation in Linear Models 2021