Cross-validation Confidence Intervals for Test Error

Pierre Bayle; Alexandre Bayle; Lucas Janson; Lester W. Mackey

2020 NIPS NeurIPS 2020

Cross-validation Confidence Intervals for Test Error

Abstract

This work develops central limit theorems for cross-validation and consistent estimators of the asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for k-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller k-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Pierre Bayle , Alexandre Bayle , Lucas Janson , Lester W. Mackey

Topics

Machine Learning > Core Methods > Regression Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Optimization & Theory > Statistics Machine Learning > Optimization & Theory > Evaluation Machine Learning > Learning Types > Evaluation

Keywords

hypothesis testing asymptotic variance test error confidence interval hypothesis test leave-one-out cross-validation

Download PDF

Related papers

Higher-Order Spectral Clustering of Directed Graphs 2020

Self-Supervised MultiModal Versatile Networks 2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation 2020

Taming Discrete Integration via the Boon of Dimensionality 2020