Overfitting in Making Comparisons Between Variable Selection Methods

Juha Reunanen

2003 JMLR JMLR 2003

Overfitting in Making Comparisons Between Variable Selection Methods

Abstract

This paper addresses a common methodological flaw in the comparison of variable selection methods. A practical approach to guide the search or the selection process is to compute cross-validation performance estimates of the different variable subsets. Used with computationally intensive search algorithms, these estimates may overfit and yield biased predictions. Therefore, they cannot be used reliably to compare two selection methods, as is shown by the empirical results of this paper. Instead, like in other instances of the model selection problem, independent test sets should be used for determining the final performance. The claims made in the literature about the superiority of more exhaustive search algorithms over simpler ones are also revisited, and some of them infirmed. [abs] [pdf] [ps.gz] [ps]

📈 Trend Setter — Statistical Learning

🧭 Keyword Pioneer — feature subset

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

🌱 Topic Pioneer — Evaluation

🐣 Hot Topic Early Bird — model selection

Authors

Juha Reunanen

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Optimization & Theory > Theory Machine Learning > Core Methods > Feature Selection Machine Learning > Optimization & Theory > Evaluation

Keywords

model selection variable selection test set feature subset

Download PDF

Related papers

Bottom-Up Relational Learning of Pattern Matching Rules for Information Extraction 2003

An Efficient Boosting Algorithm for Combining Preferences 2003

A Multiscale Framework For Blind Separation of Linearly Mixed Signals 2003

Word-Sequence Kernels 2003

An Extensive Empirical Study of Feature Selection Metrics for Text Classification 2003