Feature Selection with Ensembles, Artificial Variables, and Redundancy Elimination

Eugene Tuv; Alexander Borisov; George Runger; Kari Torkkola

JMLR, 2009

Abstract

Predictive models benefit from a compact, non-redundant subset of features that improves interpretability and generalization. Modern data sets are wide, dirty, mixed with both numerical and categorical predictors, and may contain interactive effects that require complex models. This is a challenge for filters, wrappers, and embedded feature selection methods. We describe details of an algorithm using tree-based ensembles to generate a compact subset of non-redundant features. Parallel and serial ensembles of trees are combined into a mixed method that can uncover masking and detect features of secondary effect. Simulated and actual examples illustrate the effectiveness of the approach. © JMLR 2009.
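The artificial-variable idea named in the title can be illustrated with a small sketch. This is not the authors' algorithm. It is a simplified, standard-library-only illustration under stated assumptions: depth-one trees (stumps) stand in for full trees, each artificial variable is a shuffled copy of a real feature, and a feature is kept when its split gain beats the best artificial variable in most subsampled rounds. The names `stump_gain`, `select_features`, `make_demo_data`, and the parameters `n_rounds` and `min_win_rate` are hypothetical. Redundancy elimination and the serial ensemble are left out, and stumps cannot capture interaction effects.

```python
import random


def stump_gain(xs, ys):
    """Largest drop in squared error from one threshold split of ys on xs."""
    pairs = sorted(zip(xs, ys))
    n = len(pairs)
    total = sum(y for _, y in pairs)
    best, left_sum = 0.0, 0.0
    for i in range(1, n):
        left_sum += pairs[i - 1][1]
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between equal feature values
        right_sum = total - left_sum
        gain = left_sum ** 2 / i + right_sum ** 2 / (n - i) - total ** 2 / n
        best = max(best, gain)
    return best


def select_features(X, y, n_rounds=30, min_win_rate=0.9, seed=0):
    """Keep features whose gain usually beats every shuffled (artificial) copy."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    wins = [0] * p
    for _ in range(n_rounds):
        # Subsample rows without replacement so each round sees different data.
        rows = rng.sample(range(n), (2 * n) // 3)
        y_sub = [y[i] for i in rows]
        real, contrast = [], []
        for j in range(p):
            x_sub = [X[i][j] for i in rows]
            shuffled = x_sub[:]
            rng.shuffle(shuffled)  # same marginal distribution, no link to y
            real.append(stump_gain(x_sub, y_sub))
            contrast.append(stump_gain(shuffled, y_sub))
        bar = max(contrast)  # the best score pure noise reached this round
        for j in range(p):
            wins[j] += real[j] > bar
    return [j for j in range(p) if wins[j] / n_rounds >= min_win_rate]


def make_demo_data(n=200, seed=1):
    """Synthetic data: feature 0 drives y, features 1 and 2 are noise."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(n)]
    y = [3.0 * row[0] + rng.gauss(0, 1) for row in X]
    return X, y


X, y = make_demo_data()
selected = select_features(X, y)
print(selected)
```

Comparing against shuffled copies gives the importance scores a data-driven noise floor, so no fixed importance cutoff has to be chosen by hand. A fuller treatment would use importances from a real tree ensemble and a statistical test across rounds instead of a simple win rate.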

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Application Areas > Domain Adaptation

Keywords

feature selection; tree-based ensemble; ensemble method; predictive model; redundancy elimination
