Attribute-efficient learning of decision lists and linear threshold functions under unconcentrated distributions

Philip M. Long; Rocco Servedio

NIPS 2006 (NeurIPS 2006)

Abstract

We consider the well-studied problem of learning decision lists using few examples when many irrelevant features are present. We show that smooth boosting algorithms such as MadaBoost can efficiently learn decision lists of length k over n boolean variables using poly(k, log n) many examples provided that the marginal distribution over the relevant variables is "not too concentrated" in an L2-norm sense. Using a recent result of Håstad, we extend the analysis to obtain a similar (though quantitatively weaker) result for learning arbitrary linear threshold functions with k nonzero coefficients. Experimental results indicate that the use of a smooth boosting algorithm, which plays a crucial role in our analysis, has an impact on the actual performance of the algorithm.
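For readers unfamiliar with the target class, the following is a minimal sketch of what a decision list of length k over n boolean variables computes. It is an illustration only, not code from the paper: the names `Rule` and `eval_decision_list`, the ±1 label convention, and the example list are all assumptions made here.

```python
from typing import List, Sequence, Tuple

# Hypothetical representation: a rule is (variable index, required value, output label).
Rule = Tuple[int, bool, int]


def eval_decision_list(rules: Sequence[Rule], default: int, x: Sequence[bool]) -> int:
    """Return the label of the first rule whose literal is satisfied by x."""
    for index, required, output in rules:
        if x[index] == required:
            return output
    # No literal fired, so fall through to the default label.
    return default


# Illustrative list of length k = 3 over n = 8 variables:
# "if x2 then +1, else if not x5 then -1, else if x0 then +1, else -1".
rules: List[Rule] = [(2, True, +1), (5, False, -1), (0, True, +1)]

x = [True, True, False, True, True, True, False, False]
label = eval_decision_list(rules, -1, x)  # the third rule fires
```

Only the k = 3 variables named in the rules influence the output; the other five are irrelevant, which is the setting in which a sample bound of poly(k, log n) rather than poly(n) is of interest.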

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Learning Types > Ensemble Learning
Machine Learning > Learning Types > Classification

Keywords

PAC learning, smooth boosting, attribute-efficient learning, boolean function, decision list, linear threshold function, boolean variable
