The Feature Selection Path in Kernel Methods

Fuxin Li; Cristian Sminchisescu

2010 AISTATS AISTATS 2010

The Feature Selection Path in Kernel Methods

Abstract

The problem of automatic feature selection/weighting in kernel methods is examined. We work on a formulation that optimizes both the weights of features and the parameters of the kernel model simultaneously, using $L_1$ regularization for feature selection. Under quite general choices of kernels, we prove that there exists a unique regularization path for this problem, that runs from 0 to a stationary point of the non-regularized problem. We propose an ODE-based homotopy method to follow this trajectory. By following the path, our algorithm is able to automatically discard irrelevant features and to automatically go back and forth to avoid local optima. Experiments on synthetic and real datasets show that the method achieves low prediction error and is efficient in separating relevant from irrelevant features.

🚀 Conference Pioneer — AISTATS 2010

🧭 Keyword Pioneer — homotopy continuation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐣 Hot Topic Early Bird — feature selection

Authors

Fuxin Li , Cristian Sminchisescu

Topics

Machine Learning > Core Methods > Regression Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Core Methods > Feature Selection Machine Learning > Core Methods > Kernel Methods Mathematics & Optimization > Optimization > Sparse Optimization

Keywords

feature selection l1 regularization sparse optimization regularization path homotopy method homotopy continuation kernel methods

Download PDF

Related papers

Towards Understanding Situated Natural Language 2010

Mass Fatality Incident Identification based on nuclear DNA evidence 2010

Locally Linear Denoising on Image Manifolds 2010

Negative Results for Active Learning with Convex Losses 2010

Collaborative Filtering on a Budget 2010