A Compression Approach to Support Vector Model Selection

Ulrike von Luxburg; Olivier Bousquet; Bernhard Schölkopf

JMLR 2004

Abstract

In this paper we investigate connections between statistical learning theory and data compression on the basis of support vector machine (SVM) model selection. Inspired by several generalization bounds, we construct "compression coefficients" for SVMs which measure the amount by which the training labels can be compressed by a code built from the separating hyperplane. The main idea is to relate the coding precision to geometrical concepts such as the width of the margin or the shape of the data in the feature space. The resulting compression coefficients combine well-known quantities such as the radius-margin term R²/ρ², the eigenvalues of the kernel matrix, and the number of support vectors. To test whether they are useful in practice, we ran model selection experiments on benchmark data sets. We found that compression coefficients can fairly accurately predict the parameters for which the test error is minimized.
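
As a rough illustration of two of the quantities named in the abstract, the sketch below computes a radius-margin term R²/ρ² and a support vector count from a kernel matrix. It is not the paper's compression coefficient. The function name `radius_margin_summary`, the tolerance `tol`, and the toy data are hypothetical. The dual coefficients `alpha` are assumed to come from an already trained hard-margin SVM, and R is taken as the largest feature-space distance to the data centroid, which only upper-bounds the radius of the smallest enclosing sphere. The kernel eigenvalues are left out because they need a linear algebra library.

```python
def radius_margin_summary(K, y, alpha, tol=1e-12):
    """Illustrative radius-margin term from a kernel matrix (not the paper's code).

    K     : n x n kernel matrix as a list of lists
    y     : labels in {+1, -1}
    alpha : dual coefficients of a trained SVM (assumed given)
    """
    n = len(y)

    # Squared norm of the weight vector in feature space:
    # ||w||^2 = sum_ij alpha_i alpha_j y_i y_j K_ij
    w_sq = sum(alpha[i] * alpha[j] * y[i] * y[j] * K[i][j]
               for i in range(n) for j in range(n))
    if w_sq <= 0:
        raise ValueError("degenerate solution: ||w||^2 must be positive")

    # Margin rho = 1 / ||w|| for the canonical hyperplane.
    rho_sq = 1.0 / w_sq

    # Squared distance of each point to the feature-space centroid:
    # K_ii - (2/n) sum_j K_ij + (1/n^2) sum_jk K_jk
    grand_mean = sum(sum(row) for row in K) / (n * n)
    r_sq = max(K[i][i] - 2.0 * sum(K[i]) / n + grand_mean for i in range(n))

    # Support vectors are the points with non-zero dual coefficient.
    n_sv = sum(1 for a in alpha if a > tol)

    return {"R2": r_sq, "rho2": rho_sq, "R2_over_rho2": r_sq / rho_sq, "n_sv": n_sv}


# Toy example: two points at +1 and -1 on the real line with a linear kernel.
K = [[1.0, -1.0], [-1.0, 1.0]]
y = [1, -1]
alpha = [0.5, 0.5]  # hard-margin solution for this toy problem
summary = radius_margin_summary(K, y, alpha)
print(summary)
```

In a model selection loop, one would evaluate such a quantity for each candidate kernel parameter and compare the values; how the paper combines them into compression coefficients is not specified in this abstract.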

Topics

Machine Learning > Core Methods > Classification

Machine Learning > Core Methods > Support Vector Machine

Machine Learning > Optimization & Theory > Learning Theory

Machine Learning > Optimization & Theory > Statistical Learning

Machine Learning > Optimization & Theory > Information Theory

Keywords

model selection; statistical learning theory; data compression; kernel matrix; support vector machine; generalization bound

Related papers

Selective Rademacher Penalization and Reduced Error Pruning of Decision Trees 2004

Fast String Kernels using Inexact Matching for Protein Sequences 2004

Learning the Kernel Matrix with Semidefinite Programming 2004

Weather Data Mining Using Independent Component Analysis 2004

A Geometric Approach to Multi-Criterion Reinforcement Learning 2004