Spherical Random Features for Polynomial Kernels

Jeffrey Pennington; Felix Xinnan X Yu; Sanjiv Kumar

2015 NIPS NeurIPS 2015

Spherical Random Features for Polynomial Kernels

Abstract

Compact explicit feature maps provide a practical framework to scale kernel methods to large-scale learning, but deriving such maps for many types of kernels remains a challenging open problem. Among the commonly used kernels for nonlinear classification are polynomial kernels, for which low approximation error has thus far necessitated explicit feature maps of large dimensionality, especially for higher-order polynomials. Meanwhile, because polynomial kernels are unbounded, they are frequently applied to data that has been normalized to unit l2 norm. The question we address in this work is: if we know a priori that data is so normalized, can we devise a more compact map? We show that a putative affirmative answer to this question based on Random Fourier Features is impossible in this setting, and introduce a new approximation paradigm, Spherical Random Fourier (SRF) features, which circumvents these issues and delivers a compact approximation to polynomial kernels for data on the unit sphere. Compared to prior work, SRF features are less rank-deficient, more compact, and achieve better kernel approximation, especially for higher-order polynomials. The resulting predictions have lower variance and typically yield better classification accuracy.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

📈 Trend Setter — Kernel Methods

🧭 Keyword Pioneer — spherical random feature

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jeffrey Pennington , Felix Xinnan X Yu , Sanjiv Kumar

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Mathematics > Linear Algebra Machine Learning > Core Methods > Kernel Methods Machine Learning > Bayesian & Probabilistic > Kernel Methods

Keywords

kernel approximation nonlinear classification feature map random fourier feature polynomial kernel kernel methods spherical random feature

Download PDF

Related papers

Data Generation as Sequential Decision Making 2015

A Recurrent Latent Variable Model for Sequential Data 2015

Combinatorial Cascading Bandits 2015

Accelerated Mirror Descent in Continuous and Discrete Time 2015

Matrix Completion with Noisy Side Information 2015