Divide and Conquer Kernel Ridge Regression: A Distributed Algorithm with Minimax Optimal Rates

Yuchen Zhang; John Duchi; Martin Wainwright

JMLR 2015

Abstract

We study a decomposition-based scalable approach to kernel ridge regression, and show that it achieves minimax optimal convergence rates under relatively mild conditions. The method is simple to describe: it randomly partitions a dataset of size $N$ into $m$ subsets of equal size, computes an independent kernel ridge regression estimator for each subset using a careful choice of the regularization parameter, then averages the local solutions into a global predictor. This partitioning leads to a substantial reduction in computation time versus the standard approach of performing kernel ridge regression on all $N$ samples. Our two main theorems establish that despite the computational speed-up, statistical optimality is retained: as long as $m$ is not too large, the partition-based estimator achieves the statistical minimax rate over all estimators using the set of $N$ samples. As concrete examples, our theory guarantees that the number of subsets $m$ may grow nearly linearly for finite-rank or Gaussian kernels and polynomially in $N$ for Sobolev spaces, which in turn allows for substantial reductions in computational cost. We conclude with experiments on both simulated data and a music-prediction task that complement our theoretical results, exhibiting the computational and statistical benefits of our approach.
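The three steps named in the abstract (random partition, local kernel ridge regression, averaging) can be illustrated with a minimal NumPy sketch. This is not the authors' code. The Gaussian kernel, the function names (`gaussian_kernel`, `fit_krr`, `fit_divide_and_conquer`, `predict`), and the parameters `lam`, `bandwidth`, and `seed` are assumptions made for illustration. The abstract only says the regularization parameter is chosen carefully, so `lam` is left to the caller here.

```python
import numpy as np

def gaussian_kernel(A, B, bandwidth=1.0):
    # Pairwise squared distances, clipped at 0 to guard against round-off.
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def fit_krr(X, y, lam, bandwidth=1.0):
    # Kernel ridge regression on one subset: solve (K + lam * n * I) alpha = y.
    n = X.shape[0]
    K = gaussian_kernel(X, X, bandwidth)
    return np.linalg.solve(K + lam * n * np.eye(n), y)

def fit_divide_and_conquer(X, y, m, lam, bandwidth=1.0, seed=0):
    # Step 1: randomly partition the N sample indices into m subsets.
    rng = np.random.default_rng(seed)
    parts = np.array_split(rng.permutation(X.shape[0]), m)
    # Step 2: fit an independent local estimator on each subset.
    return [(X[idx], fit_krr(X[idx], y[idx], lam, bandwidth)) for idx in parts]

def predict(models, X_new, bandwidth=1.0):
    # Step 3: average the local predictions into a global predictor.
    preds = [gaussian_kernel(X_new, Xi, bandwidth) @ alpha for Xi, alpha in models]
    return np.mean(preds, axis=0)

# Usage on synthetic data (values chosen only for illustration).
rng = np.random.default_rng(1)
X = rng.uniform(0.0, 1.0, size=(200, 1))
y = np.sin(2.0 * np.pi * X[:, 0]) + 0.1 * rng.standard_normal(200)
models = fit_divide_and_conquer(X, y, m=4, lam=1e-3, bandwidth=0.2)
X_test = np.linspace(0.05, 0.95, 20)[:, None]
y_hat = predict(models, X_test, bandwidth=0.2)
```

With a dense solver, each local fit costs on the order of $(N/m)^3$ operations rather than $N^3$ for the full problem, which is the source of the speed-up the abstract describes. The local fits are independent, so they could also be run in parallel.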

Authors

Yuchen Zhang, John Duchi, Martin Wainwright

Topics

Machine Learning > Core Methods > Regression
Machine Learning > Optimization & Theory > Distributed Learning
Machine Learning > Optimization & Theory > Optimization

Keywords

distributed learning, kernel ridge regression, divide and conquer, minimax optimal rate, scalable algorithm

Related papers

The Sample Complexity of Learning Linear Predictors with the Squared Loss 2015

Preface to this Special Issue 2015

Fast Cross-Validation via Sequential Testing 2015

Online Tensor Methods for Learning Latent Variable Models 2015

CEKA: A Tool for Mining the Wisdom of Crowds 2015