Leverage Score Sampling for Faster Accelerated Regression and ERM

Naman Agarwal; Sham Kakade; Rahul Kidambi; Yin-Tat Lee; Praneeth Netrapalli; Aaron Sidford

2020 ALT ALT 2020

Leverage Score Sampling for Faster Accelerated Regression and ERM

Abstract

Given a matrix $\mathbf{A}\in\R^{n\times d}$ and a vector $b\in\R^{d}$, we show how to compute an $\epsilon$-approximate solution to the regression problem $ \min_{x\in\R^{d}}\frac{1}{2} \norm{\mathbf{A} x-b}_{2}^{2} $ in time $ \widetilde{O} ((n+\sqrt{d\cdot\kappa_{\text{sum}}}) s \log\epsilon^{-1}) $ where $\kappa_{\text{sum}}=\tr\left(\mathbf{A}^{\top}\mathbf{A}\right)/\lambda_{\min}(\mathbf{A}^{\top}\mathbf{A})$ and $s$ is the maximum number of non-zero entries in a row of $\mathbf{A}$. This improves upon the previous best running time of $ \widetilde{O} ((n+\sqrt{n \cdot\kappa_{\text{sum}}}) s \log\epsilon^{-1})$. We achieve our result through an interesting combination of leverage score sampling, proximal point methods, and accelerated coordinate descent methods. Further, we show that our method not only matches the performance of previous methods up to polylogarithmic factors, but further improves whenever leverage scores of rows are small. We also provide a non-linear generalization of these results that improves the running time for solving a broader class of ERM problems and expands the set of ERM problems provably solvable in nearly linear time.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Naman Agarwal , Sham Kakade , Rahul Kidambi , Yin-Tat Lee , Praneeth Netrapalli , Aaron Sidford

Topics

Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

empirical risk minimization leverage score sampling proximal point method matrix regression accelerated coordinate descent

Download PDF

Related papers

On Learnability wih Computable Learners 2020

Interactive Learning of a Dynamic Structure 2020

A Non-Trivial Algorithm Enumerating Relevant Features over Finite Fields 2020

Finding Robust Nash equilibria 2020

Approximate Representer Theorems in Non-reflexive Banach Spaces 2020