Learning Linear-Quadratic Regulators Efficiently with only $\sqrtT$ Regret

Alon Cohen; Tomer Koren; Yishay Mansour

2019 ICML ICML 2019

Learning Linear-Quadratic Regulators Efficiently with only $\sqrtT$ Regret

Abstract

We present the first computationally-efficient algorithm with $\widetilde{O}(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. By that, we resolve an open question of Abbasi-Yadkori and Szepesvari (2011) and Dean,Mania, Matni, Recht, and Tu (2018).

🌉 Interdisciplinary Bridge — Artificial Intelligence and Mathematics & Optimization

🐣 Hot Topic Early Bird — optimal control

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Alon Cohen , Tomer Koren , Yishay Mansour

Topics

Artificial Intelligence > Core AI > Agent Systems Mathematics & Optimization > Optimization > Online Algorithms

Keywords

reinforcement learning online learning optimal control linear quadratic regulator regret bound

Download PDF

Related papers

Bayesian leave-one-out cross-validation for large data 2019

A Block Coordinate Descent Proximal Method for Simultaneous Filtering and Parameter Estimation 2019

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks 2019

Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously 2019

Improved Convergence for $\ell_1$ and $\ell_∞$ Regression via Iteratively Reweighted Least Squares 2019