An Online Learning Approach to Model Predictive Control

Nolan Wagener; Ching-An Cheng; Jacob Sacks; Byron Boots

2019 RSS RSS 2019

An Online Learning Approach to Model Predictive Control

Abstract

Model predictive control (MPC) is a powerful technique for solving dynamic control tasks. In this paper, we show that there exists a close connection between MPC and online learning, an abstract theoretical framework for analyzing online decision making in the optimization literature. This new perspective provides a foundation for leveraging powerful online learning algorithms to design MPC algorithms. Specifically, we propose a new algorithm based on dynamic mirror descent (DMD), an online learning algorithm that is designed for non-stationary setups. Our algorithm, Dynamic Mirror Descent Model Predictive Control (DMD-MPC), represents a general family of MPC algorithms that includes many existing techniques as special instances. DMD-MPC also provides a fresh perspective on previous heuristics used in MPC and suggests a principled way to design new MPC algorithms. In the experimental section of this paper, we demonstrate the flexibility of DMD-MPC, presenting a set of new MPC algorithms on a simple simulated cartpole and a simulated and real-world aggressive driving task. A video of the real-world experiment can be found at https://youtu.be/vZST3v0_S9w.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — non-stationary optimization

🐣 Hot Topic Early Bird — decision making

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Nolan Wagener , Ching-An Cheng , Jacob Sacks , Byron Boots

Topics

Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Online Algorithms

Keywords

online learning decision making model predictive control non-stationary optimization dynamic mirror descent control optimization

Download PDF

Related papers

Online Incremental Learning of the Terrain Traversal Cost in Autonomous Exploration 2019

A 2-Approximation Algorithm for the Online Tethered Coverage Problem 2019

End-To-End Robotic Reinforcement Learning without Reward Engineering 2019

TossingBot: Learning to Throw Arbitrary Objects with Residual Physics 2019

Value Iteration Networks on Multiple Levels of Abstraction 2019