Abstraction-based branch and bound approach to Q-learning for hybrid optimal control

Benoît Legat; Raphaël M. Jungers; Jean Bouchat

L4DC 2021

Abstract

In this paper, we design a theoretical framework for applying model predictive control to hybrid systems. To this end, we develop a theory of approximate dynamic programming that leverages the concept of alternating simulation. We show how to combine these notions in a branch and bound algorithm that can further refine the Q-functions using Lagrangian duality. We illustrate the approach on a numerical example.

Authors

Benoît Legat, Raphaël M. Jungers, Jean Bouchat

Topics

Machine Learning > Optimization & Theory > Optimization

Reinforcement Learning > Methods > Deep RL

Robotics > Systems > Control Systems

Mathematics & Optimization > Optimization > Optimal Control

Keywords

reinforcement learning, branch and bound algorithm, model predictive control, branch and bound, optimal control, approximate dynamic programming, hybrid system, hybrid optimal control, alternating simulation

Related papers

Data-driven design of switching reference governors for brake-by-wire applications 2021

Learning local modules in dynamic networks 2021

Certainty Equivalent Perception-Based Control 2021

Sample Complexity of Linear Quadratic Gaussian (LQG) Control for Output Feedback Systems 2021

Provably Sample Efficient Reinforcement Learning in Competitive Linear Quadratic Systems 2021