LEOC: A Principled Method in Integrating Reinforcement Learning and Classical Control Theory

Naifu Zhang; Nicholas Capel

2021 L4DC L4DC 2021

LEOC: A Principled Method in Integrating Reinforcement Learning and Classical Control Theory

Abstract

There have been attempts in reinforcement learning to exploit a priori knowledge about the structure of the system. This paper proposes a hybrid reinforcement learning controller which dynamically interpolates a model-based linear controller and an arbitrary differentiable policy. The linear controller is designed based on local linearised model knowledge, and stabilises the system in a neighbourhood about an operating point. The coefficients of interpolation between the two controllers are determined by a scaled distance function measuring the distance between the current state and the operating point. The overall hybrid controller is proven to maintain the stability guarantee around the neighborhood of the operating point and still possess the universal function approximation property of the arbitrary non-linear policy. Learning has been done on both model-based (PILCO) and model-free (DDPG) frameworks. Simulation experiments performed in OpenAI gym demonstrate stability and robustness of the proposed hybrid controller. This paper thus introduces a principled method allowing for the direct importing of control methodology into reinforcement learning.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning and Robotics

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🧭 Keyword Pioneer — classical control theory

Authors

Naifu Zhang , Nicholas Capel

Topics

Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics Robotics > Systems > Control Systems Machine Learning > Learning Types > Reinforcement Learning Artificial Intelligence > Core AI > Robotics

Keywords

reinforcement learning policy learning linear quadratic regulator model-based control stability guarantee hybrid control hybrid controller classical control theory linear controller

Download PDF

Related papers

Abstraction-based branch and bound approach to Q-learning for hybrid optimal control 2021

Data-driven design of switching reference governors for brake-by-wire applications 2021

Learning local modules in dynamic networks 2021

Certainty Equivalent Perception-Based Control 2021

Sample Complexity of Linear Quadratic Gaussian (LQG) Control for Output Feedback Systems 2021