A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control

Leilei Cui; Tamer Basar; Zhong-Ping Jiang

L4DC 2023

Abstract

In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by the sim-to-real gap. Under the formulation of risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to iteratively approximate the robust and optimal controller. The convergence and robustness of the algorithm are rigorously analyzed. It is shown that the algorithm converges uniformly to the optimal solution. In addition, by invoking the concept of small-disturbance input-to-state stability, it is guaranteed that the algorithm still converges to a neighborhood of the optimal solution when it is subject to a sufficiently small disturbance at each step. When the system matrices are unknown, a learning-based off-policy policy optimization algorithm is proposed for the same class of linear systems with additive Gaussian noise. A numerical simulation demonstrates the efficacy of the proposed algorithm.
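As a rough illustration of what a dual-loop policy optimization can look like, the following is a minimal model-based sketch. It is not the authors' algorithm. It assumes the standard link between risk-sensitive LQG control and a zero-sum linear quadratic game with dynamics x_{k+1} = A x_k + B u_k + D w_k and stage cost x'Qx + u'Ru - gamma^2 w'w. The outer loop improves the controller gain K (u = -Kx), and the inner loop finds the worst-case disturbance gain L (w = Lx) for the current K. All matrices, the value of gamma, the iteration counts, and the function names are hypothetical choices made for this example.

```python
import numpy as np

def solve_lyapunov(M, W):
    """Solve P = M^T P M + W for P (assumes M is Schur stable)."""
    n = M.shape[0]
    # Row-major vectorization: vec(M^T P M) = kron(M^T, M^T) vec(P).
    lhs = np.eye(n * n) - np.kron(M.T, M.T)
    return np.linalg.solve(lhs, W.reshape(-1)).reshape(n, n)

def worst_case_term(P, D, gamma):
    """Return (gamma^2 I - D^T P D)^{-1} D^T P, used in both loops."""
    q = D.shape[1]
    return np.linalg.solve(gamma**2 * np.eye(q) - D.T @ P @ D, D.T @ P)

def inner_loop(A_K, Q_K, D, gamma, iters=30):
    """For a fixed controller, iterate on the disturbance gain L."""
    n, q = D.shape
    L = np.zeros((q, n))
    for _ in range(iters):
        # Policy evaluation for the pair (K, L).
        P = solve_lyapunov(A_K + D @ L, Q_K - gamma**2 * L.T @ L)
        # Policy improvement for the maximizing disturbance.
        L = worst_case_term(P, D, gamma) @ A_K
    return P, L

def dual_loop(A, B, D, Q, R, gamma, K0, outer_iters=30, inner_iters=30):
    """Outer loop on the controller gain K, inner loop on L."""
    K = K0
    for _ in range(outer_iters):
        P, L = inner_loop(A - B @ K, Q + K.T @ R @ K, D, gamma, inner_iters)
        # Cost-to-go matrix after accounting for the worst-case disturbance.
        U = P + P @ D @ worst_case_term(P, D, gamma)
        # Policy improvement for the minimizing controller.
        K = np.linalg.solve(R + B.T @ U @ B, B.T @ U @ A)
    return K, L, P

# Hypothetical example system; A is Schur stable, so K0 = 0 is admissible here.
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
D = np.array([[0.1], [0.1]])
Q, R, gamma = np.eye(2), np.eye(1), 5.0
K, L, P = dual_loop(A, B, D, Q, R, gamma, K0=np.zeros((1, 2)))
print("controller gain K:", K)
print("worst-case disturbance gain L:", L)
```

The sketch uses known system matrices throughout. The learning-based off-policy variant described in the abstract would replace the model-based evaluation steps with estimates obtained from data, which is not attempted here.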

Authors

Leilei Cui, Tamer Basar, Zhong-Ping Jiang

Topics

Machine Learning > Optimization & Theory > Optimization
Reinforcement Learning > Methods > Policy Learning
Reinforcement Learning > Applications > Robotics

Keywords

reinforcement learning, policy optimization, risk-sensitive control, robust control, linear quadratic Gaussian

Related papers

Model-Based Reinforcement Learning for Cavity Filter Tuning 2023

Learning on Manifolds: Universal Approximations Properties using Geometric Controllability Conditions for Neural ODEs 2023

Hyperparameter Tuning of an Off-Policy Reinforcement Learning Algorithm for H∞ Tracking Control 2023

Policy Learning for Active Target Tracking over Continuous $SE(3)$ Trajectories 2023

Automated Reachability Analysis of Neural Network-Controlled Systems via Adaptive Polytopes 2023