Variational Inference MPC for Bayesian Model-based Reinforcement Learning

Masashi Okada; Tadahiro Taniguchi

2019 CORL CoRL 2019

Variational Inference MPC for Bayesian Model-based Reinforcement Learning

Abstract

In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy to enhance learning performance, making MBRLs competitive to cutting-edge modelfree methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which employs Bayesian inference to dynamics modeling and model predictive control (MPC) with stochastic optimization via the cross entropy method (CEM). In this paper, we propose a novel extension to the uncertainty-aware MBRL. Our main contributions are twofold: Firstly, we introduce a variational inference MPC (VI-MPC), which reformulates various stochastic methods, including CEM, in a Bayesian fashion. Secondly, we propose a novel instance of the framework, called probabilistic action ensembles with trajectory sampling (PaETS). As a result, our Bayesian MBRL can involve multimodal uncertainties both in dynamics and optimal trajectories. In comparison to PETS, our method consistently improves asymptotic performance on several challenging locomotion tasks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🧭 Keyword Pioneer — cross entropy method

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

🐣 Hot Topic Early Bird — model predictive control

Authors

Masashi Okada , Tadahiro Taniguchi

Topics

Artificial Intelligence > Core AI > Agent Systems Reinforcement Learning > Methods > Deep RL Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Bayesian & Probabilistic > Bayesian Inference Machine Learning > Bayesian & Probabilistic > Variational Inference

Keywords

variational inference bayesian inference model predictive control model-based reinforcement learning cross entropy method trajectory sampling probabilistic ensemble

Download PDF

Related papers

On-Policy Robot Imitation Learning from a Converging Supervisor 2019

Learning by Cheating 2019

Object-centric Forward Modeling for Model Predictive Control 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real 2019

Combining Deep Learning and Verification for Precise Object Instance Detection 2019