Model-Based Reinforcement Learning via Latent-Space Collocation

Oleh Rybkin; Chuning Zhu; Anusha Nagabandi; Kostas Daniilidis; Igor Mordatch; Sergey Levine

2021 ICML ICML 2021

Model-Based Reinforcement Learning via Latent-Space Collocation

Abstract

The ability to plan into the future while utilizing only raw high-dimensional observations, such as images, can provide autonomous agents with broad and general capabilities. However, realistic tasks require performing temporally extended reasoning, and cannot be solved with only myopic, short-sighted planning. Recent work in model-based reinforcement learning (RL) has shown impressive results on tasks that require only short-horizon reasoning. In this work, we study how the long-horizon planning abilities can be improved with an algorithm that optimizes over sequences of states, rather than actions, which allows better credit assignment. To achieve this, we draw on the idea of collocation and adapt it to the image-based setting by leveraging probabilistic latent variable models, resulting in an algorithm that optimizes trajectories over latent variables. Our latent collocation method (LatCo) provides a general and effective visual planning approach, and significantly outperforms prior model-based approaches on challenging visual control tasks with sparse rewards and long-term goals. See the videos on the supplementary website \url{https://sites.google.com/view/latco-mbrl/.}

🌉 Interdisciplinary Bridge — Deep Learning and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Oleh Rybkin , Chuning Zhu , Anusha Nagabandi , Kostas Daniilidis , Igor Mordatch , Sergey Levine

Topics

Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics Deep Learning > Learning Types > Reinforcement Learning

Keywords

trajectory optimization model-based reinforcement learning latent variable model latent space visual planning latent space planning visual control

Download PDF

Related papers

GRAND: Graph Neural Diffusion 2021

Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits 2021

Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation 2021

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution 2021

Dataset Dynamics via Gradient Flows in Probability Space 2021