Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

Yevgen Chebotar; Karol Hausman; Yao Lu; Ted Xiao; Dmitry Kalashnikov; Jacob Varley; Alex Irpan; Benjamin Eysenbach; Ryan C Julian; Chelsea Finn; Sergey Levine

2021 ICML ICML 2021

Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills

Abstract

We consider the problem of learning useful robotic skills from previously collected offline data without access to manually specified rewards or additional online exploration, a setting that is becoming increasingly important for scaling robot learning by reusing past robotic data. In particular, we propose the objective of learning a functional understanding of the environment by learning to reach any goal state in a given dataset. We employ goal-conditioned Q-learning with hindsight relabeling and develop several techniques that enable training in a particularly challenging offline setting. We find that our method can operate on high-dimensional camera images and learn a variety of skills on real robots that generalize to previously unseen scenes and objects. We also show that our method can learn to reach long-horizon goals across multiple episodes through goal chaining, and learn rich representations that can help with downstream tasks through pre-training or auxiliary objectives.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — hindsight relabeling

🐣 Hot Topic Early Bird — offline reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Yevgen Chebotar , Karol Hausman , Yao Lu , Ted Xiao , Dmitry Kalashnikov , Jacob Varley , Alex Irpan , Benjamin Eysenbach , Ryan C Julian , Chelsea Finn , Sergey Levine

Topics

Machine Learning > Learning Types > Unsupervised Learning Reinforcement Learning > Methods > Offline RL Reinforcement Learning > Applications > Robotics Robotics > Capabilities > Manipulation Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Offline RL

Keywords

unsupervised learning representation learning offline reinforcement learning offline learning goal-conditioned reinforcement learning goal-conditioned learning hindsight relabeling goal reaching robotic skill functional understanding unsupervised offline reinforcement learning goal-conditioned q-learning

Download PDF

Related papers

GRAND: Graph Neural Diffusion 2021

Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits 2021

Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation 2021

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution 2021

Dataset Dynamics via Gradient Flows in Probability Space 2021