Entity Abstraction in Visual Model-Based Reinforcement Learning

Rishi Veerapaneni; John D. Co-Reyes; Michael Chang; Michael Janner; Chelsea Finn; Jiajun Wu; Joshua Tenenbaum; Sergey Levine

2019 CORL CoRL 2019

Entity Abstraction in Visual Model-Based Reinforcement Learning

Abstract

We present OP3, a framework for model-based reinforcement learning that acquires object representations from raw visual observations without supervision and uses them to predict and plan. To ground these abstract representations of entities to actual objects in the world, we formulate an interactive inference algorithm which incorporates dynamic information in the scene. Our model can handle a variable number of entities by symmetrically processing each object representation with the same locally-scoped function. On block-stacking tasks, OP3 can generalize to novel block configurations and more objects than seen during training, outperforming both a model that assumes access to object supervision and a state-of-the-art video prediction model.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Reinforcement Learning

🧭 Keyword Pioneer — visual model-based reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Knowledge & Reasoning, Machine Learning, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Rishi Veerapaneni , John D. Co-Reyes , Michael Chang , Michael Janner , Chelsea Finn , Jiajun Wu , Joshua Tenenbaum , Sergey Levine

Topics

Artificial Intelligence > Core AI > Multimodal Learning Reinforcement Learning > Methods > Deep RL Computer Vision > Core AI > Computer Vision

Keywords

model-based planning visual observation object representation visual model-based reinforcement learning entity abstraction block stacking

Download PDF

Related papers

On-Policy Robot Imitation Learning from a Converging Supervisor 2019

Learning by Cheating 2019

Object-centric Forward Modeling for Model Predictive Control 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real 2019

Combining Deep Learning and Verification for Precise Object Instance Detection 2019