Reverse Curriculum Generation for Reinforcement Learning

Carlos Florensa; David Held; Markus Wulfmeier; Michael Zhang; Pieter Abbeel

CoRL 2017

Abstract

Many relevant tasks require an agent to reach a certain state, or to manipulate objects into a desired configuration. For example, we might want a robot to align and assemble a gear onto an axle or insert and turn a key in a lock. These goal-oriented tasks present a considerable challenge for reinforcement learning, since their natural reward function is sparse and prohibitive amounts of exploration are required to reach the goal and receive some learning signal. Past approaches tackle these problems by exploiting expert demonstrations or by manually designing a task-specific reward shaping function to guide the learning agent. Instead, we propose a method to learn these tasks without requiring any prior knowledge other than obtaining a single state in which the task is achieved. The robot is trained in “reverse”, gradually learning to reach the goal from a set of starting positions increasingly far from the goal. Our method automatically generates a curriculum of starting positions that adapts to the agent’s performance, leading to efficient training on goal-oriented tasks. We demonstrate our approach on difficult simulated navigation and fine-grained manipulation problems, not solvable by state-of-the-art reinforcement learning methods.
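The idea of training in reverse can be illustrated with a toy sketch. This is not the authors' implementation: the 1-D chain environment, the `ToyAgent` stand-in for a learned policy, the random-walk expansion of start states, and the `low`/`high` success thresholds are all assumptions made for illustration. The sketch only shows the loop the abstract describes: begin at the goal state, propose nearby starts, train on the starts of intermediate difficulty, and repeat.

```python
import random

GOAL = 0          # the single known state in which the task is achieved
N_STATES = 30     # hypothetical toy 1-D chain: states 0 .. N_STATES - 1


def neighbors(state):
    """States reachable in one step on the chain (clamped at both ends)."""
    return [max(0, state - 1), min(N_STATES - 1, state + 1)]


class ToyAgent:
    """Stand-in for an RL policy: solves the task from within `reach` steps."""

    def __init__(self):
        self.reach = 0

    def success_rate(self, start):
        distance = abs(start - GOAL)
        if distance <= self.reach:
            return 1.0      # already mastered
        if distance == self.reach + 1:
            return 0.5      # sometimes succeeds, so there is a learning signal
        return 0.0          # too far: the sparse reward is never observed

    def train(self, starts):
        # Pretend that practising from frontier starts extends competence.
        if any(abs(s - GOAL) == self.reach + 1 for s in starts):
            self.reach += 1


def expand_starts(starts, n_walks, walk_len, rng):
    """Propose new starts via short random walks from the current ones."""
    pool = sorted(starts)
    proposals = set(pool)
    for _ in range(n_walks):
        state = rng.choice(pool)
        for _ in range(walk_len):
            state = rng.choice(neighbors(state))
            proposals.add(state)
    return proposals


def select_starts(candidates, agent, low=0.1, high=0.9):
    """Keep starts that are neither hopeless nor already mastered."""
    return {s for s in candidates if low <= agent.success_rate(s) <= high}


def reverse_curriculum(iterations=15, seed=0):
    """Run the curriculum loop and return the agent's reach per iteration."""
    rng = random.Random(seed)
    agent = ToyAgent()
    starts = {GOAL}
    history = []
    for _ in range(iterations):
        candidates = expand_starts(starts, n_walks=20, walk_len=3, rng=rng)
        agent.train(select_starts(candidates, agent))
        # Seed the next round from starts the agent can now sometimes solve.
        starts = {s for s in candidates if agent.success_rate(s) >= 0.1} | {GOAL}
        history.append(agent.reach)
    return history


if __name__ == "__main__":
    print(reverse_curriculum())
```

In this sketch the set of start states drifts away from the goal only as fast as the agent's competence grows, which is the adaptive behavior the abstract attributes to the curriculum. A real setup would replace `ToyAgent` with a policy trained by a reinforcement learning algorithm and estimate success rates from rollouts.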

Authors

Carlos Florensa, David Held, Markus Wulfmeier, Michael Zhang, Pieter Abbeel

Topics

Reinforcement Learning > Methods > Policy Learning
Robotics > Capabilities > Manipulation
Machine Learning > Learning Types > Reinforcement Learning
Machine Learning > Learning Types > Curriculum Learning
Machine Learning > Learning Paradigms > Curriculum Learning

Keywords

reinforcement learning, curriculum learning, robot manipulation, sparse reward, goal-conditioned learning, goal-oriented task

Related papers

CORe50: a New Dataset and Benchmark for Continuous Object Recognition 2017

Active Incremental Learning of Robot Movement Primitives 2017

Efficient Automatic Perception System Parameter Tuning On Site without Expert Supervision 2017

Opportunistic Active Learning for Grounding Natural Language Descriptions 2017

Adaptable Pouring: Teaching Robots Not to Spill using Fast but Approximate Fluid Simulation 2017