Policies Modulating Trajectory Generators

Atil Iscen; Ken Caluwaerts; Jie Tan; Tingnan Zhang; Erwin Coumans; Vikas Sindhwani; Vincent Vanhoucke

2018 CORL CoRL 2018

Policies Modulating Trajectory Generators

Abstract

We propose an architecture for learning complex controllable behaviors by having simple Policies Modulate Trajectory Generators (PMTG), a powerful combination that can provide both memory and prior knowledge to the controller. The result is a flexible architecture that is applicable to a class of problems with periodic motion for which one has an insight into the class of trajectories that might lead to a desired behavior. We illustrate the basics of our architecture using a synthetic control problem, then go on to learn speed-controlled locomotion for a quadrupedal robot by using Deep Reinforcement Learning and Evolutionary Strategies. We demonstrate that a simple linear policy, when paired with a parametric Trajectory Generator for quadrupedal gaits, can induce walking behaviors with controllable speed from 4-dimensional IMU observations alone, and can be learned in under 1000 rollouts. We also transfer these policies to a real robot and show locomotion with controllable forward velocity.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning

📈 Trend Setter — Reinforcement Learning

🧭 Keyword Pioneer — trajectory generator

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Atil Iscen , Ken Caluwaerts , Jie Tan , Tingnan Zhang , Erwin Coumans , Vikas Sindhwani , Vincent Vanhoucke

Topics

Reinforcement Learning Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics Artificial Intelligence > Core AI > Robotics Artificial Intelligence > Core AI > Reinforcement Learning

Keywords

reinforcement learning policy learning quadrupedal robot trajectory generator policy modulation

Download PDF

Related papers

Batch Active Preference-Based Learning of Reward Functions 2018

Personalized Dynamics Models for Adaptive Assistive Navigation Systems 2018

Neural Modular Control for Embodied Question Answering 2018

Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents 2018

Deep Drone Racing: Learning Agile Flight in Dynamic Environments 2018