Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

Chenhao Li; Marin Vlastelica; Sebastian Blaes; Jonas Frey; Felix Grimminger; Georg Martius

2022 CORL CoRL 2022

Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

Abstract

Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method for inferring reward functions from partial and potentially physically incompatible demonstrations for successful skill acquirement where reference or expert demonstrations are not easily accessible. Moreover, we show that by using a Wasserstein GAN formulation and transitions from demonstrations with rough and partial information as input, we are able to extract policies that are robust and capable of imitating demonstrated behaviors. Finally, the obtained skills such as a backflip are tested on an agile quadruped robot called Solo 8 and present faithful replication of hand-held human demonstrations.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — partial demonstration

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Chenhao Li , Marin Vlastelica , Sebastian Blaes , Jonas Frey , Felix Grimminger , Georg Martius

Topics

Machine Learning > Learning Types > Adversarial Learning Reinforcement Learning > Applications > Robotics

Keywords

reward inference wasserstein gan quadruped robot skill acquisition adversarial imitation partial demonstration

Download PDF

Related papers

One-Shot Transfer of Affordance Regions? AffCorrs! 2022

RoboTube: Learning Household Manipulation from Human Videos with Simulated Twin Environments 2022

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning 2022

Watch and Match: Supercharging Imitation with Regularized Optimal Transport 2022

Offline Reinforcement Learning for Visual Navigation 2022