Eliciting Compatible Demonstrations for Multi-Human Imitation Learning

Kanishk Gandhi; Siddharth Karamcheti; Madeline Liao; Dorsa Sadigh

2022 CORL CoRL 2022

Eliciting Compatible Demonstrations for Multi-Human Imitation Learning

Abstract

Imitation learning from human-provided demonstrations is a strong approach for learning policies for robot manipulation. While the ideal dataset for imitation learning is homogenous and low-variance - reflecting a single, optimal method for performing a task - natural human behavior has a great deal of heterogeneity, with several optimal ways to demonstrate a task. This multimodality is inconsequential to human users, with task variations manifesting as subconscious choices; for example, reaching down, then across to grasp an object, versus reaching across, then down. Yet, this mismatch presents a problem for interactive imitation learning, where sequences of users improve on a policy by iteratively collecting new, possibly conflicting demonstrations. To combat this problem of demonstrator incompatibility, this work designs an approach for 1) measuring the compatibility of a new demonstration given a base policy, and 2) actively eliciting more compatible demonstrations from new users. Across two simulation tasks requiring long-horizon, dexterous manipulation and a real-world “food plating” task with a Franka Emika Panda arm, we show that we can both identify incompatible demonstrations via post-hoc filtering, and apply our compatibility measure to actively elicit compatible demonstrations from new users, leading to improved task success rates across simulated and real environments.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — demonstrator compatibility

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Kanishk Gandhi , Siddharth Karamcheti , Madeline Liao , Dorsa Sadigh

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Reinforcement Learning > Applications > Robotics Machine Learning > Learning Types > Imitation Learning

Keywords

imitation learning interactive learning human-robot interaction robot manipulation demonstrator compatibility multi-human learning

Download PDF

Related papers

One-Shot Transfer of Affordance Regions? AffCorrs! 2022

RoboTube: Learning Household Manipulation from Human Videos with Simulated Twin Environments 2022

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning 2022

Watch and Match: Supercharging Imitation with Regularized Optimal Transport 2022

Offline Reinforcement Learning for Visual Navigation 2022