Reciprocal MIND MELD: Improving Learning From Demonstration via Personalized, Reciprocal Teaching

Mariah L Schrum; Erin Hedlund-Botti; Matthew Gombolay

2022 CORL CoRL 2022

Reciprocal MIND MELD: Improving Learning From Demonstration via Personalized, Reciprocal Teaching

Abstract

Endowing robots with the ability to learn novel tasks via demonstrations will increase the accessibility of robots for non-expert, non-roboticists. However, research has shown that humans can be poor teachers, making it difficult for robots to effectively learn from humans. If the robot could instruct humans how to provide better demonstrations, then humans might be able to effectively teach a broader range of novel, out-of-distribution tasks. In this work, we introduce Reciprocal MIND MELD, a framework in which the robot learns the way in which a demonstrator is suboptimal and utilizes this information to provide feedback to the demonstrator to improve upon their demonstrations. We additionally develop an Embedding Predictor Network which learns to predict the demonstrator’s suboptimality online without the need for optimal labels. In a series of human-subject experiments in a driving simulator domain, we demonstrate that robotic feedback can effectively improve human demonstrations in two dimensions of suboptimality (p < .001) and that robotic feedback translates into better learning outcomes for a robotic agent on novel tasks (p = .045).

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — embedding predictor network

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Mariah L Schrum , Erin Hedlund-Botti , Matthew Gombolay

Topics

Machine Learning > Learning Types > Self-Supervised Learning Deep Learning > Architectures > Neural Networks Reinforcement Learning > Applications > Robotics

Keywords

human-robot interaction learning from demonstration embedding predictor network reciprocal teaching demonstrator suboptimality

Download PDF

Related papers

One-Shot Transfer of Affordance Regions? AffCorrs! 2022

RoboTube: Learning Household Manipulation from Human Videos with Simulated Twin Environments 2022

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning 2022

Watch and Match: Supercharging Imitation with Regularized Optimal Transport 2022

Offline Reinforcement Learning for Visual Navigation 2022