Including Uncertainty when Learning from Human Corrections

Dylan P. Losey; Marcia K. O’Malley

2018 CORL CoRL 2018

Including Uncertainty when Learning from Human Corrections

Abstract

It is difficult for humans to efficiently teach robots how to correctly perform a task. One intuitive solution is for the robot to iteratively learn the human’s preferences from corrections, where the human improves the robot’s current behavior at each iteration. When learning from corrections, we argue that while the robot should estimate the most likely human preferences, it should also know what it does not know, and integrate this uncertainty as it makes decisions. We advance the state-of-the-art by introducing a Kalman filter for learning from corrections: this approach obtains the uncertainty of the estimated human preferences. Next, we demonstrate how the estimate uncertainty can be leveraged for active learning and risk-sensitive deployment. Our results indicate that obtaining and leveraging uncertainty leads to faster learning from human corrections.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

📈 Trend Setter — Active Learning

🧭 Keyword Pioneer — risk-sensitive deployment

🐣 Hot Topic Early Bird — preference learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dylan P. Losey , Marcia K. O’Malley

Topics

Artificial Intelligence > Core AI > Human-AI Interaction Machine Learning > Optimization & Theory > Bayesian Inference Artificial Intelligence > Learning Paradigms > Active Learning

Keywords

active learning uncertainty quantification preference learning human-robot interaction kalman filter risk-sensitive deployment

Download PDF

Related papers

Batch Active Preference-Based Learning of Reward Functions 2018

Personalized Dynamics Models for Adaptive Assistive Navigation Systems 2018

Neural Modular Control for Embodied Question Answering 2018

Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents 2018

Deep Drone Racing: Learning Agile Flight in Dynamic Environments 2018