Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following

Valts Blukis; Ross Knepper; Yoav Artzi

CoRL 2020

Abstract

We study the problem of learning a robot policy that follows natural language instructions and can be easily extended to reason about new objects. We introduce a few-shot language-conditioned object grounding method trained from augmented reality data that uses exemplars to identify objects and align them to their mentions in instructions. We present a learned map representation that encodes object locations and their instructed use, and construct it from our few-shot grounding output. We integrate this mapping approach into an instruction-following policy, thereby allowing it to reason about previously unseen objects at test time by simply adding exemplars. We evaluate on the task of learning to map raw observations and instructions to continuous control of a physical quadcopter. Our approach significantly outperforms the prior state of the art in the presence of new objects, even when the prior approach observes all objects during training.

Topics

Artificial Intelligence > Learning Paradigms > Few-Shot Learning
Reinforcement Learning > Applications > Robotics

Keywords

few-shot learning, natural language, object grounding, robot instruction
