Neural Modular Control for Embodied Question Answering

Abhishek Das; Georgia Gkioxari; Stefan Lee; Devi Parikh; Dhruv Batra

2018 CORL CoRL 2018

Neural Modular Control for Embodied Question Answering

Abstract

We present a modular approach for learning policies for navigation over long planning horizons from language input. Our hierarchical policy operates at multiple timescales, where the higher-level master policy proposes subgoals to be executed by specialized sub-policies. Our choice of subgoals is compositional and semantic, i.e. they can be sequentially combined in arbitrary orderings, and assume human-interpretable descriptions (e.g. ‘exit room’, ‘find kitchen’, ‘find refrigerator’, etc.). We use imitation learning to warm-start policies at each level of the hierarchy, dramatically increasing sample efficiency, followed by reinforcement learning. Independent reinforcement learning at each level of hierarchy enables sub-policies to adapt to consequences of their actions and recover from errors. Subsequent joint hierarchical training enables the master policy to adapt to the sub-policies. On the challenging EQA [1] benchmark in House3D [2], requiring navigating diverse realistic indoor environments, our approach outperforms prior work by a significant margin, both in terms of navigation and question answering.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing and Robotics

📈 Trend Setter — Question Answering

🧭 Keyword Pioneer — embodied question answering

🐣 Hot Topic Early Bird — imitation learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Abhishek Das , Georgia Gkioxari , Stefan Lee , Devi Parikh , Dhruv Batra

Topics

Artificial Intelligence > Core AI > Planning Natural Language Processing > Applications > Question Answering Robotics > Capabilities > Navigation

Keywords

reinforcement learning imitation learning embodied question answering hierarchical policy language navigation subgoal planning

Download PDF

Related papers

Batch Active Preference-Based Learning of Reward Functions 2018

Personalized Dynamics Models for Adaptive Assistive Navigation Systems 2018

Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents 2018

Deep Drone Racing: Learning Agile Flight in Dynamic Environments 2018

Fast 3D Modeling with Approximated Convolutional Kernels 2018