MOPA: Modular Object Navigation With PointGoal Agents

Sonia Raychaudhuri; Tommaso Campari; Unnat Jain; Manolis Savva; Angel X. Chang

2024 WACV WACV 2024

MOPA: Modular Object Navigation With PointGoal Agents

Abstract

We propose a simple but effective modular approach MOPA (Modular ObjectNav with PointGoal agents) to systematically investigate the inherent modularity of the object navigation task in Embodied AI. MOPA consists of four modules: (a) an object detection module trained to identify objects from RGB images, (b) a map building module to build a semantic map of the observed objects, (c) an exploration module enabling the agent to explore the environment, and (d) a navigation module to move to identified target objects. We show that we can effectively reuse a pretrained PointGoal agent as the navigation model instead of learning to navigate from scratch, thus saving time and compute. We also compare various exploration strategies for MOPA and find that a simple uniform strategy significantly outperforms more advanced exploration methods.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning and Robotics

🧭 Keyword Pioneer — point goal agent

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio