PONI: Potential Functions for ObjectGoal Navigation With Interaction-Free Learning

Santhosh Kumar Ramakrishnan; Devendra Singh Chaplot; Ziad Al-Halah; Jitendra Malik; Kristen Grauman

2022 CVPR CVPR 2022

PONI: Potential Functions for ObjectGoal Navigation With Interaction-Free Learning

Abstract

State-of-the-art approaches to ObjectGoal navigation (ObjectNav) rely on reinforcement learning and typically require significant computational resources and time for learning. We propose Potential functions for ObjectGoal Navigation with Interaction-free learning (PONI), a modular approach that disentangles the skills of 'where to look?' for an object and 'how to navigate to (x, y)?'. Our key insight is that 'where to look?' can be treated purely as a perception problem, and learned without environment interactions. To address this, we propose a network that predicts two complementary potential functions conditioned on a semantic map and uses them to decide where to look for an unseen object. We train the potential function network using supervised learning on a passive dataset of top-down semantic maps, and integrate it into a modular framework to perform ObjectNav. Experiments on Gibson and Matterport3D demonstrate that our method achieves the state-of-the-art for ObjectNav while incurring up to 1,600x less computational cost for training. Code and pre-trained models are available.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning and Reinforcement Learning and Robotics

🧭 Keyword Pioneer — interaction free learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Santhosh Kumar Ramakrishnan , Devendra Singh Chaplot , Ziad Al-Halah , Jitendra Malik , Kristen Grauman

Topics

Artificial Intelligence > Core AI > Planning Machine Learning > Learning Types > Self-Supervised Learning Computer Vision > Domain-Specific > Autonomous Driving Reinforcement Learning > Applications > Robotics Machine Learning > Learning Types > Supervised Learning Artificial Intelligence > Core AI > Robotics Artificial Intelligence > Core AI > Reinforcement Learning Computer Vision > Domain-Specific > Robotics Robotics > Applications > Robotics

Keywords

reinforcement learning supervised learning object goal navigation semantic map potential function modular framework semantic mapping object-goal navigation interaction free learning objectgoal navigation interaction-free learning

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022