Hierarchical and Partially Observable Goal-Driven Policy Learning With Goals Relational Graph

Xin Ye; Yezhou Yang

2021 CVPR CVPR 2021

Hierarchical and Partially Observable Goal-Driven Policy Learning With Goals Relational Graph

Abstract

We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the underlying relations of all goals in the goal space through a Dirichlet-categorical process that facilitates: 1) the high-level network raising a sub-goal towards achieving a designated final goal; 2) the low-level network towards an optimal policy; and 3) the overall system generalizing unseen environments and goals. We evaluate our approach with two settings of partially observable goal-driven tasks -- a grid-world domain and a robotic object search task. Our experimental results show that our approach exhibits superior generalization performance on both unseen environments and new goals.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning

🧭 Keyword Pioneer — goal-driven policy learning

🐣 Hot Topic Early Bird — partially observable markov decision process

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xin Ye , Yezhou Yang

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Artificial Intelligence > Core AI > Planning Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics

Keywords

policy learning visual navigation hierarchical reinforcement learning partially observable markov decision process partial observability graph neural network goal-driven navigation goal-driven policy learning goals relational graph partially observable task dirichlet-categorical process

Download PDF

Related papers

Learning To Reconstruct High Speed and High Dynamic Range Videos From Events 2021

DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls 2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs 2021

Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization 2021

Pose-Guided Human Animation From a Single Image in the Wild 2021