Successor Features Based Multi-Agent RL for Event-Based Decentralized MDPs

Tarun Gupta; Akshat Kumar; Praveen Paruchuri

2019 AAAI AAAI 2019

Successor Features Based Multi-Agent RL for Event-Based Decentralized MDPs

Abstract

Abstract Decentralized MDPs (Dec-MDPs) provide a rigorous framework for collaborative multi-agent sequential decisionmaking under uncertainty. However, their computational complexity limits the practical impact. To address this, we focus on a class of Dec-MDPs consisting of independent collaborating agents that are tied together through a global reward function that depends upon their entire histories of states and actions to accomplish joint tasks. To overcome scalability barrier, our main contributions are: (a) We propose a new actor-critic based Reinforcement Learning (RL) approach for event-based Dec-MDPs using successor features (SF) which is a value function representation that decouples the dynamics of the environment from the rewards; (b) We then present Dec-ESR (Decentralized Event based Successor Representation) which generalizes learning for event-based Dec-MDPs using SF within an end-to-end deep RL framework; (c) We also show that Dec-ESR allows useful transfer of information on related but different tasks, hence bootstraps the learning for faster convergence on new tasks; (d) For validation purposes, we test our approach on a large multi-agent coverage problem which models schedule coordination of agents in a real urban subway network and achieves better quality solutions than previous best approaches.

🚀 Conference Pioneer — AAAI 2019

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — decentralized markov decision process

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Tarun Gupta , Akshat Kumar , Praveen Paruchuri

Topics

Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Methods > Multi-Agent Systems Reinforcement Learning > Applications > Value Iteration Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Transfer Learning Reinforcement Learning > Applications > Multi-Agent Systems

Keywords

multi-agent reinforcement learning transfer learning value function successor feature decentralized markov decision process event-based control event-based planning decentralized mdp

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019