PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning

Iou-Jen Liu; Raymond A. Yeh; Alexander G. Schwing

CoRL 2019

Abstract

Sample efficiency and scalability to a large number of agents are two important goals for multi-agent reinforcement learning systems. Recent work has moved closer to these goals by addressing the non-stationarity of the environment from a single agent’s perspective, using a deep net critic that depends on all observations and actions. The critic input concatenates agent observations and actions in a user-specified order. However, since deep nets are not permutation invariant, a permuted input changes the critic output even though the environment remains identical. To avoid this inefficiency, we propose a ‘permutation invariant critic’ (PIC), which yields identical output irrespective of the agent permutation. This consistent representation enables our model to scale to 30 times more agents and to improve test episode reward by 15% to 50% on the challenging multi-agent particle environment (MPE).
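The difference between a concatenation-based critic and a permutation invariant one can be illustrated with a small NumPy sketch. This is not the authors' architecture: it assumes a toy two-layer critic with random weights and swaps in simple mean pooling over a shared per-agent embedding as one way to obtain invariance. All names, sizes, and weights (`concat_critic`, `pooled_critic`, `n_agents`, `feat_dim`, `hidden`) are hypothetical and chosen only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_agents, feat_dim, hidden = 4, 6, 16  # hypothetical sizes

# Each row stands for one agent's observation and action, concatenated (toy data).
x = rng.normal(size=(n_agents, feat_dim))

# Concatenation-based critic: flattens agents in a fixed order, so each
# agent slot gets its own weights and the output depends on the ordering.
W1 = rng.normal(size=(n_agents * feat_dim, hidden))
w2 = rng.normal(size=hidden)

def concat_critic(feats):
    return float(np.tanh(feats.reshape(-1) @ W1) @ w2)

# Pooled critic (assumed stand-in, not the paper's model): the same embedding
# is applied to every agent, then averaged, so agent order cannot matter.
V1 = rng.normal(size=(feat_dim, hidden))
v2 = rng.normal(size=hidden)

def pooled_critic(feats):
    per_agent = np.tanh(feats @ V1)          # shared weights across agents
    return float(per_agent.mean(axis=0) @ v2)  # symmetric pooling over agents

# Reorder the agents; the underlying environment state is unchanged.
perm = np.array([2, 0, 3, 1])
x_perm = x[perm]

concat_changed = not np.isclose(concat_critic(x), concat_critic(x_perm))
pooled_same = bool(np.isclose(pooled_critic(x), pooled_critic(x_perm)))
print("concat critic output changes under permutation:", concat_changed)
print("pooled critic output is unchanged under permutation:", pooled_same)
```

With a concatenated input, the critic has to learn separately that every ordering of the same agents describes the same state. A symmetric pooling step removes that redundancy by construction, which is the inefficiency the abstract refers to.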

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems

Reinforcement Learning > Methods > Deep RL

Reinforcement Learning > Methods > Multi-Agent Systems

Machine Learning > Learning Types > Multi-Agent Systems

Deep Learning > Learning Types > Reinforcement Learning

Keywords

deep reinforcement learning, multi-agent reinforcement learning, sample efficiency, deep neural network, permutation invariance, permutation invariant critic

Related papers

On-Policy Robot Imitation Learning from a Converging Supervisor 2019

Learning by Cheating 2019

Object-centric Forward Modeling for Model Predictive Control 2019

Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real 2019

Combining Deep Learning and Verification for Precise Object Instance Detection 2019