Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Joel Z Leibo; Edgar A Dueñez-Guzman; Alexander Vezhnevets; John P Agapiou; Peter Sunehag; Raphael Koster; Jayd Matyas; Charlie Beattie; Igor Mordatch; Thore Graepel

2021 ICML ICML 2021

Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Abstract

Existing evaluation suites for multi-agent reinforcement learning (MARL) do not assess generalization to novel situations as their primary objective (unlike supervised learning benchmarks). Our contribution, Melting Pot, is a MARL evaluation suite that fills this gap and uses reinforcement learning to reduce the human labor required to create novel test scenarios. This works because one agent’s behavior constitutes (part of) another agent’s environment. To demonstrate scalability, we have created over 80 unique test scenarios covering a broad range of research topics such as social dilemmas, reciprocity, resource sharing, and task partitioning. We apply these test scenarios to standard MARL training algorithms, and demonstrate how Melting Pot reveals weaknesses not apparent from training performance alone.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — evaluation benchmark

🐣 Hot Topic Early Bird — multi-agent reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Machine Learning, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Joel Z Leibo , Edgar A Dueñez-Guzman , Alexander Vezhnevets , John P Agapiou , Peter Sunehag , Raphael Koster , Jayd Matyas , Charlie Beattie , Igor Mordatch , Thore Graepel

Topics

Artificial Intelligence > Core AI > Game AI Machine Learning > Optimization & Theory > Learning Theory Reinforcement Learning > Methods > Multi-Agent Systems Machine Learning > Learning Types > Multi-Agent Systems Reinforcement Learning > Applications > Multi-Agent Systems Deep Learning > Learning Types > Reinforcement Learning

Keywords

multi-agent reinforcement learning evaluation benchmark social dilemma scenario generation test scenario

Download PDF

Related papers

GRAND: Graph Neural Diffusion 2021

Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits 2021

Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation 2021

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution 2021

Dataset Dynamics via Gradient Flows in Probability Space 2021