Coordinated Exploration in Concurrent Reinforcement Learning

Maria Dimakopoulou; Benjamin Van Roy

2018 ICML ICML 2018

Coordinated Exploration in Concurrent Reinforcement Learning

Abstract

We consider a team of reinforcement learning agents that concurrently learn to operate in a common environment. We identify three properties - adaptivity, commitment, and diversity - which are necessary for efficient coordinated exploration and demonstrate that straightforward extensions to single-agent optimistic and posterior sampling approaches fail to satisfy them. As an alternative, we propose seed sampling, which extends posterior sampling in a manner that meets these requirements. Simulation results investigate how per-agent regret decreases as the number of agents grows, establishing substantial advantages of seed sampling over alternative exploration schemes.

🧭 Keyword Pioneer — coordinated exploration

🐣 Hot Topic Early Bird — multi-agent reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics, Security & Privacy

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

Authors

Maria Dimakopoulou , Benjamin Van Roy

Topics

Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Methods > Multi-Agent Systems Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Multi-Agent Systems Reinforcement Learning > Applications > Multi-Agent Systems

Keywords

multi-agent reinforcement learning reinforcement learning posterior sampling regret bound coordinated exploration seed sampling multi-agent system

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018