Scalable Coordinated Exploration in Concurrent Reinforcement Learning

Maria Dimakopoulou; Ian Osband; Benjamin Van Roy

NeurIPS 2018

Abstract

We consider a team of reinforcement learning agents that concurrently operate in a common environment, and we develop an approach to efficient coordinated exploration that is suitable for problems of practical scale. Our approach builds on the seed sampling concept introduced in Dimakopoulou and Van Roy (2018) and on a randomized value function learning algorithm from Osband et al. (2016). We demonstrate that, for simple tabular contexts, the approach is competitive with those previously proposed in Dimakopoulou and Van Roy (2018), and that, for a higher-dimensional problem with a neural network value function representation, it learns quickly with far fewer agents than alternative exploration schemes.
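As a rough illustration of the seed sampling idea the abstract refers to, the sketch below gives each agent a fixed random seed that determines both a prior draw and the noise added to a shared data set. Each agent then fits its own value estimate to the perturbed data, so agents stay diverse while learning from the same observations. This is not the authors' algorithm: the linear-Gaussian model, the function name `seeded_estimate`, and the parameters `noise_std` and `prior_std` are all assumptions made for illustration.

```python
import numpy as np

def seeded_estimate(X, y, seed, noise_std=1.0, prior_std=1.0):
    """Regularized least-squares fit to shared data (X, y), perturbed by
    randomness that is fully determined by the agent's seed (hypothetical helper)."""
    n, d = X.shape
    rng = np.random.default_rng(seed)
    # Seed-determined draw from an assumed N(0, prior_std^2 I) prior.
    theta_prior = prior_std * rng.standard_normal(d)
    # Seed-determined perturbation for each shared observation.
    z = noise_std * rng.standard_normal(n)
    # Minimize ||y + z - X theta||^2 / noise_std^2 + ||theta - theta_prior||^2 / prior_std^2.
    A = X.T @ X / noise_std**2 + np.eye(d) / prior_std**2
    b = X.T @ (y + z) / noise_std**2 + theta_prior / prior_std**2
    return np.linalg.solve(A, b)
```

Calling `seeded_estimate(X, y, seed=k)` for each agent `k` on the same shared `X` and `y` yields one estimate per agent. An agent with a given seed reproduces the same estimate from the same data, while agents with different seeds generally disagree, which is what spreads their exploration.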

Topics

Reinforcement Learning > Methods > Deep RL; Reinforcement Learning > Methods > Multi-Agent Systems

Keywords

multi-agent reinforcement learning; randomized value function; coordinated exploration; seed sampling; neural network representation
