Learning Nash Equilibrium for General-Sum Markov Games from Batch Data

Julien Pérolat; Florian Strub; Bilal Piot; Olivier Pietquin

2017 AISTATS AISTATS 2017

Learning Nash Equilibrium for General-Sum Markov Games from Batch Data

Abstract

This paper addresses the problem of learning a Nash equilibrium in $γ$-discounted multiplayer general-sum Markov Games (MGs) in a batch setting. As the number of players increases in MG, the agents may either collaborate or team apart to increase their final rewards. One solution to address this problem is to look for a Nash equilibrium. Although, several techniques were found for the subcase of two-player zero-sum MGs, those techniques fail to find a Nash equilibrium in general-sum Markov Games. In this paper, we introduce a new definition of $ε$-Nash equilibrium in MGs which grasps the strategy’s quality for multiplayer games. We prove that minimizing the norm of two Bellman-like residuals implies to learn such an $ε$-Nash equilibrium. Then, we show that minimizing an empirical estimate of the $L_p$ norm of these Bellman-like residuals allows learning for general-sum games within the batch setting. Finally, we introduce a neural network architecture that successfully learns a Nash equilibrium in generic multiplayer general-sum turn-based MGs.

🧭 Keyword Pioneer — general-sum markov game

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

📈 Trend Setter — Multi-Agent Systems

🐣 Hot Topic Early Bird — nash equilibrium

Authors

Julien Pérolat , Florian Strub , Bilal Piot , Olivier Pietquin

Topics

Artificial Intelligence > Core AI > Game AI Artificial Intelligence > Core AI > Multi-Agent Systems Machine Learning > Learning Types > Multi-Agent Systems Artificial Intelligence > Core AI > Game Theory Reinforcement Learning > Applications > Multi-Agent Systems

Keywords

neural network architecture game theory nash equilibrium batch learning batch reinforcement learning bellman residual markov game general-sum markov game multiplayer game general-sum game multi-agent system

Download PDF

Related papers

Conditions beyond treewidth for tightness of higher-order LP relaxations 2017

Non-square matrix sensing without spurious local minima via the Burer-Monteiro approach 2017

Tensor-Dictionary Learning with Deep Kruskal-Factor Analysis 2017

A Sub-Quadratic Exact Medoid Algorithm 2017

Performance Bounds for Graphical Record Linkage 2017