Coco-Q: Learning in Stochastic Games with Side Payments

Eric Sodomka; Elizabeth Hilliard; Michael Littman; Amy Greenwald

2013 ICML ICML 2013

Coco-Q: Learning in Stochastic Games with Side Payments

Abstract

Coco (""cooperative/competitive"") values are a solution concept for two-player normal-form games with transferable utility, when binding agreements and side payments between players are possible. In this paper, we show that coco values can also be defined for stochastic games and can be learned using a simple variant of Q-learning that is provably convergent. We provide a set of examples showing how the strategies learned by the Coco-Q algorithm relate to those learned by existing multiagent Q-learning algorithms.

🚀 Conference Pioneer — ICML 2013

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning

📈 Trend Setter — Multi-Agent Systems

🧭 Keyword Pioneer — cooperative competitive game

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Eric Sodomka , Elizabeth Hilliard , Michael Littman , Amy Greenwald

Topics

Artificial Intelligence > Core AI > Game AI Reinforcement Learning > Methods > Multi-Agent Systems

Keywords

multi-agent learning stochastic game cooperative competitive game side payment

Download PDF

Related papers

Convex Adversarial Collective Classification 2013

Gaussian Process Vine Copulas for Multivariate Dependence 2013

Stochastic Simultaneous Optimistic Optimization 2013

Generic Exploration and K-armed Voting Bandits 2013

Robust Structural Metric Learning 2013