Safe and Nested Subgame Solving for Imperfect-Information Games

Noam Brown; Tuomas Sandholm

2017 NIPS NeurIPS 2017

Safe and Nested Subgame Solving for Imperfect-Information Games

Abstract

In imperfect-information games, the optimal strategy in a subgame may depend on the strategy in other, unreached subgames. Thus a subgame cannot be solved in isolation and must instead consider the strategy for the entire game as a whole, unlike perfect-information games. Nevertheless, it is possible to first approximate a solution for the whole game and then improve it in individual subgames. This is referred to as subgame solving. We introduce subgame-solving techniques that outperform prior methods both in theory and practice. We also show how to adapt them, and past subgame-solving techniques, to respond to opponent actions that are outside the original action abstraction; this significantly outperforms the prior state-of-the-art approach, action translation. Finally, we show that subgame solving can be repeated as the game progresses down the game tree, leading to far lower exploitability. These techniques were a key component of Libratus, the first AI to defeat top humans in heads-up no-limit Texas hold'em poker.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning

📈 Trend Setter — Multi-Agent Systems

🧭 Keyword Pioneer — subgame solving

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Noam Brown , Tuomas Sandholm

Topics

Artificial Intelligence > Core AI > Game AI Reinforcement Learning > Applications > Game AI Artificial Intelligence > Core AI > Game Theory Reinforcement Learning > Applications > Multi-Agent Systems

Keywords

game theory imperfect-information game subgame solving imperfect information game action abstraction

Download PDF

Related papers

High-Order Attention Models for Visual Question Answering 2017

Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization 2017

Premise Selection for Theorem Proving by Deep Graph Embedding 2017

Neural Program Meta-Induction 2017

PRUNE: Preserving Proximity and Global Ranking for Network Embedding 2017