Learning to Play General-Sum Games against Multiple Boundedly Rational Agents

Eric Zhao; Alexander R. Trott; Caiming Xiong; Stephan Zheng

2023 AAAI AAAI 2023

Learning to Play General-Sum Games against Multiple Boundedly Rational Agents

Abstract

Abstract We study the problem of training a principal in a multi-agent general-sum game using reinforcement learning (RL). Learning a robust principal policy requires anticipating the worst possible strategic responses of other agents, which is generally NP-hard. However, we show that no-regret dynamics can identify these worst-case responses in poly-time in smooth games. We propose a framework that uses this policy evaluation method for efficiently learning a robust principal policy using RL. This framework can be extended to provide robustness to boundedly rational agents too. Our motivating application is automated mechanism design: we empirically demonstrate our framework learns robust mechanisms in both matrix games and complex spatiotemporal games. In particular, we learn a dynamic tax policy that improves the welfare of a simulated trade-and-barter economy by 15%, even when facing previously unseen boundedly rational RL taxpayers.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — boundedly rational agent

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Eric Zhao , Alexander R. Trott , Caiming Xiong , Stephan Zheng

Topics

Artificial Intelligence > Core AI > Agent Systems Reinforcement Learning > Applications > Game AI Machine Learning > Learning Types > Multi-Agent Systems Artificial Intelligence > Core AI > Game Theory Deep Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning bounded rationality no-regret dynamic general-sum game automated mechanism design principal-agent game boundedly rational agent

Download PDF

Related papers

A Model-Agnostic Heuristics for Selective Classification 2023

Tackling Safe and Efficient Multi-Agent Reinforcement Learning via Dynamic Shielding (Student Abstract) 2023

Head-Free Lightweight Semantic Segmentation with Linear Transformer 2023

Hierarchical ConViT with Attention-Based Relational Reasoner for Visual Analogical Reasoning 2023

Deep Spiking Neural Networks with High Representation Similarity Model Visual Pathways of Macaque and Mouse 2023