On the Heterogeneity of Independent Learning Dynamics in Zero-sum Stochastic Games

Muhammed Sayin; Kemal Cetiner

2022 L4DC L4DC 2022

On the Heterogeneity of Independent Learning Dynamics in Zero-sum Stochastic Games

Abstract

We analyze the convergence properties of the two-timescale fictitious play combining the classical fictitious play with the Q-learning for two-player zero-sum stochastic games with player-dependent learning rates. We show its almost sure convergence under the standard assumptions in two-timescale stochastic approximation methods when the discount factor is less than the product of the ratios of player-dependent step sizes. To this end, we formulate a novel Lyapunov function formulation and present a one-sided asynchronous convergence result.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Muhammed Sayin , Kemal Cetiner

Topics

Artificial Intelligence > Core AI > Game AI Artificial Intelligence > Core AI > Multi-Agent Systems Machine Learning > Optimization & Theory > Stochastic Processes

Keywords

convergence analysis multi-agent learning zero-sum game stochastic game fictitious play

Download PDF

Related papers

Learning-Enabled Robust Control with Noisy Measurements 2022

Input-to-State Stable Neural Ordinary Differential Equations with Applications to Transient Modeling of Circuits 2022

Data-Driven Controller Synthesis of Unknown Nonlinear Polynomial Systems via Control Barrier Certificates 2022

Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks 2022

On the Effectiveness of Iterative Learning Control 2022