On the convergence of no-regret learning in selfish routing

Walid Krichene; Benjamin Drighès; Alexandre Bayen

ICML 2014

Abstract

We study the repeated, non-atomic routing game, in which selfish players make a sequence of routing decisions. We consider a model in which players use regret-minimizing algorithms as the learning mechanism, and study the resulting dynamics, focusing on convergence to the set of Nash equilibria of the routing game. No-regret learning algorithms are known to guarantee convergence of a subsequence of population strategies; we are concerned with convergence of the actual sequence. We show that convergence holds for a large class of online learning algorithms inspired by the continuous-time replicator dynamics. In particular, the discounted Hedge algorithm is proved to belong to this class, which guarantees its convergence.
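To make the kind of dynamics described in the abstract concrete, here is a minimal sketch of a Hedge-style multiplicative-weights update applied to the population flow on a two-link network. The abstract does not specify any of the details used here: the latency functions, the learning-rate schedule `1/sqrt(t)`, the starting flow, and all function names are illustrative assumptions, and the sketch is not the authors' algorithm or experimental setup.

```python
import math

def latencies(flow):
    """Hypothetical latency functions for two parallel links (an assumption)."""
    x1, x2 = flow
    return [2.0 * x1, 1.0 + x2]

def hedge_step(flow, losses, eta):
    """One multiplicative-weights (Hedge-style) update of the flow distribution."""
    weights = [x * math.exp(-eta * loss) for x, loss in zip(flow, losses)]
    total = sum(weights)
    return [w / total for w in weights]

def run(steps=5000, flow=(0.5, 0.5)):
    """Iterate the update with a decaying learning rate eta_t = 1/sqrt(t) (assumed)."""
    flow = list(flow)
    for t in range(1, steps + 1):
        eta = 1.0 / math.sqrt(t)
        flow = hedge_step(flow, latencies(flow), eta)
    return flow

final_flow = run()
final_latencies = latencies(final_flow)
```

For these assumed latencies, the equilibrium of the one-shot game puts two thirds of the mass on the first link, where both links have equal latency; in this toy instance the iterates themselves, not just a subsequence, approach that point.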

Topics

Artificial Intelligence > Core AI > Game AI
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Learning Types > Multi-Agent Systems
Mathematics & Optimization > Optimization > Game Theory
Artificial Intelligence > Core AI > Game Theory

Keywords

online learning, game theory, convergence analysis, regret minimization, Nash equilibrium, no-regret learning, replicator dynamics, selfish routing, routing game
