2025 ICML ICML 2025

Learning Imperfect Information Extensive-form Games with Last-iterate Convergence under Bandit Feedback