2024 ICML ICML 2024

Online Matching with Stochastic Rewards: Provable Better Bound via Adversarial Reinforcement Learning