AISTATS 2025

Infinite-Horizon Reinforcement Learning with Multinomial Logit Function Approximation