AISTATS 2025

Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span