2025 COLT COLT 2025

Span-Agnostic Optimal Sample Complexity and Oracle Inequalities for Average-Reward RL