2025 ALT ALT 2025

The Plug-in Approach for Average-Reward and Discounted MDPs: Optimal Sample Complexity Analysis