2025 UAI UAI 2025

Lower Bound on Howard Policy Iteration for Deterministic Markov Decision Processes