AISTATS 2025

Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis