ICML 2025

Categorical Distributional Reinforcement Learning with Kullback-Leibler Divergence: Convergence and Asymptotics