2025 ICML ICML 2025

Linear $Q$-Learning Does Not Diverge in $L^2$: Convergence Rates to a Bounded Set