ICML 2024

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL