2024
ICML
ICML 2024
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Authors
Jesse Farebrother
,
Jordi Orbay
,
Quan Vuong
,
Adrien Ali Taiga
,
Yevgen Chebotar
,
Ted Xiao
,
Alex Irpan
,
Sergey Levine
,
Pablo Samuel Castro
,
Aleksandra Faust
,
Aviral Kumar
,
Rishabh Agarwal