2024
ICML
ICML 2024
Nash Learning from Human Feedback
Authors
Rémi Munos
,
Michal Valko
,
Daniele Calandriello
,
Mohammad Gheshlaghi azar
,
Mark Rowland
,
Zhaohan Daniel Guo
,
Yunhao Tang
,
Matthieu Geist
,
Thomas Mesnard
,
Côme Fiegel
,
Andrea Michi
,
Marco Selvi
,
Sertan Girgin
,
Nikola Momchev
,
Olivier Bachem
,
Daniel J Mankowitz
,
Doina Precup
,
Bilal Piot