← Back to papers

2024 ICML ICML 2024

Nash Learning from Human Feedback

Authors

Rémi Munos , Michal Valko , Daniele Calandriello , Mohammad Gheshlaghi azar , Mark Rowland , Zhaohan Daniel Guo , Yunhao Tang , Matthieu Geist , Thomas Mesnard , Côme Fiegel , Andrea Michi , Marco Selvi , Sertan Girgin , Nikola Momchev , Olivier Bachem , Daniel J Mankowitz , Doina Precup , Bilal Piot

Related papers

Learning Latent Dynamic Robust Representations for World Models 2024

Beyond Individual Input for Deep Anomaly Detection on Tabular Data 2024

Risk Estimation in a Markov Cost Process: Lower and Upper Bounds 2024

Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval 2024

Ranking-based Client Imitation Selection for Efficient Federated Learning 2024