ICML 2024
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Authors
Harrison Lee, Samrat Phatale, Hassan Mansoor, Thomas Mesnard, Johan Ferret, Kellie Ren Lu, Colton Bishop, Ethan Hall, Victor Cărbune, Abhinav Rastogi, Sushant Prakash