2025
ICML
ICML 2025
Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data
Authors
Siqi Guo
,
Ilgee Hong
,
Vicente Balmaseda
,
Changlong Yu
,
Liang Qiu
,
Xin Liu
,
Haoming Jiang
,
Tuo Zhao
,
Tianbao Yang