2025
ICML
ICML 2025
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Authors
Shenao Zhang
,
Zhihan Liu
,
Boyi Liu
,
Yufeng Zhang
,
Yingxiang Yang
,
Yongfei Liu
,
Liyu Chen
,
Tao Sun
,
Zhaoran Wang