2024 ICML ICML 2024

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF