2025 ICML ICML 2025

Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data