2025 AISTATS AISTATS 2025

Common Learning Constraints Alter Interpretations of Direct Preference Optimization