Papers

16,557 papers found
Multi-Reference Preference Optimization for Large Language Models
Hung Le, Quan Hung Tran, Dung Nguyen et al.
2025 AAAI
Atomic Consistency Preference Optimization for Long-Form Question Answering
Jingfeng Chen, Raghuveer Thirukovalluru, Junlin Wang et al.
2025 AACL
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park, Rafael Rafailov, Stefano Ermon et al.
2024 ACL
Direct Preference Optimization with an Offset
Afra Amini, Tim Vieira, Ryan Cotterell
2024 ACL
AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs
Nicholas E. Corrado, Julian Katz-Samuels, Adithya M Devraj et al.
2025 ACL