Papers

16,557 papers found
Direct Multi-Turn Preference Optimization for Language Agents
Wentao Shi, Mengqi Yuan, Junkang Wu et al.
2024 EMNLP
2024 EMNLP
2024 EMNLP
WPO: Enhancing RLHF with Weighted Preference Optimization
Wenxuan Zhou, Ravi Agrawal, Shujian Zhang et al.
2024 EMNLP
2024 EMNLP
2024 EMNLP
2024 EMNLP
Filtered Direct Preference Optimization
Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai et al.
2024 EMNLP
2024 EMNLP
Step-level Value Preference Optimization for Mathematical Reasoning
Guoxin Chen, Minpeng Liao, Chengxi Li et al.
2024 EMNLP
Direct Judgement Preference Optimization
PeiFeng Wang, Austin Xu, Yilun Zhou et al.
2025 EMNLP
2025 EMNLP
Weights-Rotated Preference Optimization for Large Language Models
Chenxu Yang, Ruipeng Jia, Mingyu Zheng et al.
2025 EMNLP