Papers

16,557 papers found
Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences
Mohammad Saqib Hasan, Saikat Chakraborty, Santu Karmaker et al.
2025 ACL
LPOI: Listwise Preference Optimization for Vision Language Models
Fatemeh Pesaran Zadeh, Yoojin Oh, Gunhee Kim
2025 ACL
T-REG: Preference Optimization with Token-Level Reward Regularization
Wenxuan Zhou, Shujian Zhang, Lingxiao Zhao et al.
2025 ACL
K-order Ranking Preference Optimization for Large Language Models
Shihao Cai, Chongming Gao, Yang Zhang et al.
2025 ACL
Robust Preference Optimization via Dynamic Target Margins
Jie Sun, Junkang Wu, Jiancan Wu et al.
2025 ACL
2025 ACL
Reverse Preference Optimization for Complex Instruction Following
Xiang Huang, Ting-En Lin, Feiteng Fang et al.
2025 ACL