Papers

16,557 papers found
Aligning Visual Contrastive learning models via Preference Optimization
Amirabbas Afzali, Borna khodabandeh, Ali Rasekh et al.
2025 ICLR
2025 ICLR
Self-Improving Robust Preference Optimization
Eugene Choi, Arash Ahmadian, Matthieu Geist et al.
2025 ICLR
Self-Play Preference Optimization for Language Model Alignment
Yue Wu, Zhiqing Sun, Huizhuo Yuan et al.
2025 ICLR
2025 ICLR
2025 ICLR
2025 ICLR
Weighted-Reward Preference Optimization for Implicit Model Fusion
Ziyi Yang, Fanqi Wan, Longguang Zhong et al.
2025 ICLR
2025 ICLR
2025 ICLR
Preference Optimization for Reasoning with Pseudo Feedback
Fangkai Jiao, Geyang Guo, Xingxing Zhang et al.
2025 ICLR
2025 ICLR
2025 ICLR
2025 ICLR
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
Songtao Jiang, Yan Zhang, Ruizhe Chen et al.
2025 IJCAI