Papers

2025 AACL
High-Dimensional Dueling Optimization with Preference Embedding
Yangwenhui Zhang, Hong Qian, Xiang Shu et al.
2023 AAAI
Preference Ranking Optimization for Human Alignment
Feifan Song, Bowen Yu, Minghao Li et al.
2024 AAAI
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin et al.
2025 ICLR
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An, Junhyeok Lee, Xingdong Zuo et al.
2023 NeurIPS
Gradient-Based Optimization for Bayesian Preference Elicitation
Ivan Vendrov, Tyler Lu, Qingqing Huang et al.
2020 AAAI
Multi-Objective Bayesian Optimization with Active Preference Learning
Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki et al.
2024 AAAI
DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment
Rongzhi Zhang, Chenwei Zhang, Xinyang Zhang et al.
2025 EMNLP
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang, Diyuan Shi, Jinxin Liu et al.
2023 ICML
Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes
Zhiyuan Jerry Lin, Raul Astudillo, Peter Frazier et al.
2022 AISTATS