Papers
16,557 papers found
CAPO: Confidence Aware Preference Optimization Learning for Multilingual Preferences
Rhitabrat Pokharel, Yufei Tao, Ameeta Agrawal
NHK Submission to WAT 2025: Leveraging Preference Optimization for Article-level Japanese–English News Translation
Hideya Mino, Rei Endo, Yoshihiko Kawai
High-Dimensional Dueling Optimization with Preference Embedding
Yangwenhui Zhang, Hong Qian, Xiang Shu et al.
Preference Ranking Optimization for Human Alignment
Feifan Song, Bowen Yu, Minghao Li et al.
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Junru Lu, Siyu An, Min Zhang et al.
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation
Lanyun Zhu, Tianrun Chen, Qianxiong Xu et al.
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin et al.
Relation-Augmented Dueling Bayesian Optimization via Preference Propagation
Xiang Xia, Xiang Shu, Shuo Liu et al.
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An, Junhyeok Lee, Xingdong Zuo et al.
Gradient-Based Optimization for Bayesian Preference Elicitation
Ivan Vendrov, Tyler Lu, Qingqing Huang et al.
Multi-Objective Bayesian Optimization with Active Preference Learning
Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki et al.
DreamAlign: Dynamic Text-to-3D Optimization with Human Preference Alignment
Gaofeng Liu, Zhiyuan Ma, Tao Fang
Multi-attribute Bayesian optimization with interactive preference learning
Raul Astudillo, Peter Frazier
DORM: Preference Data Weights Optimization for Reward Modeling in LLM Alignment
Rongzhi Zhang, Chenwei Zhang, Xinyang Zhang et al.
Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning
Xinyu Sun, Zhikun Zhao, Congyan Lang et al.
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang, Diyuan Shi, Jinxin Liu et al.
Suit the Remedy to the Retriever: Interpretable Query Optimization with Retriever Preference Alignment for Vision-Language Retrieval
GuangHao Meng, Jinpeng Wang, Jieming Zhu et al.
Token-level Preference Self-Alignment Optimization for Multi-style Outline Controllable Generation
Zihao Li, Xuekong Xu, Ziyao Chen et al.
MWPO: Enhancing LLMs Performance through Multi-Weight Preference Strength and Length Optimization
Shiyue Xu, Fu Zhang, Jingwei Cheng et al.
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang, Min-hwan Oh
Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes
Zhiyuan Jerry Lin, Raul Astudillo, Peter Frazier et al.
Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits
Tian Huang, Shengbo Wang, Ke Li