Papers
16,557 papers found
Causal Direct Preference Optimization for Language Model Alignment
Uyen Le, Thin Nguyen, Toan Nguyen et al.
Attribute-Controlled Translation with Preference Optimization
Inigo Jauregi Unanue, Najmeh Sadoughi, Vimal Bhat et al.
Benchmarking Direct Preference Optimization for Medical Large Vision–Language Models
Dain Kim, Jiwoo Lee, Jaehoon Yun et al.
IRPO: Implicit Policy Regularized Preference Optimization
Youngsoo Jang, Yu Jin Kim, Geon-Hyeong Kim et al.
EPO: Diverse and Realistic Protein Ensemble Generation via Energy Preference Optimization
Yuancheng Sun, Yuxuan Ren, Zhaoming Chen et al.
Margin-Aware Preference Optimization for Aligning Diffusion Models Without Reference
Jiwoo Hong, Sayak Paul, Noah Lee et al.
FocusDPO: Dynamic Preference Optimization for Multi-Subject Personalized Image Generation via Adaptive Focus
Qiaoqiao Jin, Siming Fu, Dong She et al.
Rethinking Direct Preference Optimization in Diffusion Models
Junyong Kang, Seohyun Lim, Kyungjune Baek et al.
Test-Time Preference Optimization for Image Restoration
Bingchen Li, Xin Li, Jiaqi Xu et al.
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation
Mengchao Wang, Wang Qiang, Fan Jiang et al.
Diffusion Distillation with Direct Preference Optimization for Efficient 3D LiDAR Scene Completion
An Zhao, Shengyuan Zhang, Zejian Li et al.
OmniDPO: A Preference Optimization Framework to Address Omni-Modal Hallucination
Junzhe Chen, Tianshu Zhang, Shiyu Huang et al.
DEPO: Dual-Efficiency Preference Optimization for LLM Agents
Sirui Chen, Mengshi Zhao, Lei Xu et al.
LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization
Junsong Li, Jie Zhou, Bihao Zhan et al.
Query-Routed Activation Editing with Truth-hierarchical Preference Optimization
Kewei Liao, Tianbo Wang, Yuqing Ma et al.
Textual Self-Attention Network: Test-Time Preference Optimization Through Textual Gradient-Based Attention
Shibing Mo, Haoyang Ruan, Kai Wu et al.
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
Junshu Pan, Wei Shen, Shulin Huang et al.
LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models
Tiesunlong Shen, Rui Mao, Jin Wang et al.
Multi-level Style Preference Optimization: An Adaptive Detection Framework for Human-Machine Hybrid Text
Zehao Wang, Lianwei Wu, Wenbo An et al.
AP2O-Coder: Adaptively Progressive Preference Optimization for Reducing Compilation and Runtime Errors in LLM-Generated Code
Jianqing Zhang, Wei Xia, Hande Dong et al.
MetaGDPO: Alleviating Catastrophic Forgetting with Metacognitive Knowledge Through Group Direct Preference Optimization
Lanxue Zhang, Yuqiang Xie, Fang Fang et al.
Preference Optimization via Contrastive Divergence: Your Policy Is Secretly an NLL Estimator
Zhuotong Chen, Fang Liu, Xuan Zhu et al.
AMaPO: Adaptive Margin-attached Preference Optimization for Language Model Alignment
Ruibo Deng, Duanyu Feng, Wenqiang Lei
DETONATE – A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization
Renjith Prasad Kaippilly Mana, Abhilekh Borah, Hasnat Md Abdullah et al.