Preference Learning
102 directly classified papers
Papers per year
2007: 1
2008: 1
2009: 1
2011: 2
2012: 1
2013: 1
2014: 2
2015: 1
2016: 1
2018: 1
2019: 1
2020: 4
2021: 5
2022: 4
2023: 9
2024: 21
2025: 46
Papers
EAGLE: Expert-Guided Self-Enhancement for Preference Alignment in Pathology Large Vision-Language Model
ACL 2025
Rethinking DPO-style Diffusion Aligning Frameworks
ICCV 2025
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
ACL 2025
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
AAAI 2025
IPO: Your Language Model is Secretly a Preference Classifier
ACL 2025
Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing
ACL 2025
Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment
ACL 2025
All That Glitters is Not Gold: Improving Robust Retrieval-Augmented Language Models with Fact-Centric Preference Alignment
ACL 2025
CheXalign: Preference fine-tuning in chest X-ray interpretation models without human feedback
ACL 2025
Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning
AAAI 2025
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
AAAI 2025
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models
AAAI 2025
Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization
AAAI 2025
WEPO: Web Element Preference Optimization for LLM-based Web Navigation
AAAI 2025
MAPLE: A Framework for Active Preference Learning Guided by Large Language Models
AAAI 2025
Boost Your Human Image Generation Model via Direct Preference Optimization
CVPR 2025
Tuning Less, Prompting More: In-Context Preference Learning Pipeline for Natural Language Transformation
EMNLP 2025
Constrained Preferential Bayesian Optimization and Its Application in Banner Ad Design
IJCAI 2025
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
ACL 2025
Personal Travel Solver: A Preference-Driven LLM-Solver System for Travel Planning
ACL 2025
M-RewardBench: Evaluating Reward Models in Multilingual Settings
ACL 2025
Geometric-Averaged Preference Optimization for Soft Preference Labels
NIPS 2024
Optimal Design for Human Preference Elicitation
NIPS 2024
Perplexity-aware Correction for Robust Alignment with Noisy Preferences
NIPS 2024
ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation
ACL 2024