Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Learning Types
Machine Learning
›
Learning Types
›
Preference Learning
102 directly classified papers
Papers per year
2007: 1
2008: 1
2009: 1
2011: 2
2012: 1
2013: 1
2014: 2
2015: 1
2016: 1
2018: 1
2019: 1
2020: 4
2021: 5
2022: 4
2023: 9
2024: 21
2025: 46
Papers
GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization
ACL 2025
GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis
ACL 2025
MAPLE: A Framework for Active Preference Learning Guided by Large Language Models
AAAI 2025
M-RewardBench: Evaluating Reward Models in Multilingual Settings
ACL 2025
Aligning Large Language Models with Implicit Preferences from User-Generated Content
ACL 2025
Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization
AAAI 2025
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models
AAAI 2025
Instantly Learning Preference Alignment via In-context DPO
NAACL 2025
Tuning Less, Prompting More: In-Context Preference Learning Pipeline for Natural Language Transformation
EMNLP 2025
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
AAAI 2025
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
ACL 2025
Spatial Preference Rewarding for MLLMs Spatial Understanding
ICCV 2025
Rethinking DPO-style Diffusion Aligning Frameworks
ICCV 2025
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
AAAI 2025
Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing
ACL 2025
All That Glitters is Not Gold: Improving Robust Retrieval-Augmented Language Models with Fact-Centric Preference Alignment
ACL 2025
MWPO: Enhancing LLMs Performance through Multi-Weight Preference Strength and Length Optimization
ACL 2025
AbsVis – Benchmarking How Humans and Vision-Language Models “See” Abstract Concepts in Images
EMNLP 2025
Optimising Factual Consistency in Summarisation via Preference Learning from Multiple Imperfect Metrics
EMNLP 2025
Boost Your Human Image Generation Model via Direct Preference Optimization
CVPR 2025
Constrained Preferential Bayesian Optimization and Its Application in Banner Ad Design
IJCAI 2025
Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning
AAAI 2025
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
ICCV 2025
WEPO: Web Element Preference Optimization for LLM-based Web Navigation
AAAI 2025
Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
ACL 2025
<
1
2
3
4
5
>