← Learning Types

Machine Learning › Learning Types ›

Preference Learning

102 directly classified papers

Papers per year

Papers

GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization ACL 2025

GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis ACL 2025

MAPLE: A Framework for Active Preference Learning Guided by Large Language Models AAAI 2025

M-RewardBench: Evaluating Reward Models in Multilingual Settings ACL 2025

Aligning Large Language Models with Implicit Preferences from User-Generated Content ACL 2025

Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization AAAI 2025

Preference-Oriented Supervised Fine-Tuning: Favoring Target Model over Aligned Large Language Models AAAI 2025

Instantly Learning Preference Alignment via In-context DPO NAACL 2025

Tuning Less, Prompting More: In-Context Preference Learning Pipeline for Natural Language Transformation EMNLP 2025

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning AAAI 2025

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback ACL 2025

Spatial Preference Rewarding for MLLMs Spatial Understanding ICCV 2025

Rethinking DPO-style Diffusion Aligning Frameworks ICCV 2025

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies AAAI 2025

Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing ACL 2025

All That Glitters is Not Gold: Improving Robust Retrieval-Augmented Language Models with Fact-Centric Preference Alignment ACL 2025

MWPO: Enhancing LLMs Performance through Multi-Weight Preference Strength and Length Optimization ACL 2025

AbsVis – Benchmarking How Humans and Vision-Language Models “See” Abstract Concepts in Images EMNLP 2025

Optimising Factual Consistency in Summarisation via Preference Learning from Multiple Imperfect Metrics EMNLP 2025

Boost Your Human Image Generation Model via Direct Preference Optimization CVPR 2025

Constrained Preferential Bayesian Optimization and Its Application in Banner Ad Design IJCAI 2025

Less Is More: Adaptive Program Repair with Bug Localization and Preference Learning AAAI 2025

ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization ICCV 2025

WEPO: Web Element Preference Optimization for LLM-based Web Navigation AAAI 2025

Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG ACL 2025