Personalized LLM Decoding via Contrasting Personal Preference

Hyungjune Bu; ChanJoo Jung; Minjae Kang; Jaehyung Kim

2025 EMNLP EMNLP 2025

Personalized LLM Decoding via Contrasting Personal Preference

Abstract

AbstractAs large language models (LLMs) are progressively deployed in various real-world applications, personalization of LLMs has become increasingly important. While various approaches to LLM personalization such as prompt-based and training-based methods have been actively explored, the development of effective decoding-time algorithms remains largely overlooked, despite their demonstrated potential. In this paper, we propose Contrasting Personal Preference (CoPe), a novel decoding-time approach applied after performing parameter-efficient fine-tuning (PEFT) on user-specific data. Our core idea is to leverage reward-guided decoding specifically for personalization by maximizing each user’s implicit reward signal. We evaluate CoPe across five open-ended personalized text generation tasks. Our empirical results demonstrate that CoPe achieves strong performance, improving personalization by an average of 10.57% in ROUGE-L without relying on external reward models or additional training procedures.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — reward-guided decoding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hyungjune Bu , ChanJoo Jung , Minjae Kang , Jaehyung Kim

Topics

Artificial Intelligence > Core AI > Human-AI Interaction Natural Language Processing > Generation > Text Generation Artificial Intelligence > Core AI > Large Language Models Machine Learning > Learning Types > Preference Learning Artificial Intelligence > Core AI > Natural Language Generation

Keywords

preference learning text generation parameter-efficient fine-tuning user preference large language model decoding time reward-guided decoding parameter-efficient fine tuning personalized decoding

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025