Context-DPO: Aligning Language Models for Context-Faithfulness

Baolong Bi; Shaohan Huang; Yiwei Wang; Tianchi Yang; Zihan Zhang; Haizhen Huang; Lingrui Mei; Junfeng Fang; Zehao Li; Furu Wei; Weiwei Deng; Feng Sun; Qi Zhang; Shenghua Liu

ACL 2025

Abstract

Reliable responses from large language models (LLMs) require adherence to user instructions and retrieved information. While alignment techniques help LLMs align with human intentions and values, improving context-faithfulness through alignment remains underexplored. To address this, we propose Context-DPO, the first alignment method specifically designed to enhance LLMs’ context-faithfulness. We introduce ConFiQA, a benchmark that simulates Retrieval-Augmented Generation (RAG) scenarios with knowledge conflicts to evaluate context-faithfulness. By leveraging faithful and stubborn responses to questions with provided context from ConFiQA, our Context-DPO aligns LLMs through direct preference optimization. Extensive experiments demonstrate that our Context-DPO significantly improves context-faithfulness, achieving 35% to 280% improvements on popular open-source models. Further analysis demonstrates that Context-DPO preserves LLMs’ generative capabilities while providing interpretable insights into context utilization.
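The abstract does not spell out the training objective, so the following is only a minimal sketch of the standard direct preference optimization loss for a single preference pair, under the assumption that the faithful response plays the role of the preferred answer and the stubborn response the rejected one. The function names, argument names, and the `beta` value are hypothetical and chosen for illustration; each log-probability is assumed to be the summed token log-probability of a response given the context and question.

```python
import math


def log_sigmoid(x: float) -> float:
    # Numerically stable log(sigmoid(x)) for either sign of x.
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_faithful: float,
    policy_stubborn: float,
    ref_faithful: float,
    ref_stubborn: float,
    beta: float = 0.1,  # assumed value; not taken from the paper
) -> float:
    """Standard DPO loss for one (faithful, stubborn) response pair.

    Inputs are log-probabilities of each full response under the policy
    being trained and under a frozen reference model.
    """
    # How much more the policy likes each response than the reference does.
    faithful_margin = policy_faithful - ref_faithful
    stubborn_margin = policy_stubborn - ref_stubborn
    # The loss shrinks as the policy prefers the faithful response more
    # strongly than the reference model does.
    return -log_sigmoid(beta * (faithful_margin - stubborn_margin))


# Policy identical to the reference: loss equals log(2).
baseline = dpo_loss(-5.0, -7.0, -5.0, -7.0)
# Policy shifted toward the faithful response: loss falls below log(2).
shifted = dpo_loss(-4.0, -8.0, -5.0, -7.0)
```

In this sketch, a policy that matches the reference gives a loss of log 2, and any shift of probability mass from the stubborn response toward the faithful one lowers it. A real training loop would compute the log-probabilities with a language model and average the loss over a batch.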

Authors

Baolong Bi, Shaohan Huang, Yiwei Wang, Tianchi Yang, Zihan Zhang, Haizhen Huang, Lingrui Mei, Junfeng Fang, Zehao Li, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang, Shenghua Liu

Topics

Artificial Intelligence > Core AI > Foundation Models
Natural Language Processing > Resources & Methods > Large Language Models

Keywords

direct preference optimization, language model alignment, retrieval-augmented generation, knowledge conflict, context faithfulness

Related papers

Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights 2025

CodeTool: Enhancing Programmatic Tool Invocation of LLMs via Process Supervision 2025

Structural Deep Encoding for Table Question Answering 2025

Vision-aided Unsupervised Constituency Parsing with Multi-MLLM Debating 2025
