Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization

Lei Huang; Xiaocheng Feng; Weitao Ma; Yuchun Fan; Xiachong Feng; Yangfan Ye; Weihong Zhong; Yuxuan Gu; Baoxin Wang; Dayong Wu; Guoping Hu; Bing Qin

2025 ACL ACL 2025

Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization

Abstract

AbstractEnsuring contextual faithfulness in retrieval-augmented large language models (LLMs) is crucial for building trustworthy information-seeking systems, particularly in long-form question-answering (LFQA) scenarios. In this work, we identify a salient correlation between LFQA faithfulness and retrieval heads, a set of attention heads responsible for retrieving contextual information. Leveraging this insight, we propose RHIO, a framework designed to teach LLMs to explicitly discriminate between faithful and unfaithful generations. RHIO first augments unfaithful samples that simulate realistic model-intrinsic errors by selectively masking retrieval heads. Then, these samples are incorporated into joint training, enabling the model to distinguish unfaithful outputs from faithful ones conditioned on control tokens. Furthermore, these control tokens are leveraged to self-induce contrastive outputs, amplifying their difference through contrastive decoding. Additionally, to facilitate the evaluation of contextual faithfulness, we also introduce GroundBench, a comprehensive benchmark compiled from five existing LFQA datasets. Extensive experimental results on GroundBench demonstrate that RHIO significantly improves faithfulness, even outperforming GPT-4o.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — long-form question answering

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Knowledge & Reasoning, Machine Learning, Natural Language Processing, Reinforcement Learning

Authors

Lei Huang , Xiaocheng Feng , Weitao Ma , Yuchun Fan , Xiachong Feng , Yangfan Ye , Weihong Zhong , Yuxuan Gu , Baoxin Wang , Dayong Wu , Guoping Hu , Bing Qin

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Learning Types > Contrastive Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Optimization & Theory > Stochastic Processes Natural Language Processing > Applications > Question Answering Artificial Intelligence > Core AI > Large Language Models Machine Learning > Learning Types > Retrieval-Augmented Generation Deep Learning > Learning Types > Retrieval-Augmented Generation

Keywords

contrastive learning question answering attention head retrieval-augmented generation faithfulness evaluation contrastive decoding long-form question answering retrieval head control token contextual faithfulness

Download PDF

Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights 2025

CodeTool: Enhancing Programmatic Tool Invocation of LLMs via Process Supervision 2025

Structural Deep Encoding for Table Question Answering 2025

Vision-aided Unsupervised Constituency Parsing with Multi-MLLM Debating 2025

Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization

Abstract

Authors

Topics

Keywords

Related papers