Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective

Shenglai Zeng; Jiankun Zhang; Bingheng Li; Yuping Lin; Tianqi Zheng; Dante Everaert; Hanqing Lu; Hui LIU; Yue Xing; Monica Xiao Cheng; Jiliang Tang

2025 NAACL NAACL 2025

Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective

Abstract

AbstractRetrieval-Augmented Generation (RAG) systems have shown promise in enhancing the performance of Large Language Models (LLMs). However, these systems face challenges in effectively integrating external knowledge with the LLM’s internal knowledge, often leading to issues with misleading or unhelpful information. This work aims to provide a systematic study on knowledge checking in RAG systems. We conduct a comprehensive analysis of LLM representation behaviors and demonstrate the significance of using representations in knowledge checking. Motivated by the findings, we further develop representation-based classifiers for knowledge filtering. We show substantial improvements in RAG performance, even when dealing with noisy knowledge databases. Our study provides new insights into leveraging LLM representations for enhancing the reliability and effectiveness of RAG systems.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — knowledge checking

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shenglai Zeng , Jiankun Zhang , Bingheng Li , Yuping Lin , Tianqi Zheng , Dante Everaert , Hanqing Lu , Hui LIU , Yue Xing , Monica Xiao Cheng , Jiliang Tang

Topics

Natural Language Processing > Applications > Fact-Checking Natural Language Processing > Resources & Methods > Knowledge Editing Natural Language Processing > Resources & Methods > Large Language Models Machine Learning > Learning Types > Representation Learning Artificial Intelligence > Core AI > Knowledge Representation Natural Language Processing > Resources & Methods > Retrieval-Augmented Generation

Keywords

representation learning natural language inference retrieval-augmented generation knowledge filtering large language model knowledge checking

Download PDF

Few-shot Personalization of LLMs with Mis-aligned Responses 2025

NLI under the Microscope: What Atomic Hypothesis Decomposition Reveals 2025

Understanding Figurative Meaning through Explainable Visual Entailment 2025

CogLM: Tracking Cognitive Development of Large Language Models 2025

Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective

Abstract

Authors

Topics

Keywords

Related papers