Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output

Hithesh Sankararaman; Mohammed Nasheed Yasin; Tanner Sorensen; Alessandro Di Bari; Andreas Stolcke

2024 EMNLP EMNLP 2024

Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output

Abstract

AbstractWe present a light-weight approach for detecting nonfactual outputs from retrieval-augemented generation (RAG). Given a context and putative output, we compute a factuality score that can be thresholded to yield a binary decision to check the results of LLM-based question-answering, summarization, or other systems. Unlike factuality checkers that themselves rely on LLMs, we use compact, open-source natural language inference (NLI) models that yield a freely accessible solution with low latency and low cost at run-time, and no need for LLM fine-tuning. The approach also enables downstream mitigation and correction of hallucinations, by tracing them back to specific context chunks. Our experiments show high ROC-AUC across a wide range of relevant open source datasets, indicating the effectiveness of our method for fact-checking RAG output.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hithesh Sankararaman , Mohammed Nasheed Yasin , Tanner Sorensen , Alessandro Di Bari , Andreas Stolcke

Topics

Artificial Intelligence > Core AI > Interpretability Natural Language Processing > Applications > Fact-Checking Natural Language Processing > Applications > Information Retrieval Machine Learning > Learning Types > Retrieval-Augmented Generation Artificial Intelligence > Core AI > Information Retrieval Deep Learning > Learning Types > Retrieval-Augmented Generation

Keywords

natural language inference question answering retrieval-augmented generation hallucination detection factuality detection

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024