Fast and Accurate Factual Inconsistency Detection Over Long Documents

Barrett Lattimer; Patrick H. Chen; Xinyuan Zhang; Yi Yang

2023 EMNLP EMNLP 2023

Fast and Accurate Factual Inconsistency Detection Over Long Documents

Abstract

AbstractGenerative AI models exhibit remarkable potential; however, hallucinations across various tasks present a significant challenge, particularly for longer inputs that current approaches struggle to address effectively. We introduce SCALE (Source Chunking Approach for Large-scale inconsistency Evaluation), a task-agnostic model for detecting factual inconsistencies using a novel chunking strategy. Specifically, SCALE is a Natural Language Inference (NLI) based model that uses large text chunks to condition over long texts. This approach achieves state-of-the-art performance in factual inconsistency detection for diverse tasks and long inputs. Additionally, we leverage the chunking mechanism and employ a novel algorithm to explain SCALE’s decisions through relevant source sentence retrieval. Our evaluations reveal that SCALE outperforms existing methods on both standard benchmarks and a new long-form dialogue dataset ScreenEval we constructed. Moreover, SCALE surpasses competitive systems in efficiency and model explanation evaluations. We have released our code and data publicly to GitHub.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — source sentence retrieval

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Barrett Lattimer , Patrick H. Chen , Xinyuan Zhang , Yi Yang

Topics

Machine Learning > Core Methods > Classification Natural Language Processing > Applications > Fact-Checking Natural Language Processing > Applications > Question Answering Natural Language Processing > Applications > Natural Language Inference Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Learning Types > Retrieval-Augmented Generation

Keywords

natural language inference hallucination detection text chunking long document factual inconsistency detection source sentence retrieval

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023