Reassess Summary Factual Inconsistency Detection with Large Language Model

Jiuding Yang; Hui LIU; Weidong Guo; Zhuwei Rao; Yu Xu; Di Niu

2024 ACL ACL 2024

Reassess Summary Factual Inconsistency Detection with Large Language Model

Abstract

AbstractEnsuring factual consistency between the summary and the original document is paramount in summarization tasks. Consequently, considerable effort has been dedicated to detecting inconsistencies. With the advent of Large Language Models (LLMs), recent studies have begun to leverage their advanced language understanding capabilities for inconsistency detection. However, early attempts have shown that LLMs underperform traditional models due to their limited ability to follow instructions and the absence of an effective detection methodology. In this study, we reassess summary inconsistency detection with LLMs, comparing the performances of GPT-3.5 and GPT-4. To advance research in LLM-based inconsistency detection, we propose SIFiD (Summary Inconsistency Detection with Filtered Document) that identify key sentences within documents by either employing natural language inference or measuring semantic similarity between summaries and documents.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jiuding Yang , Hui LIU , Weidong Guo , Zhuwei Rao , Yu Xu , Di Niu

Topics

Natural Language Processing > Generation > Summarization Natural Language Processing > Applications > Fact-Checking Natural Language Processing > Applications > Summarization Natural Language Processing > Understanding > Natural Language Inference Artificial Intelligence > Core AI > Natural Language Processing Machine Learning > Learning Types > Natural Language Inference

Keywords

natural language inference semantic similarity factual consistency factual inconsistency large language model inconsistency detection

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024