A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia

Giovanni Monea; Maxime Peyrard; Martin Josifoski; Vishrav Chaudhary; Jason Eisner; Emre Kiciman; Hamid Palangi; Barun Patra; Robert West

2024 ACL ACL 2024

A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia

Abstract

AbstractLarge language models (LLMs) have an impressive ability to draw on novel information supplied in their context. Yet the mechanisms underlying this contextual grounding remain unknown, especially in situations where contextual information contradicts factual knowledge stored in the parameters, which LLMs also excel at recalling. Favoring the contextual information is critical for retrieval-augmented generation methods, which enrich the context with up-to-date information, hoping that grounding can rectify outdated or noisy stored knowledge. We present a novel method to study grounding abilities using Fakepedia, a novel dataset of counterfactual texts constructed to clash with a model’s internal parametric knowledge. In this study, we introduce Fakepedia, a counterfactual dataset designed to evaluate grounding abilities when the internal parametric knowledge clashes with the contextual information. We benchmark various LLMs with Fakepedia and conduct a causal mediation analysis of LLM components when answering Fakepedia queries, based on our Masked Grouped Causal Tracing (MGCT) method. Through this analysis, we identify distinct computational patterns between grounded and ungrounded responses. We finally demonstrate that distinguishing grounded from ungrounded responses is achievable through computational analysis alone. Our results, together with existing findings about factual recall mechanisms, provide a coherent narrative of how grounding and factual recall mechanisms interact within LLMs.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing

🧭 Keyword Pioneer — contextual grounding

🐣 Hot Topic Early Bird — retrieval-augmented generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Giovanni Monea , Maxime Peyrard , Martin Josifoski , Vishrav Chaudhary , Jason Eisner , Emre Kiciman , Hamid Palangi , Barun Patra , Robert West

Topics

Artificial Intelligence > Core AI > Interpretability Natural Language Processing > Generation > Language Modeling Natural Language Processing > Resources & Methods > Large Language Models Machine Learning > Learning Types > Representation Learning Artificial Intelligence > Core AI > Large Language Models Deep Learning > Models > Large Language Models

Keywords

retrieval-augmented generation contextual grounding factual recall causal mediation parametric knowledge causal tracing large language model

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024