Unconditional Truthfulness: Learning Unconditional Uncertainty of Large Language Models

Artem Vazhentsev; Ekaterina Fadeeva; Rui Xing; Gleb Kuzmin; Ivan Lazichny; Alexander Panchenko; Preslav Nakov; Timothy Baldwin; Maxim Panov; Artem Shelmanov

EMNLP 2025

Abstract

Uncertainty quantification (UQ) has emerged as a promising approach for detecting hallucinations and low-quality output of Large Language Models (LLMs). However, obtaining proper uncertainty scores is complicated by the conditional dependency between the generation steps of an autoregressive LLM, which is hard to model explicitly. Here, we propose to learn this dependency from attention-based features. In particular, we train a regression model that leverages LLM attention maps, probabilities at the current generation step, and recurrently computed uncertainty scores from previously generated tokens. To incorporate the recurrent features, we also propose a two-stage training procedure. Our experimental evaluation on ten datasets and three LLMs shows that the proposed method is highly effective for selective generation, achieving substantial improvements over competing unsupervised and supervised approaches.
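
The recurrent scoring idea described above can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes only three features per token (the token probability, a single attention weight to the previous token, and the previous token's score), uses plain least squares as a stand-in for the regression model, and assumes one possible reading of the two-stage procedure (fit once with a proxy for the previous score, then refit on the first model's own recurrent outputs). All function names, and the per-token targets, are hypothetical.

```python
import numpy as np

def fit_linear(X, y):
    """Least-squares fit of y ~ X @ w + b; returns (w, b)."""
    A = np.hstack([X, np.ones((X.shape[0], 1))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef[:-1], coef[-1]

def recurrent_scores(probs, attn_prev, w, b):
    """Score tokens left to right, feeding each score into the next step."""
    # Assumption: the first token has no predecessor, so fall back to 1 - p.
    scores = [1.0 - probs[0]]
    for i in range(1, len(probs)):
        feats = np.array([probs[i], attn_prev[i], scores[-1]])
        scores.append(float(feats @ w + b))
    return np.array(scores)

def build_rows(seqs, prev_score_fn):
    """Stack per-token feature rows; prev_score_fn supplies the recurrent feature."""
    X, y = [], []
    for probs, attn_prev, target in seqs:
        prev = prev_score_fn(probs, attn_prev)
        for i in range(1, len(probs)):
            X.append([probs[i], attn_prev[i], prev[i - 1]])
            y.append(target[i])
    return np.array(X), np.array(y)

def train_two_stage(seqs):
    """Assumed two-stage scheme; seqs holds (probs, attn_prev, target) triples."""
    # Stage 1: approximate the previous-token score by 1 - p (assumed proxy).
    X1, y1 = build_rows(seqs, lambda p, a: 1.0 - p)
    w1, b1 = fit_linear(X1, y1)
    # Stage 2: rebuild the recurrent feature from the stage-1 model's outputs.
    X2, y2 = build_rows(seqs, lambda p, a: recurrent_scores(p, a, w1, b1))
    return fit_linear(X2, y2)
```

In a selective-generation setting, the per-token scores returned by `recurrent_scores` would then be aggregated into a sequence-level score and compared against a threshold to decide whether to return or withhold an answer; the aggregation and threshold are left out because the abstract does not specify them.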

Topics

Artificial Intelligence > Core AI > Interpretability; Machine Learning > Core Methods > Regression; Machine Learning > Optimization & Theory > Stochastic Processes; Machine Learning > Learning Types > Representation Learning; Artificial Intelligence > Core AI > Large Language Models; Machine Learning > Learning Types > Uncertainty Quantification

Keywords

uncertainty quantification; attention mechanism; autoregressive generation; autoregressive model; hallucination detection; attention map; large language model; selective generation; recurrent uncertainty
