Artificial Intelligence › Core AI ›

Interpretability

7318 directly classified papers

Papers per year

Papers

Evaluating Humanities Theory Alignment in Large Language Models: Incremental Prompting and Statistical Assessment EACL 2026

Beyond the Token: Correcting the Tokenization Bias in XAI via Morphologically-Aligned Projection EACL 2026

Pedagogic Applications of Argument Maps to Enhance Critical Thinking: Thought Seeds, Argument Mapping, Collaborative Mapping EACL 2026

HalluZig: Hallucination Detection using Zigzag Persistence EACL 2026

Revealing Redundant Syntax in Large Language Models through Multi-Hop Dependency Paths EACL 2026

The Curse of Verbalization: How Presentation Order Constrains LLM Reasoning EACL 2026

Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework EACL 2026

Hallucinations at the Firewall AAAI 2026

Understanding the Management of Rape Trauma with AI and Neuroimaging AAAI 2026

Persona, Ego, Shadow, and Self: A Map of the Soul Framework for Proto-Emotional Homeostasis in AI AAAI 2026

Controllable Epistemic Sensitivity in Large Language Models: Probing, Benchmarking, and Adaptive Reasoning AAAI 2026

OMEGA: An Ontology-Driven Tool for Explaining Multi-Agent Path Finding AAAI 2026

KnowThyself: An Agentic Assistant for LLM Interpretability AAAI 2026

AgentSeer: Visualizing and Evaluating Temporal Actions in Agentic AI Systems AAAI 2026

CLEAR: Error Analysis via LLM-as-a-Judge Made Easy AAAI 2026

Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA AAAI 2026

AD2: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems WACV 2026

Out of Distribution, Out of Luck: Process Rewards Misguide Reasoning Models EACL 2026

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models EACL 2026

Detecting (Un)answerability in Large Language Models with Linear Directions EACL 2026

SearchLLM: Detecting LLM Paraphrased Text by Measuring the Similarity with Regeneration of the Candidate Source via Search Engine EACL 2026

Tug-of-war between idioms’ figurative and literal interpretations in LLMs EACL 2026

Evidential Semantic Entropy for LLM Uncertainty Quantification EACL 2026

Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies EACL 2026

From Detection to Explanation: Modeling Fine-Grained Emotional Social Influence Techniques with LLMs and Human Preferences EACL 2026