Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Interpretability
7318 directly classified papers
Papers per year
2003: 1
2006: 1
2007: 1
2008: 1
2009: 1
2010: 5
2012: 2
2013: 10
2014: 7
2015: 14
2016: 27
2017: 84
2018: 196
2019: 395
2020: 488
2021: 771
2022: 823
2023: 954
2024: 1360
2025: 1713
2026: 464
Papers
Evaluating Humanities Theory Alignment in Large Language Models: Incremental Prompting and Statistical Assessment
EACL 2026
Beyond the Token: Correcting the Tokenization Bias in XAI via Morphologically-Aligned Projection
EACL 2026
Pedagogic Applications of Argument Maps to Enhance Critical Thinking: Thought Seeds, Argument Mapping, Collaborative Mapping
EACL 2026
HalluZig: Hallucination Detection using Zigzag Persistence
EACL 2026
Revealing Redundant Syntax in Large Language Models through Multi-Hop Dependency Paths
EACL 2026
The Curse of Verbalization: How Presentation Order Constrains LLM Reasoning
EACL 2026
Say It Another Way: Auditing LLMs with a User-Grounded Automated Paraphrasing Framework
EACL 2026
Hallucinations at the Firewall
AAAI 2026
Understanding the Management of Rape Trauma with AI and Neuroimaging
AAAI 2026
Persona, Ego, Shadow, and Self: A Map of the Soul Framework for Proto-Emotional Homeostasis in AI
AAAI 2026
Controllable Epistemic Sensitivity in Large Language Models: Probing, Benchmarking, and Adaptive Reasoning
AAAI 2026
OMEGA: An Ontology-Driven Tool for Explaining Multi-Agent Path Finding
AAAI 2026
KnowThyself: An Agentic Assistant for LLM Interpretability
AAAI 2026
AgentSeer: Visualizing and Evaluating Temporal Actions in Agentic AI Systems
AAAI 2026
CLEAR: Error Analysis via LLM-as-a-Judge Made Easy
AAAI 2026
Beyond the Black Box: Demystifying Multi-Turn LLM Reasoning with VISTA
AAAI 2026
AD2: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems
WACV 2026
Out of Distribution, Out of Luck: Process Rewards Misguide Reasoning Models
EACL 2026
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
EACL 2026
Detecting (Un)answerability in Large Language Models with Linear Directions
EACL 2026
SearchLLM: Detecting LLM Paraphrased Text by Measuring the Similarity with Regeneration of the Candidate Source via Search Engine
EACL 2026
Tug-of-war between idioms’ figurative and literal interpretations in LLMs
EACL 2026
Evidential Semantic Entropy for LLM Uncertainty Quantification
EACL 2026
Diagnosing Vision Language Models’ Perception by Leveraging Human Methods for Color Vision Deficiencies
EACL 2026
From Detection to Explanation: Modeling Fine-Grained Emotional Social Influence Techniques with LLMs and Human Preferences
EACL 2026
<
1
…
4
5
6
…
293
>