Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core Methods
Machine Learning
›
Core Methods
›
Interpretability
349 directly classified papers
Papers per year
2008: 1
2014: 1
2015: 2
2016: 4
2017: 4
2018: 10
2019: 29
2020: 41
2021: 40
2022: 65
2023: 55
2024: 56
2025: 41
Papers
Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods
ACL 2023
Sample based Explanations via Generalized Representers
NIPS 2023
Black-box language model explanation by context length probing
ACL 2023
Class based Influence Functions for Error Detection
ACL 2023
Explaining How Transformers Use Context to Build Predictions
ACL 2023
Efficient Shapley Values Estimation by Amortization for Text Classification
ACL 2023
COCKATIEL: COntinuous Concept ranKed ATtribution with Interpretable ELements for explaining neural net classifiers on NLP
ACL 2023
On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach
ACL 2023
Labeling Neural Representations with Inverse Recognition
NIPS 2023
Unsupervised Selective Rationalization with Noise Injection
ACL 2023
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources
ACL 2023
Log-linear Guardedness and its Implications
ACL 2023
CREST: A Joint Framework for Rationalization and Counterfactual Text Generation
ACL 2023
Contrastive Learning with Adversarial Examples for Alleviating Pathology of Language Model
ACL 2023
Trade-off Between Efficiency and Consistency for Removal-based Explanations
NIPS 2023
Probing Representations for Document-level Event Extraction
EMNLP 2023
Rethinking the Construction of Effective Metrics for Understanding the Mechanisms of Pretrained Language Models
EMNLP 2023
REV: Information-Theoretic Evaluation of Free-Text Rationales
ACL 2023
GaitGCI: Generative Counterfactual Intervention for Gait Recognition
CVPR 2023
SketchXAI: A First Look at Explainability for Human Sketches
CVPR 2023
The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization
AAAI 2023
Accountability Layers: Explaining Complex System Failures by Parts
AAAI 2023
Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap
AAAI 2023
Testing the Channels of Convolutional Neural Networks
AAAI 2023
IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding
EMNLP 2023
<
1
…
4
5
6
…
14
>