← Core Methods

Machine Learning › Core Methods ›

Interpretability

349 directly classified papers

Papers per year

Papers

Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods ACL 2023

Sample based Explanations via Generalized Representers NIPS 2023

Black-box language model explanation by context length probing ACL 2023

Class based Influence Functions for Error Detection ACL 2023

Explaining How Transformers Use Context to Build Predictions ACL 2023

Efficient Shapley Values Estimation by Amortization for Text Classification ACL 2023

COCKATIEL: COntinuous Concept ranKed ATtribution with Interpretable ELements for explaining neural net classifiers on NLP ACL 2023

On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach ACL 2023

Labeling Neural Representations with Inverse Recognition NIPS 2023

Unsupervised Selective Rationalization with Noise Injection ACL 2023

The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources ACL 2023

Log-linear Guardedness and its Implications ACL 2023

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation ACL 2023

Contrastive Learning with Adversarial Examples for Alleviating Pathology of Language Model ACL 2023

Trade-off Between Efficiency and Consistency for Removal-based Explanations NIPS 2023

Probing Representations for Document-level Event Extraction EMNLP 2023

Rethinking the Construction of Effective Metrics for Understanding the Mechanisms of Pretrained Language Models EMNLP 2023

REV: Information-Theoretic Evaluation of Free-Text Rationales ACL 2023

GaitGCI: Generative Counterfactual Intervention for Gait Recognition CVPR 2023

SketchXAI: A First Look at Explainability for Human Sketches CVPR 2023

The Analysis of Deep Neural Networks by Information Theory: From Explainability to Generalization AAAI 2023

Accountability Layers: Explaining Complex System Failures by Parts AAAI 2023

Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap AAAI 2023

Testing the Channels of Convolutional Neural Networks AAAI 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding EMNLP 2023