Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core Methods
Machine Learning
›
Core Methods
›
Interpretability
349 directly classified papers
Papers per year
2008: 1
2014: 1
2015: 2
2016: 4
2017: 4
2018: 10
2019: 29
2020: 41
2021: 40
2022: 65
2023: 55
2024: 56
2025: 41
Papers
Shortcomings of Top-Down Randomization-Based Sanity Checks for Evaluations of Deep Neural Network Explanations
CVPR 2023
Defining and Quantifying the Emergence of Sparse Concepts in DNNs
CVPR 2023
AttCAT: Explaining Transformers via Attentive Class Activation Tokens
NIPS 2022
Are representations built from the ground up? An empirical examination of local composition in language models
EMNLP 2022
Deformable ProtoPNet: An Interpretable Image Classifier Using Deformable Prototypes
CVPR 2022
Explainable Metaphor Identification Inspired by Conceptual Metaphor Theory
AAAI 2022
Fast and More Powerful Selective Inference for Sparse High-Order Interaction Model
AAAI 2022
Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation
EMNLP 2022
Tracing and Manipulating intermediate values in Neural Math Problem Solvers
EMNLP 2022
Probing for targeted syntactic knowledge through grammatical error detection
EMNLP 2022
Identifying the Source of Vulnerability in Explanation Discrepancy: A Case Study in Neural Text Classification
EMNLP 2022
Influence Functions for Sequence Tagging Models
EMNLP 2022
Acceptability Judgements via Examining the Topology of Attention Maps
EMNLP 2022
Do Feature Attribution Methods Correctly Attribute Features?
AAAI 2022
ProtGNN: Towards Self-Explaining Graph Neural Networks
AAAI 2022
Rethinking Influence Functions of Neural Networks in the Over-Parameterized Regime
AAAI 2022
Constraint-Driven Explanations for Black-Box ML Models
AAAI 2022
Entropy-Based Logic Explanations of Neural Networks
AAAI 2022
Sufficient Reasons for Classifier Decisions in the Presence of Domain Constraints
AAAI 2022
FOCUS: Flexible Optimizable Counterfactual Explanations for Tree Ensembles
AAAI 2022
Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations
AAAI 2022
How to Find a Good Explanation for Clustering?
AAAI 2022
Optimizing Binary Decision Diagrams with MaxSAT for Classification
AAAI 2022
An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks
AAAI 2022
Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
AAAI 2022
<
1
…
6
7
8
…
14
>