Cardinality-Minimal Explanations for Monotonic Neural Networks

Ouns El Harzli; Bernardo Cuenca Grau; Ian Horrocks

2023 IJCAI IJCAI 2023

Cardinality-Minimal Explanations for Monotonic Neural Networks

Abstract

In recent years, there has been increasing interest in explanation methods for neural model predictions that offer precise formal guarantees. These include abductive (respectively, contrastive) methods, which aim to compute minimal subsets of input features that are sufficient for a given prediction to hold (respectively, to change a given prediction). The corresponding decision problems are, however, known to be intractable. In this paper, we investigate whether tractability can be regained by focusing on neural models implementing a monotonic function. Although the relevant decision problems remain intractable, we can show that they become solvable in polynomial time by means of greedy algorithms if we additionally assume that the activation functions are continuous everywhere and differentiable almost everywhere. Our experiments suggest favourable performance of our algorithms.

🧭 Keyword Pioneer — abductive explanation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Deep Learning, Machine Learning, Natural Language Processing, Reinforcement Learning

Authors

Ouns El Harzli , Bernardo Cuenca Grau , Ian Horrocks

Topics

Artificial Intelligence > Core AI > Interpretability

Keywords

neural network interpretability feature attribution greedy algorithm formal guarantee abductive reasoning abductive explanation monotonic neural network monotonic function explanation method neural network

Download PDF

Related papers

Analyzing Intentional Behavior in Autonomous Agents under Uncertainty 2023

Deep Hashing-based Dynamic Stock Correlation Estimation via Normalizing Flow 2023

U-Match: Two-view Correspondence Learning with Hierarchy-aware Local Context Aggregation 2023

Artificial Agents Inspired by Human Motivation Psychology for Teamwork in Hazardous Environments 2023

Proportionally Fair Online Allocation of Public Goods with Predictions 2023