CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Sagar Patel; Sangeetha Abdu Jyothi; Nina Narodytska

2024 AAAI AAAI 2024

CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Abstract

Abstract We present CrystalBox, a novel, model-agnostic, posthoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. We further illustrate its higher utility across three practical use cases: contrastive explanations, network observability, and guided reward design, as opposed to prior explainability techniques that identify salient features.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Reinforcement Learning

🧭 Keyword Pioneer — future-based explanation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sagar Patel , Sangeetha Abdu Jyothi , Nina Narodytska

Topics

Artificial Intelligence > Core AI > Interpretability Deep Learning > Architectures > Neural Networks Reinforcement Learning > Methods > Deep RL Deep Learning > Learning Types > Reinforcement Learning

Keywords

deep reinforcement learning feature attribution reward decomposition contrastive explanation model-agnostic explanation future-based explanation input-driven environment posthoc explainability decomposed return

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024