Reinforcement Learning of Causal Variables Using Mediation Analysis

Tue Herlau; Rasmus Larsen

2022 AAAI AAAI 2022

Reinforcement Learning of Causal Variables Using Mediation Analysis

Abstract

Abstract We consider the problem of acquiring causal representations and concepts in a reinforcement learning setting. Our approach defines a causal variable as being both manipulable by a policy, and able to predict the outcome. We thereby obtain a parsimonious causal graph in which interventions occur at the level of policies. The approach avoids defining a generative model of the data, prior pre-processing, or learning the transition kernel of the Markov decision process. Instead, causal variables and policies are determined by maximizing a new optimization target inspired by mediation analysis, which differs from the expected return. The maximization is accomplished using a generalization of Bellman's equation which is shown to converge, and the method finds meaningful causal representations in a simulated environment.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Knowledge & Reasoning and Machine Learning

🧭 Keyword Pioneer — causal variable

🐣 Hot Topic Early Bird — causal graph

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Tue Herlau , Rasmus Larsen

Topics

Artificial Intelligence > Core AI > Causal Inference Knowledge & Reasoning > Reasoning > Causal Inference Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Causal Inference

Keywords

causal inference markov decision process bellman equation causal graph causal representation mediation analysis causal variable

Download PDF

Related papers

Dynamic Spatial Propagation Network for Depth Completion 2022

FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition 2022

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding 2022

AnchorFace: Boosting TAR@FAR for Practical Face Recognition 2022

Parallel and High-Fidelity Text-to-Lip Generation 2022