Causal information splitting: Engineering proxy features for robustness to distribution shifts

Bijan Mazaheri; Atalanti Mastakouri; Dominik Janzing; Michaela Hardt

UAI 2023

Abstract

Statistical prediction models are often trained on data drawn from probability distributions that differ from those of their eventual use cases. One approach to proactively preparing for these shifts harnesses the intuition that causal mechanisms should remain invariant between environments. Here we focus on a challenging setting in which the causal and anticausal variables of the target are unobserved. Leaning on information theory, we develop feature selection and engineering techniques for the observed downstream variables that act as proxies. We identify proxies that help build stable models and, moreover, use auxiliary training tasks to extract stability-enhancing information from proxies. We demonstrate the effectiveness of our techniques on synthetic and real data.

Topics

Artificial Intelligence > Core AI > Causal Inference
Machine Learning > Application Areas > Domain Adaptation
Mathematics & Optimization > Mathematics > Information Theory

Keywords

information theory, causal inference, distribution shift, feature engineering, invariant mechanism, proxy feature
