Causal Bandits: Learning Good Interventions via Causal Inference

Finnian Lattimore; Tor Lattimore; Mark D. Reid

2016 NIPS NeurIPS 2016

Causal Bandits: Learning Good Interventions via Causal Inference

Abstract

We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit feedback that is not exploited by existing approaches. We propose a new algorithm that exploits the causal feedback and prove a bound on its simple regret that is strictly better (in all quantities) than algorithms that do not use the additional causal information.

🌉 Interdisciplinary Bridge — Knowledge & Reasoning and Machine Learning

🧭 Keyword Pioneer — causal bandit

🐣 Hot Topic Early Bird — causal inference

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Finnian Lattimore , Tor Lattimore , Mark D. Reid

Topics

Machine Learning > Optimization & Theory > Learning Theory Knowledge & Reasoning > Reasoning > Causal Inference Machine Learning > Learning Types > Multi-Armed Bandits Machine Learning > Learning Types > Causal Inference

Keywords

causal inference online learning multi-armed bandit regret bound causal model causal bandit simple regret

Download PDF

Related papers

Bayesian Intermittent Demand Forecasting for Large Inventories 2016

Dynamic Network Surgery for Efficient DNNs 2016

Beyond Exchangeability: The Chinese Voting Process 2016

Safe and Efficient Off-Policy Reinforcement Learning 2016

Tagger: Deep Unsupervised Perceptual Grouping 2016