2016
NIPS
NeurIPS 2016
Causal Bandits: Learning Good Interventions via Causal Inference
Abstract
We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit feedback that is not exploited by existing approaches. We propose a new algorithm that exploits the causal feedback and prove a bound on its simple regret that is strictly better (in all quantities) than algorithms that do not use the additional causal information.
🌉
Interdisciplinary Bridge
— Knowledge & Reasoning and Machine Learning
🧭
Keyword Pioneer
— causal bandit
🐣
Hot Topic Early Bird
— causal inference
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio