Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

Hengrui Cai; Chengchun Shi; Rui Song; Wenbin Lu

NeurIPS 2021

Abstract

We consider off-policy evaluation (OPE) in continuous treatment settings, such as personalized dose-finding. In OPE, one aims to estimate the mean outcome under a new treatment decision rule using historical data generated by a different decision rule. Most existing work on OPE focuses on discrete treatment settings. To handle continuous treatments, we develop a novel estimation method for OPE using deep jump learning. The key ingredient of our method is deep discretization, which adaptively discretizes the treatment space by leveraging deep learning and multi-scale change point detection. This allows existing OPE methods for discrete treatments to be applied to continuous treatments. Our method is further justified by theoretical results, simulations, and a real application to warfarin dosing.
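To make the "discretize, then apply a discrete OPE method" idea concrete, the following is a minimal sketch and not the authors' deep jump learning procedure. It assumes a fixed, hand-chosen set of dose bins (the paper instead learns the partition adaptively) and a known behavior-policy probability for each bin, and it uses a plain inverse probability weighting estimator. The function name `binned_ipw_value` and all of its parameters are hypothetical.

```python
import numpy as np

def binned_ipw_value(actions, rewards, target_actions, behavior_bin_probs, edges):
    """Inverse probability weighting value estimate over fixed dose bins.

    actions            observed continuous doses, shape (n,)
    rewards            observed outcomes, shape (n,)
    target_actions     doses the evaluated policy would assign, shape (n,)
    behavior_bin_probs assumed-known probability that the behavior policy
                       picks each bin for each subject, shape (n, k)
    edges              bin boundaries, shape (k + 1,); fixed by hand here,
                       whereas the paper learns the partition from data
    """
    inner = edges[1:-1]                       # interior boundaries only
    obs_bin = np.digitize(actions, inner)     # bin index in 0..k-1
    tgt_bin = np.digitize(target_actions, inner)
    # A sample contributes only if its observed dose falls in the same bin
    # as the dose the target policy would have chosen.
    match = (obs_bin == tgt_bin).astype(float)
    weights = match / behavior_bin_probs[np.arange(len(actions)), obs_bin]
    return float(np.mean(weights * rewards))

# Tiny hand-checkable example: two bins, [0, 0.5) and [0.5, 1].
edges = np.array([0.0, 0.5, 1.0])
actions = np.array([0.1, 0.9])
rewards = np.array([1.0, 3.0])
target_actions = np.array([0.2, 0.3])   # target policy always picks the low bin
probs = np.full((2, 2), 0.5)            # behavior policy picks each bin with prob 0.5
value = binned_ipw_value(actions, rewards, target_actions, probs, edges)
print(value)  # 1.0
```

Only the first sample matches the target policy's bin, so it receives weight 1 / 0.5 = 2 and the estimate is (2 × 1 + 0 × 3) / 2 = 1. With fixed bins the estimate can be biased whenever the outcome varies within a bin, which is the issue an adaptive partition is meant to address.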

Topics

Machine Learning > Core Methods > Regression
Machine Learning > Optimization & Theory > Statistical Learning
Machine Learning > Application Areas > Risk Management
Healthcare & Medicine > Clinical > Medical AI
Deep Learning > Learning Types > Deep Learning
Machine Learning > Learning Types > Causal Inference

Keywords

change point detection; causal inference; off-policy evaluation; continuous treatment; personalized dosing; dose-response modeling; deep discretization; deep jump learning
