Reinforcement Learning with Action-Derived Rewards for Chemotherapy and Clinical Trial Dosing Regimen Selection

Gregory Yauney; Pratik Shah

2018 MLHC MLHC 2018

Reinforcement Learning with Action-Derived Rewards for Chemotherapy and Clinical Trial Dosing Regimen Selection

Abstract

Unstructured learning problems without well-defined rewards are unsuitable for current reinforcement learning (RL) approaches. Action-derived rewards can allow RL agents to fully explore state and action trade-offs in scenarios that require specific outcomes yet are unstructured by external reward. Clinical trial dosing choice is an example of such a problem. We report the successful formulation of clinical trial dosing choice as an RL problem using action-based rewards and learning of dosing regimens to reduce mean tumor diameters (MTD) in patients undergoing simulated temozolomide (TMZ) and procarbazine, 1-(2-chloroethyl)-3-cyclohexyl-l-nitrosourea, and vincristine (PCV) chemo- and radiotherapy clinical trials. The use of action-derived rewards as partial proxies for outcomes is described for the first time. Novel dosing regimens learned by an RL agent in the presence of action-derived rewards achieve significant reduction in MTD for cohorts and individual patients in simulated TMZ and PCV clinical trials while reducing treatment cycle administrations and dosage concentrations compared to human-expert dosing regimens. Our approach can be easily adapted for other learning tasks where outcome-based learning is not practical.

🧭 Keyword Pioneer — treatment optimization

🐣 Hot Topic Early Bird — clinical trial

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Gregory Yauney , Pratik Shah

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Core AI > Planning

Keywords

reinforcement learning clinical trial treatment optimization action-derived reward chemotherapy dosing dosing regiman

Download PDF

Related papers

3D Point Cloud-Based Visual Prediction of ICU Mobility Care Activities 2018

Disease-Atlas: Navigating Disease Trajectories using Deep Learning 2018

Multi-Label Learning from Medical Plain Text with Convolutional Residual Models 2018

A Domain Guided CNN Architecture for Predicting Age from Structural Brain Images 2018

Learning to Exploit Invariances in Clinical Time-Series Data using Sequence Transformer Networks 2018