Lipschitz Continuity in Model-based Reinforcement Learning

Kavosh Asadi; Dipendra Misra; Michael Littman

2018 ICML ICML 2018

Lipschitz Continuity in Model-based Reinforcement Learning

Abstract

We examine the impact of learning Lipschitz continuous models in the context of model-based reinforcement learning. We provide a novel bound on multi-step prediction error of Lipschitz models where we quantify the error using the Wasserstein metric. We go on to prove an error bound for the value-function estimate arising from Lipschitz models and show that the estimated value function is itself Lipschitz. We conclude with empirical results that show the benefits of controlling the Lipschitz constant of neural-network models.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — wasserstein metric

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

🐣 Hot Topic Early Bird — value function

Authors

Kavosh Asadi , Dipendra Misra , Michael Littman

Topics

Machine Learning > Optimization & Theory > Theory Reinforcement Learning > Methods > Deep RL Artificial Intelligence > Core AI > Robotics

Keywords

value function model-based reinforcement learning lipschitz continuity wasserstein metric prediction error neural network multi-step prediction

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018