Regularizing Trajectory Optimization with Denoising Autoencoders

Rinu Boney; Norman Di Palo; Mathias Berglund; Alexander Ilin; Juho Kannala; Antti Rasmus; Harri Valpola

2019 NIPS NeurIPS 2019

Regularizing Trajectory Optimization with Denoising Autoencoders

Abstract

Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize trajectory optimization by means of a denoising autoencoder that is trained on the same trajectories as the model of the environment. We show that the proposed regularization leads to improved planning with both gradient-based and gradient-free optimizers. We also demonstrate that using regularized trajectory optimization leads to rapid initial learning in a set of popular motor control tasks, which suggests that the proposed approach can be a useful tool for improving sample efficiency.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Rinu Boney , Norman Di Palo , Mathias Berglund , Alexander Ilin , Juho Kannala , Antti Rasmus , Harri Valpola

Topics

Machine Learning > Learning Types > Self-Supervised Learning Reinforcement Learning > Applications > Robotics Machine Learning > Learning Types > Model-Based RL Deep Learning > Models > Autoencoders

Keywords

sample efficiency trajectory optimization model-based reinforcement learning denoising autoencoder motion control

Download PDF

Related papers

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test 2019

Metalearned Neural Memory 2019

Model Similarity Mitigates Test Set Overuse 2019

Continual Unsupervised Representation Learning 2019

Reinforcement Learning with Convex Constraints 2019