Extending Model-based Policy Gradients for Robots in Heteroscedastic Environments

John Martin; Brendan Englot

2017 CORL CoRL 2017

Extending Model-based Policy Gradients for Robots in Heteroscedastic Environments

Abstract

In this paper, we consider the problem of learning robot control policies in heteroscedastic environments, whose noise properties vary throughout a robot’s state and action space. We consider reinforcement learning algorithms that evaluate policies using learned models of the environment, and we extend this class of algorithms to capture heteroscedastic effects with two enchained Gaussian processes. We explore the capabilities and limitations of this approach, and demonstrate that it reduces model bias across a variety of simulated robotic systems.

🚀 Conference Pioneer — CORL 2017

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning and Robotics

🐣 Hot Topic Early Bird — policy gradient

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

John Martin , Brendan Englot

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Applications > Robotics Robotics > Capabilities > Manipulation Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Bayesian & Probabilistic > Gaussian Processes

Keywords

policy gradient gaussian process model-based reinforcement learning robot control heteroscedastic noise

Download PDF

Related papers

CORe50: a New Dataset and Benchmark for Continuous Object Recognition 2017

Active Incremental Learning of Robot Movement Primitives 2017

Efficient Automatic Perception System Parameter Tuning On Site without Expert Supervision 2017

Opportunistic Active Learning for Grounding Natural Language Descriptions 2017

Adaptable Pouring: Teaching Robots Not to Spill using Fast but Approximate Fluid Simulation 2017