Policy Search for Motor Primitives in Robotics

Jens Kober; Jan R. Peters

2008 NIPS NeurIPS 2008

Policy Search for Motor Primitives in Robotics

Abstract

Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high-dimensional reinforcement learning problems often beyond the reach of current methods. In this paper, we extend previous work on policy learning from the immediate reward case to episodic reinforcement learning. We show that this results into a general, common framework also connected to policy gradient methods and yielding a novel algorithm for policy learning by assuming a form of exploration that is particularly well-suited for dynamic motor primitives. The resulting algorithm is an EM-inspired algorithm applicable in complex motor learning tasks. We compare this algorithm to alternative parametrized policy search methods and show that it outperforms previous methods. We apply it in the context of motor learning and show that it can learn a complex Ball-in-a-Cup task using a real Barrett WAM robot arm.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning and Robotics

📈 Trend Setter — Self-Supervised Learning

🧭 Keyword Pioneer — motor primitives

🐣 Hot Topic Early Bird — reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Jens Kober , Jan R. Peters

Topics

Machine Learning > Learning Types > Self-Supervised Learning Reinforcement Learning > Methods > Policy Learning Reinforcement Learning > Applications > Robotics Robotics > Capabilities > Manipulation Artificial Intelligence > Core AI > Robotics

Keywords

reinforcement learning imitation learning robotic manipulation em algorithm policy search motor primitive

Download PDF

Related papers

On the Efficient Minimization of Classification Calibrated Surrogates 2008

Hebbian Learning of Bayes Optimal Decisions 2008

Biasing Approximate Dynamic Programming with a Lower Discount Factor 2008

Counting Solution Clusters in Graph Coloring Problems Using Belief Propagation 2008

Domain Adaptation with Multiple Sources 2008