Algorithms for Learning Markov Field Policies

Abdeslam Boularias; Jan R. Peters; Oliver B. Kroemer

2012 NIPS NeurIPS 2012

Algorithms for Learning Markov Field Policies

Abstract

We present a new graph-based approach for incorporating domain knowledge in reinforcement learning applications. The domain knowledge is given as a weighted graph, or a kernel matrix, that loosely indicates which states should have similar optimal actions. We first introduce a bias into the policy search process by deriving a distribution on policies such that policies that disagree with the provided graph have low probabilities. This distribution corresponds to a Markov Random Field. We then present a reinforcement and an apprenticeship learning algorithms for finding such policy distributions. We also illustrate the advantage of the proposed approach on three problems: swing-up cart-balancing with nonuniform and smooth frictions, gridworlds, and teaching a robot to grasp new objects.

🌉 Interdisciplinary Bridge — Reinforcement Learning and Robotics

🧭 Keyword Pioneer — robot grasping

🐣 Hot Topic Early Bird — reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

📈 Trend Setter — Robotics

Authors

Abdeslam Boularias , Jan R. Peters , Oliver B. Kroemer

Topics

Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Methods > Policy Learning Reinforcement Learning > Applications > Robotics Robotics Robotics > Capabilities > Manipulation Machine Learning > Core Methods > Graphical Models Machine Learning > Learning Types > Multi-Agent Systems

Keywords

reinforcement learning policy learning policy search apprenticeship learning robot grasping markov random field graph kernel graph-based policy

Download PDF

Related papers

Kernel Hyperalignment 2012

Fused sparsity and robust estimation for linear models with unknown variance 2012

Slice sampling normalized kernel-weighted completely random measure mixture models 2012

Scaling MPE Inference for Constrained Continuous Markov Random Fields with Consensus Optimization 2012

Matrix reconstruction with the local max norm 2012