2011
NIPS
NeurIPS 2011
Nonlinear Inverse Reinforcement Learning with Gaussian Processes
Abstract
We present a probabilistic algorithm for nonlinear inverse reinforcement learning. The goal of inverse reinforcement learning is to learn the reward function in a Markov decision process from expert demonstrations. While most prior inverse reinforcement learning algorithms represent the reward as a linear combination of a set of features, we use Gaussian processes to learn the reward as a nonlinear function, while also determining the relevance of each feature to the expert's policy. Our probabilistic algorithm allows complex behaviors to be captured from suboptimal stochastic demonstrations, while automatically balancing the simplicity of the learned reward structure against its consistency with the observed actions.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Reinforcement Learning
🧭
Keyword Pioneer
— reward learning
🐣
Hot Topic Early Bird
— markov decision process
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics
Authors
Topics
Artificial Intelligence > Core AI > Planning
Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning
Reinforcement Learning > Methods > Deep RL
Reinforcement Learning > Methods > Policy Learning
Machine Learning > Bayesian & Probabilistic > Bayesian Learning
Machine Learning > Learning Types > Reinforcement Learning
Machine Learning > Bayesian & Probabilistic > Gaussian Processes