Feature Construction for Inverse Reinforcement Learning

Sergey Levine; Zoran Popovic; Vladlen Koltun

2010 NIPS NeurIPS 2010

Feature Construction for Inverse Reinforcement Learning

Abstract

The goal of inverse reinforcement learning is to find a reward function for a Markov decision process, given example traces from its optimal policy. Current IRL techniques generally rely on user-supplied features that form a concise basis for the reward. We present an algorithm that instead constructs reward features from a large collection of component features, by building logical conjunctions of those component features that are relevant to the example policy. Given example traces, the algorithm returns a reward function as well as the constructed features. The reward function can be used to recover a full, deterministic, stationary policy, and the features can be used to transplant the reward function into any novel environment on which the component features are well defined.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — policy recovery

🐣 Hot Topic Early Bird — markov decision process

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🌱 Topic Pioneer — Applications

📈 Trend Setter — Reasoning

Authors

Sergey Levine , Zoran Popovic , Vladlen Koltun

Topics

Artificial Intelligence > Core AI > Causal Inference Artificial Intelligence > Core AI > Planning Machine Learning > Core Methods > Representation Learning Reinforcement Learning > Methods > Policy Learning Reinforcement Learning > Applications Artificial Intelligence > Core AI > Reasoning Machine Learning > Learning Types > Imitation Learning

Keywords

policy learning inverse reinforcement learning markov decision process reward function feature construction policy recovery policy extraction

Download PDF

Related papers

Link Discovery using Graph Feature Tracking 2010

Trading off Mistakes and Don't-Know Predictions 2010

A Novel Kernel for Learning a Neuron Model from Spike Train Data 2010

Decomposing Isotonic Regression for Efficiently Solving Large Problems 2010

Learning Kernels with Radiuses of Minimum Enclosing Balls 2010