2010
NIPS
NeurIPS 2010
Nonparametric Bayesian Policy Priors for Reinforcement Learning
Abstract
We consider reinforcement learning in partially observable domains where the agent can query an expert for demonstrations. Our nonparametric Bayesian approach combines model knowledge, inferred from expert information and independent exploration, with policy knowledge inferred from expert trajectories. We introduce priors that bias the agent towards models with both simple representations and simple policies, resulting in improved policy and model learning.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Reinforcement Learning
📈
Trend Setter
— Self-Supervised Learning
🧭
Keyword Pioneer
— partially observable domains
🐣
Hot Topic Early Bird
— reinforcement learning
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics