Bayesian Hierarchical Reinforcement Learning

Feng Cao; Soumya Ray

2012 NIPS NeurIPS 2012

Bayesian Hierarchical Reinforcement Learning

Abstract

We describe an approach to incorporating Bayesian priors in the maxq framework for hierarchical reinforcement learning (HRL). We define priors on the primitive environment model and on task pseudo-rewards. Since models for composite tasks can be complex, we use a mixed model-based/model-free learning approach to find an optimal hierarchical policy. We show empirically that (i) our approach results in improved convergence over non-Bayesian baselines, given sensible priors, (ii) task hierarchies and Bayesian priors can be complementary sources of information, and using both sources is better than either alone, (iii) taking advantage of the structural decomposition induced by the task hierarchy significantly reduces the computational cost of Bayesian reinforcement learning and (iv) in this framework, task pseudo-rewards can be learned instead of being manually specified, leading to automatic learning of hierarchically optimal rather than recursively optimal policies.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Reinforcement Learning

🧭 Keyword Pioneer — hierarchical reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics

📈 Trend Setter — Meta-Learning

🐣 Hot Topic Early Bird — probabilistic modeling

Authors

Feng Cao , Soumya Ray

Topics

Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning Reinforcement Learning > Methods > Deep RL Reinforcement Learning > Methods > Policy Learning Machine Learning > Bayesian & Probabilistic > Bayesian Learning Machine Learning > Learning Paradigms > Meta-Learning Machine Learning > Learning Types > Multi-Task Learning Machine Learning > Learning Types > Reinforcement Learning Artificial Intelligence > Bayesian & Probabilistic > Bayesian Inference

Keywords

probabilistic modeling bayesian learning bayesian inference bayesian reinforcement learning model-based learning policy learning hierarchical reinforcement learning model-based reinforcement learning maxq framework priors bayesian prior

Download PDF

Related papers

Kernel Hyperalignment 2012

Fused sparsity and robust estimation for linear models with unknown variance 2012

Slice sampling normalized kernel-weighted completely random measure mixture models 2012

Scaling MPE Inference for Constrained Continuous Markov Random Fields with Consensus Optimization 2012

Matrix reconstruction with the local max norm 2012