2011
NIPS
NeurIPS 2011
Optimal Reinforcement Learning for Gaussian Systems
Abstract
The exploration-exploitation trade-off is among the central challenges of reinforcement learning. The optimal Bayesian solution is intractable in general. This paper studies to what extent analytic statements about optimal learning are possible if all beliefs are Gaussian processes. A first order approximation of learning of both loss and dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics, is described by an infinite-dimensional partial differential equation. An approximate finite-dimensional projection gives an impression for how this result may be helpful.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Reinforcement Learning
📈
Trend Setter
— Probabilistic Modeling
🧭
Keyword Pioneer
— optimal bayesian solution
🐣
Hot Topic Early Bird
— reinforcement learning
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics
Authors
Topics
Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning
Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Optimization & Theory > Stochastic Processes
Reinforcement Learning > Methods > Deep RL
Reinforcement Learning > Methods > Policy Learning
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Learning Types > Reinforcement Learning
Machine Learning > Bayesian & Probabilistic > Gaussian Processes