Guided Monte Carlo Tree Search for Planning in Learned Environments

Jelle Van Eyck; Jan Ramon; Fabian Guiza; Geert MeyFroidt; Maurice Bruynooghe; Greet Van den Berghe

2013 ACML ACML 2013

Guided Monte Carlo Tree Search for Planning in Learned Environments

Abstract

Monte Carlo tree search (MCTS) is a sampling and simulation based technique for searching in large search spaces containing both decision nodes and probabilistic events. This technique has recently become popular due to its successful application to games, e.g. Poker and Go. Such games have known rules and the alternation between self-moves and non-deterministic events or opponent moves can be used to prune uninteresting branches. In this paper we study a real-world setting where the processes in the domain have a high degree of uncertainty and the need for longer-term planning implies a sequence of (planning) decisions without any intermediate feedback. Fortunately, unlike the combinatorial complexity in strategic games, many real-world environments can be approximated by efficient algorithms on a short term. This paper proposes an MCTS variant using a new type of prior information based on estimating the effects of part of the world and explores its application to the problem of hospital planning, where machine learning algorithms can be used to predict the length of stay of patients for each of the different stages of their recovery.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Knowledge & Reasoning and Machine Learning

📈 Trend Setter — Reinforcement Learning

🧭 Keyword Pioneer — planning decision

🐝 Cross-Pollinator — Artificial Intelligence, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🐣 Hot Topic Early Bird — monte carlo tree search

Authors

Jelle Van Eyck , Jan Ramon , Fabian Guiza , Geert MeyFroidt , Maurice Bruynooghe , Greet Van den Berghe

Topics

Artificial Intelligence > Core AI > Planning Knowledge & Reasoning > Reasoning > Automated Planning Machine Learning > Learning Types > Reinforcement Learning

Keywords

prior information monte carlo tree search planning decision hospital planning patient length of stay

Download PDF

Related papers

Multilabel Classification through Random Graph Ensembles 2013

Multi-armed Bandit Problem with Lock-up Periods 2013

Generalized Aitchison Embeddings for Histograms 2013

Aggregating Predictions via Sequential Mini-Trading 2013

Linear Approximation to ADMM for MAP inference 2013