2020
AAAI
AAAI 2020
Point-Based Methods for Model Checking in Partially Observable Markov Decision Processes
Abstract
Abstract Autonomous systems are often required to operate in partially observable environments. They must reliably execute a specified objective even with incomplete information about the state of the environment. We propose a methodology to synthesize policies that satisfy a linear temporal logic formula in a partially observable Markov decision process (POMDP). By formulating a planning problem, we show how to use point-based value iteration methods to efficiently approximate the maximum probability of satisfying a desired logical formula and compute the associated belief state policy. We demonstrate that our method scales to large POMDP domains and provides strong bounds on the performance of the resulting policy.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Mathematics & Optimization and Reinforcement Learning
🧭
Keyword Pioneer
— belief state policy
🐣
Hot Topic Early Bird
— partially observable markov decision process
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Planning
Reinforcement Learning > Methods > Deep RL
Reinforcement Learning > Applications > Value Iteration
Machine Learning > Learning Types > Reinforcement Learning
Mathematics & Optimization > Optimization > Optimization
Mathematics & Optimization > Optimization > Optimal Control