Autonomous Exploration For Navigating In MDPs

Shiau Hong Lim; Peter Auer

2012 COLT COLT 2012

Autonomous Exploration For Navigating In MDPs

Abstract

While intrinsically motivated learning agents hold considerable promise to overcome limitations of more supervised learning systems, quantitative evaluation and theoretical analysis of such agents are difficult. We propose to consider a restricted setting for autonomous learning where systematic evaluation of learning performance is possible. In this setting the agent needs to learn to navigate in a Markov Decision Process where extrinsic rewards are not present or are ignored. We present a learning algorithm for this scenario and evaluate it by the amount of exploration it uses to learn the environment.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

📈 Trend Setter — Reinforcement Learning

🧭 Keyword Pioneer — exploration algorithm

🐣 Hot Topic Early Bird — reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Shiau Hong Lim , Peter Auer

Topics

Artificial Intelligence > Core AI > Agent Systems Reinforcement Learning > Methods > Policy Learning Reinforcement Learning > Applications > Robotics Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Optimization & Theory > Online Algorithms

Keywords

reinforcement learning markov decision process autonomous exploration intrinsic motivation exploration algorithm exploration strategy

Download PDF

Related papers

Unsupervised SVMs: On the Complexity of the Furthest Hyperplane Problem 2012

Online Optimization with Gradual Variations 2012

Toward a Noncommutative Arithmetic-geometric Mean Inequality: Conjectures, Case-studies, and Consequences 2012

Computational Bounds on Statistical Query Learning 2012

Rare Probability Estimation under Regularly Varying Heavy Tails 2012