Blending Autonomous Exploration and Apprenticeship Learning

Thomas J. Walsh; Daniel K. Hewlett; Clayton T. Morrison

2011 NIPS NeurIPS 2011

Blending Autonomous Exploration and Apprenticeship Learning

Abstract

We present theoretical and empirical results for a framework that combines the benefits of apprenticeship and autonomous reinforcement learning. Our approach modifies an existing apprenticeship learning framework that relies on teacher demonstrations and does not necessarily explore the environment. The first change is replacing previously used Mistake Bound model learners with a recently proposed framework that melds the KWIK and Mistake Bound supervised learning protocols. The second change is introducing a communication of expected utility from the student to the teacher. The resulting system only uses teacher traces when the agent needs to learn concepts it cannot efficiently learn on its own.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning

📈 Trend Setter — Agent Systems

🧭 Keyword Pioneer — autonomous exploration

🐣 Hot Topic Early Bird — reinforcement learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Thomas J. Walsh , Daniel K. Hewlett , Clayton T. Morrison

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Learning Paradigms > Transfer Learning Reinforcement Learning > Methods > Deep RL Machine Learning > Learning Paradigms > Transfer Learning Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Imitation Learning Artificial Intelligence > Core AI > Reinforcement Learning

Keywords

reinforcement learning imitation learning knowledge transfer mistake bound model apprenticeship learning autonomous exploration agent learning teacher demonstrations kwik learning exploration strategy

Download PDF

Related papers

Co-Training for Domain Adaptation 2011

The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning 2011

Learning to Agglomerate Superpixel Hierarchies 2011

A Reinforcement Learning Theory for Homeostatic Regulation 2011

A Global Structural EM Algorithm for a Model of Cancer Progression 2011