Environmental statistics and the trade-off between model-based and TD learning in humans

Dylan A. Simon; Nathaniel D. Daw

2011 NIPS NeurIPS 2011

Environmental statistics and the trade-off between model-based and TD learning in humans

Abstract

There is much evidence that humans and other animals utilize a combination of model-based and model-free RL methods. Although it has been proposed that these systems may dominate according to their relative statistical efficiency in different circumstances, there is little specific evidence -- especially in humans -- as to the details of this trade-off. Accordingly, we examine the relative performance of different RL approaches under situations in which the statistics of reward are differentially noisy and volatile. Using theory and simulation, we show that model-free TD learning is relatively most disadvantaged in cases of high volatility and low noise. We present data from a decision-making experiment manipulating these parameters, showing that humans shift learning strategies in accord with these predictions. The statistical circumstances favoring model-based RL are also those that promote a high learning rate, which helps explain why, in psychology, the distinction between these strategies is traditionally conceived in terms of rule-based vs. incremental learning.

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning and Reinforcement Learning

📈 Trend Setter — Self-Supervised Learning

🧭 Keyword Pioneer — reward volatility

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🐣 Hot Topic Early Bird — reinforcement learning

Authors

Dylan A. Simon , Nathaniel D. Daw

Topics

Machine Learning > Learning Types > Self-Supervised Learning Reinforcement Learning > Methods > Deep RL Interdisciplinary > Cognitive Science > Cognitive Modeling Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Model-Based RL Artificial Intelligence > Core AI > Decision Making

Keywords

reinforcement learning temporal difference learning cognitive modeling model-based learning model-based reinforcement learning human decision-making reward volatility learning rate adaptation strategy adaptation learning rate model-free learning decision-making experiment

Download PDF

Related papers

Co-Training for Domain Adaptation 2011

The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning 2011

Learning to Agglomerate Superpixel Hierarchies 2011

A Reinforcement Learning Theory for Homeostatic Regulation 2011

A Global Structural EM Algorithm for a Model of Cancer Progression 2011