Experience Replay for Continual Learning

David Rolnick; Arun Ahuja; Jonathan Schwarz; Timothy Lillicrap; Gregory Wayne

NeurIPS 2019

Abstract

Interacting with a complex world involves continual learning, in which tasks and data distributions change over time. A continual learning system should demonstrate both plasticity (acquisition of new knowledge) and stability (preservation of old knowledge). Catastrophic forgetting is the failure of stability, in which new experience overwrites previous experience. In the brain, replay of past experience is widely believed to reduce forgetting, yet it has been largely overlooked as a solution to forgetting in deep reinforcement learning. Here, we introduce CLEAR, a replay-based method that greatly reduces catastrophic forgetting in multi-task reinforcement learning. CLEAR leverages off-policy learning and behavioral cloning from replay to enhance stability, as well as on-policy learning to preserve plasticity. We show that CLEAR performs better than state-of-the-art deep learning techniques for mitigating forgetting, despite being significantly less complicated and not requiring any knowledge of the individual tasks being learned.
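The abstract names three ingredients: a replay store, behavioral cloning on replayed data, and continued learning from new experience. The following is a minimal sketch of how those pieces might fit together, not the authors' implementation. The abstract does not specify the buffer policy, the loss forms, or the mixing ratio, so reservoir sampling, a KL-divergence policy-cloning term, a squared-error value-cloning term, and the 50/50 split are assumptions made here for illustration. All names (`ReservoirBuffer`, `policy_cloning_loss`, `value_cloning_loss`, `mixed_batch`) are hypothetical.

```python
import math
import random


class ReservoirBuffer:
    """Fixed-capacity store; every item seen so far is kept with equal probability.

    Assumption: reservoir sampling is one way to retain old tasks without
    knowing task boundaries. The abstract does not prescribe it.
    """

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, item):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(item)
        else:
            # Algorithm R: keep the new item with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = item

    def sample(self, k):
        return self.rng.sample(self.items, min(k, len(self.items)))


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_cloning_loss(stored_probs, current_logits):
    """KL(stored || current): penalizes drift from the policy that produced the replayed data."""
    current = softmax(current_logits)
    return sum(p * math.log(p / q) for p, q in zip(stored_probs, current) if p > 0)


def value_cloning_loss(stored_value, current_value):
    """Squared error between the stored and the current value estimate."""
    return (current_value - stored_value) ** 2


def mixed_batch(new_items, buffer, batch_size, replay_fraction=0.5):
    """Combine fresh experience (plasticity) with replayed experience (stability)."""
    n_replay = int(batch_size * replay_fraction)
    n_new = batch_size - n_replay
    return new_items[:n_new] + buffer.sample(n_replay)
```

In a full agent, the cloning terms would be added to an off-policy reinforcement learning loss computed on the same mixed batch; that part is omitted here because the abstract gives no detail on it.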

Topics

Machine Learning > Learning Types > Continual Learning
Reinforcement Learning > Methods > Deep RL
Machine Learning > Learning Paradigms > Continual Learning
Deep Learning > Learning Types > Continual Learning

Keywords

continual learning, catastrophic forgetting, off-policy learning, behavioral cloning, experience replay, multi-task reinforcement learning

Related papers

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test (2019)

Metalearned Neural Memory (2019)

Model Similarity Mitigates Test Set Overuse (2019)

Continual Unsupervised Representation Learning (2019)

Reinforcement Learning with Convex Constraints (2019)