Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Jack Rae; Jonathan J Hunt; Ivo Danihelka; Timothy Harley; Andrew W. Senior; Gregory Wayne; Alex Graves; Timothy Lillicrap

NIPS (NeurIPS) 2016

Abstract

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows, which limits their applicability to real-world domains. Here, we present an end-to-end differentiable memory access scheme, which we call Sparse Access Memory (SAM), that retains the representational power of the original approaches whilst training efficiently with very large memories. We show that SAM achieves asymptotic lower bounds in space and time complexity, and find that an implementation runs $1,\!000\times$ faster and with $3,\!000\times$ less physical memory than non-sparse models. SAM learns with comparable data efficiency to existing models on a range of synthetic tasks and one-shot Omniglot character recognition, and can scale to tasks requiring hundreds of thousands of time steps and memories. We also show how our approach can be adapted for models that maintain temporal associations between memories, as with the recently introduced Differentiable Neural Computer.
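To make the idea of a sparse read concrete, the following is a minimal sketch, not the authors' implementation. It assumes content-based addressing by cosine similarity and a softmax restricted to the k best-matching rows; the function names `cosine` and `sparse_read` and the parameter `k` are hypothetical. For clarity it scores every row with a linear scan, so it does not itself achieve the complexity bounds described in the abstract.

```python
import heapq
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0.0 else 0.0


def sparse_read(memory, query, k):
    """Read from `memory` using only the k rows most similar to `query`.

    Returns (read_vector, weights), where `weights` maps row index -> weight
    and contains exactly k entries; every other row has an implicit weight of 0.
    """
    # Assumption: score all rows with a linear scan. This is for illustration
    # only; a very large memory would need some index to avoid the full scan.
    scored = [(cosine(row, query), i) for i, row in enumerate(memory)]
    top = heapq.nlargest(k, scored)

    # Softmax over the k selected similarities only (shifted for stability).
    peak = max(s for s, _ in top)
    exps = [(math.exp(s - peak), i) for s, i in top]
    total = sum(e for e, _ in exps)
    weights = {i: e / total for e, i in exps}

    # The read vector is a weighted sum over just the selected rows.
    width = len(query)
    read = [sum(w * memory[i][j] for i, w in weights.items()) for j in range(width)]
    return read, weights


memory = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
read, weights = sparse_read(memory, [1.0, 0.0], k=2)
```

Because only k weights are non-zero, the weighted sum touches k rows regardless of how many rows the memory holds. That is the property a sparse scheme relies on.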

Topics

Artificial Intelligence > Core AI > Memory
Machine Learning > Optimization & Theory > Neural Network Optimization
Machine Learning > Application Areas > Efficient Computing
Deep Learning > Architectures > Neural Networks
Deep Learning > Models > Neural Networks
Deep Learning > Learning Types > Representation Learning

Keywords

one-shot learning, language modeling, external memory, memory-augmented neural network, sparse memory access, differentiable memory, scalable memory, sparse access, neural computer
