Learning to combine foveal glimpses with a third-order Boltzmann machine

Hugo Larochelle; Geoffrey E. Hinton

NeurIPS (NIPS) 2010

Abstract

We describe a model based on a Boltzmann machine with third-order connections that can learn how to accumulate information about a shape over several fixations. The model uses a retina that only has enough high resolution pixels to cover a small area of the image, so it must decide on a sequence of fixations and it must combine the "glimpse" at each fixation with the location of the fixation before integrating the information with information from other glimpses of the same object. We evaluate this model on a synthetic dataset and two image classification datasets, showing that it can perform at least as well as a model trained on whole images.
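The abstract's central idea, that each glimpse is combined with its fixation location before being integrated with other glimpses, can be illustrated with a small sketch. This is not the authors' implementation: the function names, the dense weight tensor `W`, the one-hot location code, and the choice to sum contributions across fixations before a sigmoid are all assumptions made for illustration.

```python
import math

def hidden_input(x, z, W):
    """Third-order input to each hidden unit k: sum over i, j of W[i][j][k] * x[i] * z[j].

    x: glimpse pixel values, z: fixation-location code (one-hot here),
    W: hypothetical weight tensor indexed [pixel][location][hidden].
    """
    n_hidden = len(W[0][0])
    return [
        sum(W[i][j][k] * x[i] * z[j]
            for i in range(len(x))
            for j in range(len(z)))
        for k in range(n_hidden)
    ]

def accumulate(glimpses, locations, W, bias):
    """Sum the location-gated contribution of every glimpse, then squash.

    Summing across fixations is an assumed integration rule for this sketch.
    """
    total = list(bias)
    for x, z in zip(glimpses, locations):
        contrib = hidden_input(x, z, W)
        total = [t + c for t, c in zip(total, contrib)]
    return [1.0 / (1.0 + math.exp(-t)) for t in total]
```

Because the weights multiply a pixel value and a location unit together, the same glimpse contributes differently to the hidden units depending on where it was taken, which is what lets evidence from several fixations be pooled into one representation.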

Authors

Hugo Larochelle, Geoffrey E. Hinton

Topics

Artificial Intelligence > Core AI > Multimodal Learning
Machine Learning > Learning Types > Self-Supervised Learning
Deep Learning > Architectures > Neural Networks
Computer Vision > Analysis > Object Detection
Computer Vision > Analysis > Scene Understanding
Computer Vision > Core AI > Multimodal Learning
Computer Vision > Core AI > Computer Vision
Deep Learning > Techniques > Attention

Keywords

image classification, attention mechanism, visual attention, Boltzmann machine, foveal glimpses, multi-fixation learning, information integration
