The Application of Two-Level Attention Models in Deep Convolutional Neural Network for Fine-Grained Image Classification

Tianjun Xiao; Yichong Xu; Kuiyuan Yang; Jiaxing Zhang; Yuxin Peng; Zheng Zhang

2015 CVPR CVPR 2015

The Application of Two-Level Attention Models in Deep Convolutional Neural Network for Fine-Grained Image Classification

Abstract

Fine-grained classification is challenging because categories can only be discriminated by subtle and local differences. Variances in the pose, scale or rotation usually make the problem more difficult. Most fine-grained classification systems follow the pipeline of finding foreground object or object parts (where) to extract discriminative features (what). In this paper, we propose to apply visual attention to fine-grained classification task using deep neural network. Our pipeline integrates three types of attention: the bottom-up attention that propose candidate patches, the object-level top-down attention that selects relevant patches to a certain object, and the part-level top-down attention that localizes discriminative parts. We combine these attentions to train domain-specific deep nets, then use it to improve both the what and where aspects. Importantly, we avoid using expensive annotations like bounding box or part information from end-to-end. The weak supervision constraint makes our work easier to generalize. We have verified the effectiveness of the method on the subsets of ILSVRC2012 dataset and CUB200_2011 dataset. Our pipeline delivered significant improvements and achieved the best accuracy under the weakest supervision condition. The performance is competitive against other methods that rely on additional annotations.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning

📈 Trend Setter — Attention

🧭 Keyword Pioneer — fine-grained image classification

🐣 Hot Topic Early Bird — fine-grained classification

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Tianjun Xiao , Yichong Xu , Kuiyuan Yang , Jiaxing Zhang , Yuxin Peng , Zheng Zhang

Topics

Machine Learning > Learning Types > Weakly Supervised Learning Deep Learning > Techniques > Model Architecture Computer Vision > Analysis > Semantic Segmentation Computer Vision > Analysis > Image Classification Deep Learning > Architectures > Convolutional Neural Networks Artificial Intelligence > Core AI > Attention

Keywords

feature extraction weakly supervised learning fine-grained classification visual attention convolutional neural network part localization fine-grained image classification

Download PDF

Related papers

Long-Term Correlation Tracking 2015

Hierarchically-Constrained Optical Flow 2015

Propagated Image Filtering 2015

Web Scale Photo Hash Clustering on A Single Machine 2015

Expanding Object Detector's Horizon: Incremental Learning Framework for Object Detection in Videos 2015