Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification

Ruobing Wu; Baoyuan Wang; Wenping Wang; Yizhou Yu

ICCV 2015

Abstract

Recent work on scene classification still makes use of generic CNN features in a rudimentary manner. In this paper, we present a novel pipeline built upon deep CNN features to harvest discriminative visual objects and parts for scene classification. We first use a region proposal technique to generate a set of high-quality patches potentially containing objects, and apply a pre-trained CNN to extract generic deep features from these patches. Then we perform both unsupervised and weakly supervised learning to screen these patches and discover discriminative ones representing category-specific objects and parts. We further apply discriminative clustering enhanced with local CNN fine-tuning to aggregate similar objects and parts into groups, called meta objects. A scene image representation is constructed by pooling the feature response maps of all the learned meta objects at multiple spatial scales. We have confirmed that the scene image representation obtained using this new pipeline is capable of delivering state-of-the-art performance on two popular scene benchmark datasets, MIT Indoor 67 [22] and Sun397 [31].
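The final step of the abstract, pooling the meta-object response maps at multiple spatial scales, can be illustrated with a small sketch. This is not the authors' implementation: the function names `pool_cell` and `pyramid_pool`, the use of max pooling, and the pyramid levels `(1, 2)` are all assumptions chosen for illustration. Each response map is taken to be a 2D grid of scores for one meta object, with at least as many rows and columns as the finest pyramid level.

```python
from typing import List, Sequence

Grid = List[List[float]]  # one response map: rows x cols of scores


def pool_cell(grid: Grid, r0: int, r1: int, c0: int, c1: int) -> float:
    """Max response inside the half-open cell [r0, r1) x [c0, c1)."""
    return max(grid[r][c] for r in range(r0, r1) for c in range(c0, c1))


def pyramid_pool(maps: Sequence[Grid], levels: Sequence[int] = (1, 2)) -> List[float]:
    """Concatenate max-pooled responses of every map over an n x n grid per level.

    Assumption: max pooling and these levels are illustrative choices,
    not values taken from the paper.
    """
    feature: List[float] = []
    for grid in maps:
        rows, cols = len(grid), len(grid[0])
        for n in levels:
            for i in range(n):
                for j in range(n):
                    # Integer cell boundaries that tile the whole map.
                    r0, r1 = i * rows // n, (i + 1) * rows // n
                    c0, c1 = j * cols // n, (j + 1) * cols // n
                    feature.append(pool_cell(grid, r0, r1, c0, c1))
    return feature


# Two hypothetical 2 x 2 response maps, one per meta object.
example_maps = [
    [[0.1, 0.9], [0.3, 0.2]],
    [[0.5, 0.4], [0.0, 0.7]],
]
example_feature = pyramid_pool(example_maps)
```

With two maps and levels `(1, 2)`, each map contributes 1 + 4 pooled values, so the resulting image descriptor has 10 dimensions. In a pipeline like the one described, such a vector would then be passed to a scene classifier.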

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Learning Types > Weakly Supervised Learning
Computer Vision > Analysis > Scene Understanding
Deep Learning > Learning Types > Deep Learning

Keywords

feature extraction, scene classification, object detection, weakly supervised learning, discriminative clustering, region proposal, convolutional neural network, deep CNN feature, meta object
