Multi-Label Image Recognition With Graph Convolutional Networks

Zhao-Min Chen; Xiu-Shen Wei; Peng Wang; Yanwen Guo

CVPR 2019

Abstract

The task of multi-label image recognition is to predict the set of object labels that are present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve recognition performance. To capture and exploit these dependencies, we propose a multi-label classification model based on a Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by the word embedding of that label, and the GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be trained end-to-end. Furthermore, we propose a novel re-weighting scheme to create an effective label correlation matrix that guides information propagation among the nodes in the GCN. Experiments on two multi-label image recognition datasets show that our approach clearly outperforms existing state-of-the-art methods. In addition, visualization analyses reveal that the classifiers learned by our model maintain a meaningful semantic topology.
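The pipeline described in the abstract (label embeddings propagated over a label graph to produce classifiers, which are then applied to image descriptors) can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the two-layer depth, the LeakyReLU nonlinearity, the row normalization, all dimensions, and the hand-written correlation matrix `adj` are placeholders chosen for the example. Random vectors stand in for the word embeddings and for the image descriptors produced by the image sub-net, and the re-weighting scheme is not reproduced here.

```python
import numpy as np


def gcn_layer(adj, h, w, activate=True):
    """Propagate node features along the label graph, then apply a linear map."""
    out = adj @ h @ w
    # LeakyReLU with slope 0.2 is an assumed nonlinearity, not taken from the abstract.
    return np.maximum(out, 0.2 * out) if activate else out


def label_gcn_scores(image_feat, label_emb, adj, w1, w2):
    """Map label embeddings to per-label classifiers, then score image descriptors."""
    hidden = gcn_layer(adj, label_emb, w1)
    classifiers = gcn_layer(adj, hidden, w2, activate=False)  # (num_labels, feat_dim)
    return image_feat @ classifiers.T  # (batch, num_labels)


rng = np.random.default_rng(0)
num_labels, emb_dim, hidden_dim, feat_dim, batch = 4, 8, 6, 5, 2

# Stand-ins for word embeddings of the labels (hypothetical values).
label_emb = rng.normal(size=(num_labels, emb_dim))

# Toy directed (asymmetric) label correlation matrix, row-normalized.
adj = np.array([
    [1.0, 0.6, 0.0, 0.1],
    [0.2, 1.0, 0.3, 0.0],
    [0.0, 0.5, 1.0, 0.4],
    [0.3, 0.0, 0.1, 1.0],
])
adj = adj / adj.sum(axis=1, keepdims=True)

w1 = rng.normal(scale=0.1, size=(emb_dim, hidden_dim))
w2 = rng.normal(scale=0.1, size=(hidden_dim, feat_dim))

# Stand-in for image descriptors from the image sub-net.
image_feat = rng.normal(size=(batch, feat_dim))

scores = label_gcn_scores(image_feat, label_emb, adj, w1, w2)
probs = 1.0 / (1.0 + np.exp(-scores))  # independent per-label probabilities
print(probs.shape)
```

Because the classifiers are a differentiable function of the label embeddings and the graph, a loss on `scores` can in principle update both the GCN weights and the image sub-net together, which is what makes the end-to-end training mentioned in the abstract possible.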

Topics

Deep Learning > Architectures > Graph Neural Networks

Computer Vision > Analysis > Object Detection

Computer Vision > Analysis > Image Classification

Deep Learning > Learning Types > Multi-Label Classification

Keywords

object recognition, multi-label classification, label dependencies, word embedding, graph convolutional network, semantic topology
