Spatial-Semantic Image Search by Visual Feature Synthesis

Long Mai; Hailin Jin; Zhe Lin; Chen Fang; Jonathan Brandt; Feng Liu

2017 CVPR CVPR 2017

Spatial-Semantic Image Search by Visual Feature Synthesis

Abstract

The performance of image retrieval has been improved tremendously in recent years through the use of deep feature representations. Most existing methods, however, aim to retrieve images that are visually similar or semantically relevant to the query, irrespective of spatial configuration. In this paper, we develop a spatial-semantic image search technology that enables users to search for images with both semantic and spatial constraints by manipulating concept text-boxes on a 2D query canvas. We train a convolutional neural network to synthesize appropriate visual features that captures the spatial-semantic constraints from the user canvas query. We directly optimize the retrieval performance of the visual features when training our deep neural network. These visual features then are used to retrieve images that are both spatially and semantically relevant to the user query. The experiments on large-scale datasets such as MS-COCO and Visual Genome show that our method outperforms other baseline and state-of-the-art methods in spatial-semantic image search.

🌉 Interdisciplinary Bridge — Computer Science and Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — semantic constraint

🐣 Hot Topic Early Bird — semantic search

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Long Mai , Hailin Jin , Zhe Lin , Chen Fang , Jonathan Brandt , Feng Liu

Topics

Machine Learning > Core Methods > Metric Learning Computer Vision > Analysis > Scene Understanding Computer Science > Applications > Information Retrieval Deep Learning > Learning Types > Representation Learning Computer Vision > Processing > Image Retrieval

Keywords

image retrieval semantic search feature embedding convolutional neural network semantic constraint spatial constraint visual feature synthesis spatial-semantic search

Download PDF

Related papers

Deep Outdoor Illumination Estimation 2017

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild 2017

Weakly Supervised Semantic Segmentation Using Web-Crawled Videos 2017

FASON: First and Second Order Information Fusion Network for Texture Recognition 2017

Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization 2017