NeurIPS (NIPS) 2015

Visalogy: Answering Visual Analogy Questions

Abstract

In this paper, we study the problem of answering visual analogy questions. These questions take the form "image A is to image B as image C is to what?". Answering them entails discovering the mapping from image A to image B, extending that mapping to image C, and searching for the image D such that the relation from A to B also holds from C to D. We pose this problem as learning an embedding, using convolutional neural networks with a quadruple Siamese architecture, that encourages pairs of analogous images with similar transformations to be close together. We introduce a dataset of visual analogy questions in natural images and present the first results on solving analogy questions on natural images.
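To make the retrieval step concrete, here is a minimal sketch of how an analogy question could be answered once image embeddings are available. It assumes, for illustration only, that the transformation between two images is the difference of their embedding vectors and that candidates are ranked by cosine similarity; the abstract does not specify either choice. The function names (`answer_analogy`, `cosine`, `sub`) and the toy 2-D vectors are hypothetical, and no CNN or training is shown.

```python
import math


def sub(u, v):
    """Element-wise difference u - v of two equal-length vectors."""
    return [x - y for x, y in zip(u, v)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def answer_analogy(emb_a, emb_b, emb_c, candidates):
    """Rank candidate images D for the question A : B :: C : ?.

    Assumption (not from the paper): the A-to-B transformation is modeled
    as the embedding difference emb_b - emb_a, and a candidate D is scored
    by how well emb_d - emb_c aligns with it.
    `candidates` maps a candidate name to its embedding vector.
    """
    target = sub(emb_b, emb_a)
    scored = [
        (cosine(target, sub(emb_d, emb_c)), name)
        for name, emb_d in candidates.items()
    ]
    scored.sort(reverse=True)  # best-aligned candidate first
    return [name for _, name in scored]


# Toy embeddings standing in for network outputs (hypothetical values).
ranking = answer_analogy(
    emb_a=[0.0, 0.0],
    emb_b=[1.0, 0.0],
    emb_c=[0.0, 1.0],
    candidates={
        "d_same_shift": [1.0, 1.0],
        "d_orthogonal": [0.0, 2.0],
        "d_opposite": [-1.0, 1.0],
    },
)
print(ranking)
```

In this toy setup the candidate whose offset from C matches the offset from A to B ranks first. A learned embedding would replace the hand-written vectors, and the similarity measure could differ from the cosine used here.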
