Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

Amir Sadovnik; Andrew Gallagher; Devi Parikh; Tsuhan Chen

2013 ICCV ICCV 2013

Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

Abstract

In recent years, there has been a great deal of progress in describing objects with attributes. Attributes have proven useful for object recognition, image search, face verification, image description, and zero-shot learning. Typically, attributes are either binary or relative: they describe either the presence or absence of a descriptive characteristic, or the relative magnitude of the characteristic when comparing two exemplars. However, prior work fails to model the actual way in which humans use these attributes in descriptive statements of images. Specifically, it does not address the important interactions between the binary and relative aspects of an attribute. In this work we propose a spoken attribute classifier which models a more natural way of using an attribute in a description. For each attribute we train a classifier which captures the specific way this attribute should be used. We show that as a result of using this model, we produce descriptions about images of people that are more natural and specific than past systems.

🚀 Conference Pioneer — ICCV 2013

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning

🐣 Hot Topic Early Bird — zero-shot learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Amir Sadovnik , Andrew Gallagher , Devi Parikh , Tsuhan Chen

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Zero-Shot Learning Computer Vision > Analysis > Face Recognition Artificial Intelligence > Core AI > Computer Vision

Keywords

zero-shot learning facial attribute attribute classification image description relative attribute binary attribute

Download PDF

Related papers

Large-Scale Multi-resolution Surface Reconstruction from RGB-D Sequences 2013

Cascaded Shape Space Pruning for Robust Facial Landmark Detection 2013

Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach 2013

Accurate and Robust 3D Facial Capture Using a Single RGBD Camera 2013

From Where and How to What We See 2013