It's Not Polite to Point: Describing People with Uncertain Attributes

Amir Sadovnik; Andrew Gallagher; Tsuhan Chen

2013 CVPR CVPR 2013

It's Not Polite to Point: Describing People with Uncertain Attributes

Abstract

Visual attributes are powerful features for many different applications in computer vision such as object detection and scene recognition. Visual attributes present another application that has not been examined as rigorously: verbal communication from a computer to a human. Since many attributes are nameable, the computer is able to communicate these concepts through language. However, this is not a trivial task. Given a set of attributes, selecting a subset to be communicated is task dependent. Moreover, because attribute classifiers are noisy, it is important to find ways to deal with this uncertainty. We address the issue of communication by examining the task of composing an automatic description of a person in a group photo that distinguishes him from the others. We introduce an efficient, principled method for choosing which attributes are included in a short description to maximize the likelihood that a third party will correctly guess to which person the description refers. We compare our algorithm to computer baselines and human describers, and show the strength of our method in creating effective descriptions.

🚀 Conference Pioneer — CVPR 2013

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Machine Learning and Natural Language Processing

📈 Trend Setter — Text Generation

🧭 Keyword Pioneer — person description

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Amir Sadovnik , Andrew Gallagher , Tsuhan Chen

Topics

Artificial Intelligence > Core AI > Human-AI Interaction Computer Vision > Analysis > Human Analysis Natural Language Processing > Applications > Text Generation Artificial Intelligence > Core AI > Computer Vision Machine Learning > Learning Types > Uncertainty Quantification Computer Vision > Analysis > Computer Vision

Keywords

person description attribute recognition visual attribute attribute classification image description uncertainty handling uncertain attribute attribute classifier verbal communication

Download PDF

Related papers

Nonlinearly Constrained MRFs: Exploring the Intrinsic Dimensions of Higher-Order Cliques 2013

An Approach to Pose-Based Action Recognition 2013

Modeling Actions through State Changes 2013

A Convex Regularizer for Reducing Color Artifact in Color Image Recovery 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences 2013