Efficient Label Collection for Unlabeled Image Datasets

Maggie Wigness; Bruce A. Draper; J. Ross Beveridge

CVPR 2015

Abstract

Visual classifiers are part of many applications, including surveillance, autonomous navigation, and scene understanding. The raw data used to train these classifiers is abundant and easy to collect but lacks labels. Labels are necessary for training supervised classifiers, but the labeling process requires significant human effort. Techniques like active learning and group-based labeling have emerged to help reduce the labeling workload. However, the risk of collecting noisy labels reduces either the efficiency of these systems or the performance of the trained classifiers. Further, many of these techniques introduce latency by iteratively re-training classifiers or re-clustering data. We introduce a technique that searches for structural change in hierarchically clustered data to identify a set of clusters that span a spectrum of visual concept granularities. This allows us to efficiently label clusters with less label noise and produce high-performing classifiers. The data is hierarchically clustered only once, eliminating latency during the labeling process. Using benchmark data, we show that collecting labels with our approach is more efficient than existing labeling techniques and achieves higher classification accuracy. Finally, we demonstrate the speed and efficiency of our system using real-world data collected for an autonomous navigation task.
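The workflow the abstract describes (cluster once, pick clusters from the hierarchy, then label each cluster as a group) can be illustrated with a small sketch. This is not the authors' algorithm: the abstract does not state how structural change is detected, so the test used here (a node is split when its merge height exceeds `jump` times the larger child height, with a `floor` on that child height) is a hypothetical stand-in. The 2-D points, the average-linkage choice, and all function and parameter names are likewise assumptions made for illustration.

```python
from itertools import combinations
from math import dist


def build_hierarchy(points):
    """Average-linkage agglomerative clustering, run exactly once."""
    def avg_link(a, b):
        total = sum(dist(points[i], points[j])
                    for i in a["members"] for j in b["members"])
        return total / (len(a["members"]) * len(b["members"]))

    # Every point starts as its own leaf node with merge height 0.
    active = [{"members": [i], "height": 0.0, "children": []}
              for i in range(len(points))]
    while len(active) > 1:
        a, b = min(combinations(active, 2), key=lambda pair: avg_link(*pair))
        merged = {"members": a["members"] + b["members"],
                  "height": avg_link(a, b),
                  "children": [a, b]}
        active = [n for n in active if n is not a and n is not b] + [merged]
    return active[0]


def select_clusters(node, jump=3.0, floor=0.5):
    """Descend from the root, splitting only where the merge height jumps.

    Assumed criterion (not from the paper): a large jump in merge height
    relative to the children suggests the children are distinct concepts.
    """
    if not node["children"]:
        return [sorted(node["members"])]
    child_height = max(c["height"] for c in node["children"])
    if node["height"] > jump * max(child_height, floor):
        clusters = []
        for child in node["children"]:
            clusters.extend(select_clusters(child, jump, floor))
        return clusters
    return [sorted(node["members"])]


def label_clusters(clusters, ask):
    """Issue one label query per cluster and copy the answer to every member."""
    labels = {}
    for cluster in clusters:
        answer = ask(cluster)
        for i in cluster:
            labels[i] = answer
    return labels


# Toy data: two tight groups that are far apart.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
root = build_hierarchy(points)          # clustering happens only here
clusters = select_clusters(root)

queries = []


def ask(cluster):
    """Stand-in for a human annotator (hypothetical)."""
    queries.append(cluster)
    return "near" if points[cluster[0]][0] < 5 else "far"


labels = label_clusters(clusters, ask)
```

In this toy run all six points receive a label from two queries, and no re-clustering or re-training happens between queries. With real image features, a mixed cluster would pass label noise to its members, which is the trade-off that the choice of cluster granularity is meant to control.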

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Learning Types > Active Learning
Machine Learning > Learning Types > Semi-Supervised Learning
Computer Vision > Domain-Specific > Autonomous Driving
Deep Learning > Learning Types > Self-Supervised Learning
Artificial Intelligence > Learning Paradigms > Active Learning

Keywords

active learning, image classification, semi-supervised learning, scene understanding, unsupervised clustering, hierarchical clustering, label noise, visual classifier, autonomous navigation, label collection
