The Multidimensional Wisdom of Crowds

Peter Welinder; Steve Branson; Pietro Perona; Serge J. Belongie

NeurIPS (NIPS) 2010

Abstract

Distributing labeling tasks among hundreds or thousands of annotators is an increasingly important method for annotating large datasets. We present a method for estimating the underlying value (e.g., the class) of each image from noisy annotations provided by multiple annotators. Our method is based on a model of the image formation and annotation process. Each image has different characteristics that are represented in an abstract Euclidean space. Each annotator is modeled as a multidimensional entity with variables representing competence, expertise, and bias. This allows the model to discover and represent groups of annotators that have different sets of skills and knowledge, as well as groups of images that differ qualitatively. We find that our model predicts ground-truth labels on both synthetic and real data more accurately than state-of-the-art methods. Experiments also show that our model, starting from a set of binary labels, can discover rich information, such as different "schools of thought" among the annotators, and can group together images belonging to separate categories.
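The annotation process described in the abstract can be illustrated with a toy simulation. The abstract gives no equations, so everything below is an assumption made for illustration: each image is a point in a 2-D space, and each annotator is a hypothetical triple of a weight vector (expertise), a decision threshold (bias), and a noise level (inverse competence). The sketch only generates noisy binary labels and aggregates them by majority vote as a baseline; it does not implement the authors' inference method.

```python
import random


def annotate(x, w, tau, sigma, rng):
    """Return one annotator's binary label for an image (illustrative only).

    x     : image position in an abstract D-dimensional space
    w     : annotator weight vector (assumed stand-in for expertise)
    tau   : decision threshold (assumed stand-in for bias)
    sigma : std of the annotator's perceptual noise (inverse competence)
    """
    # The annotator sees a noisy version of the image signal.
    noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
    # Project onto the annotator's weights and compare to the threshold.
    score = sum(wi * yi for wi, yi in zip(w, noisy))
    return 1 if score >= tau else 0


def majority_vote(labels):
    """Baseline aggregation: most common binary label (ties resolve to 1)."""
    return 1 if 2 * sum(labels) >= len(labels) else 0


def simulate(num_images=200, seed=0):
    """Return majority-vote accuracy on synthetic images and annotators."""
    rng = random.Random(seed)
    # Hypothetical annotators: (weights w, threshold tau, noise sigma).
    annotators = [
        ([1.0, 0.0], 0.0, 0.2),   # competent, unbiased
        ([1.0, 0.0], 0.8, 0.2),   # competent, biased toward answering 0
        ([0.7, 0.7], 0.0, 0.2),   # also attends to a nuisance dimension
        ([1.0, 0.0], 0.0, 2.0),   # low competence (very noisy)
        ([1.0, 0.0], -0.3, 0.5),  # mildly biased toward answering 1
    ]
    correct = 0
    for _ in range(num_images):
        truth = rng.randint(0, 1)
        # Dimension 0 carries the class; dimension 1 is a nuisance factor.
        x = [1.0 if truth else -1.0, rng.gauss(0.0, 1.0)]
        labels = [annotate(x, w, tau, sigma, rng) for w, tau, sigma in annotators]
        correct += int(majority_vote(labels) == truth)
    return correct / num_images


if __name__ == "__main__":
    print("majority-vote accuracy:", simulate())
```

Majority vote treats all five annotators identically, which is the limitation the abstract's multidimensional annotator model is meant to address: a method that estimates each annotator's noise, threshold, and weights could discount the noisy annotator and correct for the biased ones.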

Topics

Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Core Methods > Classification
Data Science & Analytics > Applications > Clustering
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Core Methods > Probabilistic Modeling

Keywords

probabilistic modeling, label aggregation, crowdsourcing annotation, multi-annotator learning, annotator modeling
