Sequential crowdsourced labeling as an epsilon-greedy exploration in a Markov Decision Process

Vikas Raykar; Priyanka Agrawal

2014 AISTATS AISTATS 2014

Sequential crowdsourced labeling as an epsilon-greedy exploration in a Markov Decision Process

Abstract

Crowdsourcing marketplaces are widely used for curating large annotated datasets by collecting labels from multiple annotators. In such scenarios one has to balance the tradeoff between the accuracy of the collected labels, the cost of acquiring these labels, and the time taken to finish the labeling task. With the goal of reducing the labeling cost, we introduce the notion of sequential crowdsourced labeling, where instead of asking for all the labels in one shot we acquire labels from annotators sequentially one at a time. We model it as an epsilon-greedy exploration in a Markov Decision Process with a Bayesian decision theoretic utility function that incorporates accuracy, cost and time. Experimental results confirm that the proposed sequential labeling procedure can achieve similar accuracy at roughly half the labeling cost and at any stage in the labeling process the algorithm achieves a higher accuracy compared to randomly asking for the next label.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — sequential labeling

🐣 Hot Topic Early Bird — markov decision process

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Vikas Raykar , Priyanka Agrawal

Topics

Machine Learning > Learning Types > Active Learning Machine Learning > Optimization & Theory > Bayesian Inference Reinforcement Learning > Methods > Deep RL Machine Learning > Learning Types > Online Learning Machine Learning > Learning Types > Multi-Armed Bandits

Keywords

active learning markov decision process bayesian decision theory crowdsourced labeling sequential labeling epsilon-greedy exploration

Download PDF

Related papers

Improved Bounds for Online Learning Over the Permutahedron and Other Ranking Polytopes 2014

PAC-Bayesian Theory for Transductive Learning 2014

Sparse Bayesian Variable Selection for the Identification of Antigenic Variability in the Foot-and-Mouth Disease Virus 2014

Analytic Long-Term Forecasting with Periodic Gaussian Processes 2014

Exploiting the Limits of Structure Learning via Inherent Symmetry 2014