Near-Optimally Teaching the Crowd to Classify

Adish Singla; Ilija Bogunovic; Gabor Bartok; Amin Karbasi; Andreas Krause

2014 ICML ICML 2014

Near-Optimally Teaching the Crowd to Classify

Abstract

How should we present training examples to learners to teach them classification rules? This is a natural problem when training workers for crowdsourcing labeling tasks, and is also motivated by challenges in data-driven online education. We propose a natural stochastic model of the learners, modeling them as randomly switching among hypotheses based on observed feedback. We then develop STRICT, an efficient algorithm for selecting examples to teach to workers. Our solution greedily maximizes a submodular surrogate objective function in order to select examples to show to the learners. We prove that our strategy is competitive with the optimal teaching policy. Moreover, for the special case of linear separators, we prove that an exponential reduction in error probability can be achieved. Our experiments on simulated workers as well as three real image annotation tasks on Amazon Mechanical Turk show the effectiveness of our teaching algorithm.

🧭 Keyword Pioneer — teaching algorithm

🐣 Hot Topic Early Bird — active learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

Authors

Adish Singla , Ilija Bogunovic , Gabor Bartok , Amin Karbasi , Andreas Krause

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Active Learning Machine Learning > Learning Types > Online Learning Machine Learning > Learning Types > Multi-Agent Systems

Keywords

submodular optimization active learning online learning stochastic model greedy algorithm teaching algorithm crowd teaching classification teaching

Download PDF

Related papers

Demystifying Information-Theoretic Clustering 2014

Margins, Kernels and Non-linear Smoothed Perceptrons 2014

Large-Margin Metric Learning for Constrained Partitioning Problems 2014

Efficient Approximation of Cross-Validation for Kernel Methods using Bouligand Influence Function 2014

Generalized Exponential Concentration Inequality for Renyi Divergence Estimation 2014