Active Learning for Probabilistic Hypotheses Using the Maximum Gibbs Error Criterion

Nguyen Viet Cuong; Wee Sun Lee; Nan Ye; Kian Ming A Chai; Hai Leong Chieu

2013 NIPS NeurIPS 2013

Active Learning for Probabilistic Hypotheses Using the Maximum Gibbs Error Criterion

Abstract

We introduce a new objective function for pool-based Bayesian active learning with probabilistic hypotheses. This objective function, called the policy Gibbs error, is the expected error rate of a random classifier drawn from the prior distribution on the examples adaptively selected by the active learning policy. Exact maximization of the policy Gibbs error is hard, so we propose a greedy strategy that maximizes the Gibbs error at each iteration, where the Gibbs error on an instance is the expected error of a random classifier selected from the posterior label distribution on that instance. We apply this maximum Gibbs error criterion to three active learning scenarios: non-adaptive, adaptive, and batch active learning. In each scenario, we prove that the criterion achieves near-maximal policy Gibbs error when constrained to a fixed budget. For practical implementations, we provide approximations to the maximum Gibbs error criterion for Bayesian conditional random fields and transductive Naive Bayes. Our experimental results on a named entity recognition task and a text classification task show that the maximum Gibbs error criterion is an effective active learning criterion for noisy models.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

📈 Trend Setter — Text Classification

🧭 Keyword Pioneer — maximum gibbs error

🐣 Hot Topic Early Bird — active learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Nguyen Viet Cuong , Wee Sun Lee , Nan Ye , Kian Ming A Chai , Hai Leong Chieu

Topics

Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning Machine Learning > Learning Types > Active Learning Natural Language Processing > Applications > Text Classification Machine Learning > Bayesian & Probabilistic > Bayesian Learning Natural Language Processing > Applications > Named Entity Recognition Machine Learning > Bayesian & Probabilistic > Bayesian Inference Machine Learning > Learning Paradigms > Active Learning

Keywords

active learning text classification named entity recognition bayesian active learning maximum gibbs error probabilistic hypotheses pool-based active learning probabilistic hypothesis gibbs error

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013