Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

Aditya Siddhant; Zachary C. Lipton

2018 EMNLP EMNLP 2018

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study

Abstract

AbstractSeveral recent papers investigate Active Learning (AL) for mitigating the data dependence of deep learning for natural language processing. However, the applicability of AL to real-world problems remains an open question. While in supervised learning, practitioners can try many different methods, evaluating each against a validation set before selecting a model, AL affords no such luxury. Over the course of one AL run, an agent annotates its dataset exhausting its labeling budget. Thus, given a new task, we have no opportunity to compare models and acquisition functions. This paper provides a large-scale empirical study of deep active learning, addressing multiple tasks and, for each, multiple datasets, multiple models, and a full suite of acquisition functions. We find that across all settings, Bayesian active learning by disagreement, using uncertainty estimates provided either by Dropout or Bayes-by-Backprop significantly improves over i.i.d. baselines and usually outperforms classic uncertainty sampling.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — dropout uncertainty

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Aditya Siddhant , Zachary C. Lipton

Topics

Machine Learning > Learning Types > Active Learning Machine Learning > Learning Types > Semi-Supervised Learning Machine Learning > Optimization & Theory > Bayesian Inference Natural Language Processing > Resources & Methods > Large Language Models Machine Learning > Bayesian & Probabilistic > Bayesian Inference Machine Learning > Learning Paradigms > Active Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

bayesian inference natural language processing deep learning uncertainty sampling bayesian active learning bayesian deep learning dropout uncertainty deep active learning disagreement sampling

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018