Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Artem Shelmanov; Dmitri Puzyrev; Lyubov Kupriyanova; Denis Belyakov; Daniil Larionov; Nikita Khromov; Olga Kozlova; Ekaterina Artemova; Dmitry V. Dylov; Alexander Panchenko

EACL 2021

Abstract

Annotating training data for sequence tagging of texts is usually very time-consuming. Recent advances in transfer learning for natural language processing, combined with active learning, make it possible to significantly reduce the necessary annotation budget. We are the first to thoroughly investigate this combination for the sequence tagging task. We conduct an extensive empirical study of various Bayesian uncertainty estimation methods and Monte Carlo dropout options for deep pre-trained models in the active learning framework and find the best combinations for different types of models. We also demonstrate that, to acquire instances during active learning, a full-size Transformer can be substituted with a distilled version, which yields better computational performance and reduces obstacles to applying deep active learning in practice.
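To make the acquisition step concrete, here is a minimal sketch of how Monte Carlo dropout outputs could be turned into an uncertainty score for choosing sentences to annotate. It assumes the per-token class probabilities from several stochastic forward passes are already available as arrays. The score used is BALD (mutual information), one commonly used Bayesian acquisition score; the abstract does not say which scores were evaluated, so this choice is an assumption. The names `bald_score` and `select_for_annotation` are hypothetical and are not taken from the paper's code.

```python
import numpy as np


def entropy(p, axis=-1, eps=1e-12):
    """Shannon entropy (in nats) along the class axis."""
    return -np.sum(p * np.log(p + eps), axis=axis)


def bald_score(mc_probs):
    """Sentence-level BALD score from Monte Carlo dropout samples.

    mc_probs has shape (n_passes, n_tokens, n_classes): per-token class
    probabilities from n_passes forward passes with dropout left active.
    Averaging the token scores over the sentence is an assumption made
    for this sketch.
    """
    mean_probs = mc_probs.mean(axis=0)                 # (n_tokens, n_classes)
    predictive_entropy = entropy(mean_probs)           # (n_tokens,)
    expected_entropy = entropy(mc_probs).mean(axis=0)  # (n_tokens,)
    return float((predictive_entropy - expected_entropy).mean())


def select_for_annotation(pool_mc_probs, budget):
    """Return indices of the `budget` most uncertain sentences in the pool."""
    scores = np.array([bald_score(p) for p in pool_mc_probs])
    order = np.argsort(-scores, kind="stable")  # highest score first
    return order[:budget].tolist()


# Toy pool: sentence 0 gets identical predictions on every pass,
# sentence 1 flips between two labels from pass to pass.
agree = np.tile([[0.9, 0.1]], (4, 3, 1))                    # (4, 3, 2)
disagree = np.array([[[0.99, 0.01]] * 2,
                     [[0.01, 0.99]] * 2] * 2)               # (4, 2, 2)
chosen = select_for_annotation([agree, disagree], budget=1)
```

In this toy pool the sentence whose predictions change across dropout passes is the one selected. Under the setup the abstract describes, the arrays passed to `select_for_annotation` could come from a distilled model rather than the full-size Transformer, which is where the computational saving would arise.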

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning
Machine Learning > Learning Types > Active Learning
Machine Learning > Optimization & Theory > Bayesian Inference
Machine Learning > Learning Types > Transfer Learning
Natural Language Processing > Applications > Named Entity Recognition
Machine Learning > Bayesian & Probabilistic > Bayesian Inference

Keywords

active learning, transfer learning, Bayesian inference, model distillation, sequence tagging, uncertainty estimation, pre-trained language model, Monte Carlo dropout, Bayesian uncertainty

Related papers

Joint Coreference Resolution and Character Linking for Multiparty Conversation 2021

Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO 2021

Representations for Question Answering from Documents with Tables and Text 2021

Gender and Racial Fairness in Depression Research using Social Media 2021