Semi-supervised sequence tagging with bidirectional language models

Matthew E. Peters; Waleed Ammar; Chandra Bhagavatula; Russell Power

2017 ACL ACL 2017

Semi-supervised sequence tagging with bidirectional language models

Abstract

AbstractPre-trained word embeddings learned from unlabeled text have become a standard component of neural network architectures for NLP tasks. However, in most cases, the recurrent network that operates on word-level representations to produce context sensitive representations is trained on relatively little labeled data. In this paper, we demonstrate a general semi-supervised approach for adding pretrained context embeddings from bidirectional language models to NLP systems and apply it to sequence labeling tasks. We evaluate our model on two standard datasets for named entity recognition (NER) and chunking, and in both cases achieve state of the art results, surpassing previous systems that use other forms of transfer or joint learning with additional labeled data and task specific gazetteers.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — contextual embedding

🐣 Hot Topic Early Bird — contextual embedding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Matthew E. Peters , Waleed Ammar , Chandra Bhagavatula , Russell Power

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Named Entity Recognition

Keywords

transfer learning sequence labeling named entity recognition contextual embedding bidirectional language model

Download PDF

Related papers

A* CCG Parsing with a Supertag and Dependency Factored Model 2017

Detecting annotation noise in automatically labelled data 2017

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2017

Annotating tense, mood and voice for English, French and German 2017

Word Embedding for Response-To-Text Assessment of Evidence 2017