Semi-Supervised Sequence Modeling with Cross-View Training

Kevin Clark; Minh-Thang Luong; Christopher D. Manning; Quoc Le

2018 EMNLP EMNLP 2018

Semi-Supervised Sequence Modeling with Cross-View Training

Abstract

AbstractUnsupervised representation learning algorithms such as word2vec and ELMo improve the accuracy of many supervised NLP models, mainly because they can take advantage of large amounts of unlabeled text. However, the supervised models only learn from task-specific labeled data during the main training phase. We therefore propose Cross-View Training (CVT), a semi-supervised learning algorithm that improves the representations of a Bi-LSTM sentence encoder using a mix of labeled and unlabeled data. On labeled examples, standard supervised learning is used. On unlabeled examples, CVT teaches auxiliary prediction modules that see restricted views of the input (e.g., only part of a sentence) to match the predictions of the full model seeing the whole input. Since the auxiliary modules and the full model share intermediate representations, this in turn improves the full model. Moreover, we show that CVT is particularly effective when combined with multi-task learning. We evaluate CVT on five sequence tagging tasks, machine translation, and dependency parsing, achieving state-of-the-art results.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — cross-view training

🐣 Hot Topic Early Bird — sequence tagging

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kevin Clark , Minh-Thang Luong , Christopher D. Manning , Quoc Le

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Learning Types > Semi-Supervised Learning Machine Learning > Learning Types > Multi-Task Learning Natural Language Processing > Resources & Methods > Transfer Learning Deep Learning > Learning Types > Self-Supervised Learning Machine Learning > Learning Paradigms > Semi-Supervised Learning Deep Learning > Learning Types > Semi-Supervised Learning Natural Language Processing > Applications > Sequence Labeling

Keywords

representation learning semi-supervised learning multi-task learning sequence tagging cross-view training bi-lstm encoder

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018