Variational Pretraining for Semi-supervised Text Classification

Suchin Gururangan; Tam Dang; Dallas Card; Noah A. Smith

2019 ACL ACL 2019

Variational Pretraining for Semi-supervised Text Classification

Abstract

AbstractWe introduce VAMPIRE, a lightweight pretraining framework for effective text classification when data and computing resources are limited. We pretrain a unigram document model as a variational autoencoder on in-domain, unlabeled data and use its internal states as features in a downstream classifier. Empirically, we show the relative strength of VAMPIRE against computationally expensive contextual embeddings and other popular semi-supervised baselines under low resource settings. We also find that fine-tuning to in-domain data is crucial to achieving decent performance from contextual embeddings when working with limited supervision. We accompany this paper with code to pretrain and use VAMPIRE embeddings in downstream tasks.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — pretrained embedding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Suchin Gururangan , Tam Dang , Dallas Card , Noah A. Smith

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Machine Learning > Optimization & Theory > Bayesian Inference Deep Learning > Models > Variational Inference Deep Learning > Techniques > Pretraining Natural Language Processing > Applications > Text Classification Machine Learning > Learning Paradigms > Semi-Supervised Learning Deep Learning > Learning Types > Semi-Supervised Learning

Keywords

unsupervised learning semi-supervised learning feature learning text classification variational autoencoder pretrained embedding unlabeled datum in-domain pretraining

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019