Combining Unsupervised Pre-training and Annotator Rationales to Improve Low-shot Text Classification

Oren Melamud; Mihaela Bornea; Ken Barker

2019 EMNLP EMNLP 2019

Combining Unsupervised Pre-training and Annotator Rationales to Improve Low-shot Text Classification

Abstract

AbstractSupervised learning models often perform poorly at low-shot tasks, i.e. tasks for which little labeled data is available for training. One prominent approach for improving low-shot learning is to use unsupervised pre-trained neural models. Another approach is to obtain richer supervision by collecting annotator rationales (explanations supporting label annotations). In this work, we combine these two approaches to improve low-shot text classification with two novel methods: a simple bag-of-words embedding approach; and a more complex context-aware method, based on the BERT model. In experiments with two English text classification datasets, we demonstrate substantial performance gains from combining pre-training with rationales. Furthermore, our investigation of a range of train-set sizes reveals that the simple bag-of-words approach is the clear top performer when there are only a few dozen training instances or less, while more complex models, such as BERT or CNN, require more training data to shine.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Oren Melamud , Mihaela Bornea , Ken Barker

Topics

Artificial Intelligence > Learning Paradigms > Few-Shot Learning Natural Language Processing > Applications > Text Classification Machine Learning > Learning Paradigms > Few-Shot Learning Deep Learning > Learning Types > Self-Supervised Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

few-shot learning text classification unsupervised pre-training annotator rationale low-shot learning

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019