Effectiveness of Pre-training for Few-shot Intent Classification

Haode Zhang; Yuwei Zhang; Li-Ming Zhan; JIAXIN CHEN; Guangyuan SHI; Albert Y.S. Lam; Xiao-ming Wu

2021 EMNLP EMNLP 2021

Effectiveness of Pre-training for Few-shot Intent Classification

Abstract

AbstractThis paper investigates the effectiveness of pre-training for few-shot intent classification. While existing paradigms commonly further pre-train language models such as BERT on a vast amount of unlabeled corpus, we find it highly effective and efficient to simply fine-tune BERT with a small set of labeled utterances from public datasets. Specifically, fine-tuning BERT with roughly 1,000 labeled data yields a pre-trained model – IntentBERT, which can easily surpass the performance of existing pre-trained models for few-shot intent classification on novel domains with very different semantics. The high effectiveness of IntentBERT confirms the feasibility and practicality of few-shot intent detection, and its high generalization ability across different domains suggests that intent classification tasks may share a similar underlying structure, which can be efficiently learned from a small set of labeled data. The source code can be found at https://github.com/hdzhang-code/IntentBERT.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Haode Zhang , Yuwei Zhang , Li-Ming Zhan , JIAXIN CHEN , Guangyuan SHI , Albert Y.S. Lam , Xiao-ming Wu

Topics

Artificial Intelligence > Core AI > Foundation Models Artificial Intelligence > Learning Paradigms > Few-Shot Learning Machine Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Few-Shot Learning Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Learning Types > Fine-Tuning

Keywords

few-shot learning transfer learning domain adaptation intent classification language model

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021