Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

Mujeen Sung; James Gung; Elman Mansimov; Nikolaos Pappas; Raphael Shu; Salvatore Romeo; Yi Zhang; Vittorio Castelli

EMNLP 2023

Abstract

Intent classification (IC) plays an important role in task-oriented dialogue systems. However, IC models often generalize poorly when trained without sufficient annotated examples for each user intent. We propose a novel pre-training method for text encoders that uses contrastive learning with intent pseudo-labels to produce embeddings that are well-suited for IC tasks, reducing the need for manual annotations. Applying this pre-training strategy, we introduce the Pre-trained Intent-aware Encoder (PIE), which is designed to align encodings of utterances with their intent names. Specifically, we first train a tagger to identify key phrases within utterances that are crucial for interpreting intents. We then use these extracted phrases to create examples for pre-training a text encoder in a contrastive manner. As a result, our PIE model achieves up to 5.4% and 4.0% higher accuracy than the previous state-of-the-art pre-trained text encoder for the N-way zero- and one-shot settings on four IC datasets.
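The idea of aligning utterance encodings with intent names can be illustrated with a small sketch. It is not the authors' implementation: it assumes an InfoNCE-style objective with in-batch negatives (one common form of contrastive learning; the abstract does not specify the exact loss), and it uses hand-written toy vectors in place of a real text encoder. The function names `info_nce` and `zero_shot_predict`, the temperature value, and the intent names are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def info_nce(utt_embs, label_embs, temperature=0.1):
    """Mean InfoNCE loss over a batch (assumed objective, not from the paper).

    utt_embs[i] is paired with label_embs[i] (e.g. an utterance and its
    pseudo-label); every other label in the batch acts as a negative.
    """
    total = 0.0
    for i, utt in enumerate(utt_embs):
        logits = [cosine(utt, lab) / temperature for lab in label_embs]
        # Numerically stable log-sum-exp over all candidate labels.
        peak = max(logits)
        log_z = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_z - logits[i]
    return total / len(utt_embs)


def zero_shot_predict(utt_emb, intent_embs):
    """Return the intent name whose embedding is most similar to the utterance."""
    return max(intent_embs, key=lambda name: cosine(utt_emb, intent_embs[name]))


# Toy 2-d embeddings standing in for encoder outputs (hypothetical values).
utterances = [[1.0, 0.0], [0.0, 1.0]]
matched_labels = [[1.0, 0.0], [0.0, 1.0]]
swapped_labels = [[0.0, 1.0], [1.0, 0.0]]

loss_matched = info_nce(utterances, matched_labels)
loss_swapped = info_nce(utterances, swapped_labels)

intents = {"book_flight": [1.0, 0.0], "play_music": [0.0, 1.0]}
prediction = zero_shot_predict([0.9, 0.1], intents)
```

In this sketch the loss is lower when each utterance sits close to its own label than when the pairs are swapped, which is the pressure a contrastive objective applies during pre-training. At inference time, zero-shot classification then reduces to picking the nearest intent-name embedding, with no labeled utterances required.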

Topics

Machine Learning > Learning Types > Contrastive Learning
Machine Learning > Learning Types > Zero-Shot Learning
Machine Learning > Learning Types > Few-Shot Learning
Natural Language Processing > Applications > Intent Classification

Keywords

contrastive learning; zero-shot learning; few-shot learning; intent classification; pre-trained encoder
