EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering

Zhibin Duan; Hao Zhang; Chaojie Wang; Zhengjue Wang; Bo Chen; Mingyuan Zhou

ACL 2021

Abstract

Natural language processing (NLP) often faces the problem of data diversity: samples differ in domain, theme, style, and so on. As a result, a single language model (LM) is insufficient to learn all knowledge from diverse samples. To solve this problem, we first propose an autoencoding topic model with a mixture prior (mATM) to cluster the data, where the clusters, defined in semantic space, describe the data diversity. Having obtained the cluster assignment for each sample, we develop the ensemble LM (EnsLM) using the technique of weight modulation. Specifically, EnsLM contains a backbone that is adjusted by a small number of modulated weights to fit different sample clusters. The backbone thus learns the knowledge shared among all clusters, while the modulated weights extract cluster-specific features. EnsLM can be trained jointly with mATM and works with a flexible choice of LM backbone. We evaluate the effectiveness of both mATM and EnsLM on various tasks.
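The weight-modulation idea can be illustrated with a minimal sketch. The abstract does not specify the exact form of the modulation, so the code below assumes one simple possibility: a single shared weight matrix whose rows and columns are rescaled by small cluster-specific vectors. The class name `ModulatedLinear` and the attributes `row_scale` and `col_scale` are hypothetical and are not taken from the paper, and the cluster index is assumed to come from a separate clustering step such as mATM.

```python
import random


class ModulatedLinear:
    """Linear layer with shared weights and per-cluster scaling (illustrative only)."""

    def __init__(self, in_dim, out_dim, num_clusters, seed=0):
        rng = random.Random(seed)
        # Shared backbone weights, used by every cluster.
        self.weight = [
            [rng.gauss(0.0, 0.1) for _ in range(in_dim)] for _ in range(out_dim)
        ]
        # Assumed modulation: one row-scaling and one column-scaling vector per
        # cluster, initialised to 1 so every cluster starts from the plain backbone.
        self.row_scale = [[1.0] * out_dim for _ in range(num_clusters)]
        self.col_scale = [[1.0] * in_dim for _ in range(num_clusters)]

    def forward(self, x, cluster):
        """Apply the backbone weights, rescaled for the given cluster index."""
        a = self.row_scale[cluster]
        b = self.col_scale[cluster]
        return [
            a[i] * sum(self.weight[i][j] * b[j] * x[j] for j in range(len(x)))
            for i in range(len(self.weight))
        ]

    def extra_params_per_cluster(self):
        """Number of cluster-specific parameters added on top of the backbone."""
        return len(self.row_scale[0]) + len(self.col_scale[0])


layer = ModulatedLinear(in_dim=4, out_dim=3, num_clusters=2)
x = [1.0, 2.0, 3.0, 4.0]
shared_output = layer.forward(x, cluster=0)

# Changing one modulation entry affects only the cluster that owns it.
layer.row_scale[1][0] = 2.0
cluster1_output = layer.forward(x, cluster=1)
```

In this sketch each cluster adds only `in_dim + out_dim` parameters to a layer that has `in_dim * out_dim` shared ones, which is the sense in which a few modulated weights can specialise a shared backbone.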

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Core Methods > Embedding Learning
Natural Language Processing > Generation > Language Modeling
Machine Learning > Learning Paradigms > Multi-Task Learning
Deep Learning > Learning Types > Representation Learning
Deep Learning > Models > Language Models

Keywords

representation learning, ensemble learning, language modeling, topic model, language model, semantic clustering, data diversity, weight modulation
