UER: An Open-Source Toolkit for Pre-training Models

Zhe Zhao; Hui Chen; Jinbin Zhang; Xin Zhao; Tao Liu; Wei Lu; Xi Chen; Haotang Deng; Qi Ju; Xiaoyong Du

2019 EMNLP EMNLP 2019

UER: An Open-Source Toolkit for Pre-training Models

Abstract

AbstractExisting works, including ELMO and BERT, have revealed the importance of pre-training for NLP tasks. While there does not exist a single pre-training model that works best in all cases, it is of necessity to develop a framework that is able to deploy various pre-training models efficiently. For this purpose, we propose an assemble-on-demand pre-training toolkit, namely Universal Encoder Representations (UER). UER is loosely coupled, and encapsulated with rich modules. By assembling modules on demand, users can either reproduce a state-of-the-art pre-training model or develop a pre-training model that remains unexplored. With UER, we have built a model zoo, which contains pre-trained models based on different corpora, encoders, and targets (objectives). With proper pre-trained models, we could achieve new state-of-the-art results on a range of downstream datasets.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Pretraining

🧭 Keyword Pioneer — model zoo

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zhe Zhao , Hui Chen , Jinbin Zhang , Xin Zhao , Tao Liu , Wei Lu , Xi Chen , Haotang Deng , Qi Ju , Xiaoyong Du

Topics

Artificial Intelligence > Core AI > Foundation Models Machine Learning > Core Methods > Representation Learning Machine Learning > Learning Types > Self-Supervised Learning Machine Learning > Learning Types > Transfer Learning Natural Language Processing > Resources & Methods > Language Modeling Deep Learning > Learning Types > Self-Supervised Learning Deep Learning > Learning Types > Transfer Learning Natural Language Processing > Resources & Methods > Pretraining

Keywords

model compression transfer learning text representation language model downstream task model zoo pre-training toolkit universal encoder representation encoder module

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019