2018
ACL
ACL 2018
Universal Language Model Fine-tuning for Text Classification
Abstract
AbstractInductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100 times more data. We open-source our pretrained models and code.
🌉
Interdisciplinary Bridge
— Machine Learning and Natural Language Processing
📈
Trend Setter
— Large Language Models
🐣
Hot Topic Early Bird
— text classification
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
🧭
Keyword Pioneer
— language model fine-tuning
Authors
Topics
Natural Language Processing > Applications > Text Classification
Natural Language Processing > Resources & Methods > Large Language Models
Machine Learning > Learning Paradigms > Transfer Learning
Machine Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Fine-Tuning