2021
EMNLP
EMNLP 2021
Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning
Abstract
AbstractWe introduce EfficientCL, a memory-efficient continual pretraining method that applies contrastive learning with novel data augmentation and curriculum learning. For data augmentation, we stack two types of operation sequentially: cutoff and PCA jittering. While pretraining steps proceed, we apply curriculum learning by incrementing the augmentation degree for each difficulty step. After data augmentation is finished, contrastive learning is applied on projected embeddings of original and augmented examples. When finetuned on GLUE benchmark, our model outperforms baseline models, especially for sentence-level tasks. Additionally, this improvement is capable with only 70% of computational memory compared to the baseline model.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning and Natural Language Processing
📈
Trend Setter
— Curriculum Learning
🐣
Hot Topic Early Bird
— continual pretraining
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Learning Types > Contrastive Learning
Machine Learning > Learning Types > Self-Supervised Learning
Natural Language Processing > Generation > Language Modeling
Machine Learning > Learning Paradigms > Few-Shot Learning
Deep Learning > Learning Types > Contrastive Learning
Deep Learning > Learning Types > Data Augmentation
Deep Learning > Learning Types > Curriculum Learning