2023
ACL
ACL 2023
Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification
Abstract
AbstractRandomly masking text spans in ordinary texts in the pre-training stage hardly allows models to acquire the ability to generate simple texts. It can hurt the performance of pre-trained models on text simplification tasks. In this paper, we propose a new continued pre-training strategy to teach the pre-trained model to generate simple texts. We continue pre-training BART, a representative model, to obtain SimpleBART. It consistently and significantly improves the results on lexical simplification, sentence simplification, and document-level simplification tasks over BART. At the end, we compare SimpleBART with several representative large language models (LLMs).
🌉
Interdisciplinary Bridge
— Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— continued pre-training
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
Authors
Topics
Natural Language Processing > Generation > Text Generation
Natural Language Processing > Applications > Text Classification
Machine Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Fine-Tuning
Natural Language Processing > Applications > Text Simplification