2023
EMNLP
EMNLP 2023
Enhancing Abstractiveness of Summarization Models through Calibrated Distillation
Abstract
AbstractIn this paper, we propose a novel approach named DisCal to enhance the level of abstractiveness (measured by n-gram overlap) without sacrificing the informativeness (measured by ROUGE) of generated summaries. DisCal exposes diverse pseudo summaries with two supervision to the student model. Firstly, the best pseudo summary is identified in terms of abstractiveness and informativeness and used for sequence-level distillation. Secondly, their ranks are used to ensure the student model to assign higher prediction scores to summaries with higher ranks. Our experiments show that DisCal outperforms prior methods in abstractive summarization distillation, producing highly abstractive and informative summaries.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— calibrated distillation
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Deep Learning > Models > Generative Models
Deep Learning > Techniques > Model Architecture
Natural Language Processing > Generation > Summarization
Natural Language Processing > Applications > Summarization
Machine Learning > Learning Types > Knowledge Distillation
Deep Learning > Techniques > Knowledge Distillation
Deep Learning > Learning Types > Model Compression