Enhancing Abstractiveness of Summarization Models through Calibrated Distillation

Hwanjun Song; Igor Shalyminov; Hang Su; Siffi Singh; Kaisheng Yao; Saab Mansour

2023 EMNLP EMNLP 2023

Enhancing Abstractiveness of Summarization Models through Calibrated Distillation

Abstract

AbstractIn this paper, we propose a novel approach named DisCal to enhance the level of abstractiveness (measured by n-gram overlap) without sacrificing the informativeness (measured by ROUGE) of generated summaries. DisCal exposes diverse pseudo summaries with two supervision to the student model. Firstly, the best pseudo summary is identified in terms of abstractiveness and informativeness and used for sequence-level distillation. Secondly, their ranks are used to ensure the student model to assign higher prediction scores to summaries with higher ranks. Our experiments show that DisCal outperforms prior methods in abstractive summarization distillation, producing highly abstractive and informative summaries.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — calibrated distillation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hwanjun Song , Igor Shalyminov , Hang Su , Siffi Singh , Kaisheng Yao , Saab Mansour

Topics

Deep Learning > Models > Generative Models Deep Learning > Techniques > Model Architecture Natural Language Processing > Generation > Summarization Natural Language Processing > Applications > Summarization Machine Learning > Learning Types > Knowledge Distillation Deep Learning > Techniques > Knowledge Distillation Deep Learning > Learning Types > Model Compression

Keywords

model compression knowledge distillation text generation abstractive summarization sequence-level distillation calibrated distillation pseudo summary

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023