Long Document Summarization in a Low Resource Setting using Pretrained Language Models

Ahsaas Bajaj; Pavitra Dangati; Kalpesh Krishna; Pradhiksha Ashok Kumar; Rheeya Uppaal; Bradford Windsor; Eliot Brenner; Dominic Dotterrer; Rajarshi Das; Andrew McCallum

2021 IJCNLP IJCNLP 2021

Long Document Summarization in a Low Resource Setting using Pretrained Language Models

Abstract

AbstractAbstractive summarization is the task of compressing a long document into a coherent short document while retaining salient information. Modern abstractive summarization methods are based on deep neural networks which often require large training datasets. Since collecting summarization datasets is an expensive and time-consuming task, practical industrial settings are usually low-resource. In this paper, we study a challenging low-resource setting of summarizing long legal briefs with an average source document length of 4268 words and only 120 available (document, summary) pairs. To account for data scarcity, we used a modern pre-trained abstractive summarizer BART, which only achieves 17.9 ROUGE-L as it struggles with long documents. We thus attempt to compress these long documents by identifying salient sentences in the source which best ground the summary, using a novel algorithm based on GPT-2 language model perplexity scores, that operates within the low resource regime. On feeding the compressed documents to BART, we observe a 6.0 ROUGE-L improvement. Our method also beats several competitive salience detection baselines. Furthermore, the identified salient sentences tend to agree with independent human labeling by domain experts.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning

🐣 Hot Topic Early Bird — low-resource learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Ahsaas Bajaj , Pavitra Dangati , Kalpesh Krishna , Pradhiksha Ashok Kumar , Rheeya Uppaal , Bradford Windsor , Eliot Brenner , Dominic Dotterrer , Rajarshi Das , Andrew McCallum

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Deep Learning > Models > Generative Models Deep Learning > Techniques > Pretraining

Keywords

low-resource learning pretrained language model abstractive summarization document compression salience detection

Download PDF

Flesch-Kincaid is Not a Text Simplification Evaluation Metric 2021

Semantic Similarity Based Evaluation for Abstractive News Summarization 2021

Figurative Language in Recognizing Textual Entailment 2021

Sequence Models for Computational Etymology of Borrowings 2021

Long Document Summarization in a Low Resource Setting using Pretrained Language Models

Abstract

Authors

Topics

Keywords

Related papers