EASE: Extractive-Abstractive Summarization End-to-End using the Information Bottleneck Principle

Haoran Li; Arash Einolghozati; Srinivasan Iyer; Bhargavi Paranjape; Yashar Mehdad; Sonal Gupta; Marjan Ghazvininejad

2021 EMNLP EMNLP 2021

EASE: Extractive-Abstractive Summarization End-to-End using the Information Bottleneck Principle

Abstract

AbstractCurrent abstractive summarization systems outperform their extractive counterparts, but their widespread adoption is inhibited by the inherent lack of interpretability. Extractive summarization systems, though interpretable, suffer from redundancy and possible lack of coherence. To achieve the best of both worlds, we propose EASE, an extractive-abstractive framework that generates concise abstractive summaries that can be traced back to an extractive summary. Our framework can be applied to any evidence-based text generation problem and can accommodate various pretrained models in its simple architecture. We use the Information Bottleneck principle to jointly train the extraction and abstraction in an end-to-end fashion. Inspired by previous research that humans use a two-stage framework to summarize long documents (Jing and McKeown, 2000), our framework first extracts a pre-defined amount of evidence spans and then generates a summary using only the evidence. Using automatic and human evaluations, we show that the generated summaries are better than strong extractive and extractive-abstractive baselines.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Haoran Li , Arash Einolghozati , Srinivasan Iyer , Bhargavi Paranjape , Yashar Mehdad , Sonal Gupta , Marjan Ghazvininejad

Topics

Machine Learning > Core Methods > Representation Learning Natural Language Processing > Generation > Summarization Deep Learning > Optimization & Theory > Optimization Artificial Intelligence > Core AI > Natural Language Processing

Keywords

information bottleneck extractive summarization text generation end-to-end learning pretrained model abstractive summarization evidence extraction

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021