Neural Text Summarization: A Critical Evaluation

Wojciech Kryscinski; Nitish Shirish Keskar; Bryan McCann; Caiming Xiong; Richard Socher

EMNLP 2019

Abstract

Text summarization aims at compressing long documents into a shorter form that conveys the most important parts of the original document. Despite increased interest in the community and notable research effort, progress on benchmark datasets has stagnated. We critically evaluate key ingredients of the current research setup: datasets, evaluation metrics, and models, and highlight three primary shortcomings: 1) automatically collected datasets leave the task underconstrained and may contain noise detrimental to training and evaluation, 2) current evaluation protocol is weakly correlated with human judgment and does not account for important characteristics such as factual correctness, 3) models overfit to layout biases of current datasets and offer limited diversity in their outputs.

Topics

Natural Language Processing > Generation > Summarization

Keywords

text summarization, evaluation metrics, factual correctness, dataset analysis, neural network
