2017
EACL
EACL 2017
The limits of automatic summarisation according to ROUGE
Abstract
AbstractThis paper discusses some central caveats of summarisation, incurred in the use of the ROUGE metric for evaluation, with respect to optimal solutions. The task is NP-hard, of which we give the first proof. Still, as we show empirically for three central benchmark datasets for the task, greedy algorithms empirically seem to perform optimally according to the metric. Additionally, overall quality assurance is problematic: there is no natural upper bound on the quality of summarisation systems, and even humans are excluded from performing optimal summarisation.
🌉
Interdisciplinary Bridge
— Machine Learning and Mathematics & Optimization and Natural Language Processing
🧭
Keyword Pioneer
— evaluation benchmark
🐣
Hot Topic Early Bird
— text summarization
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Optimization & Theory > Theory
Natural Language Processing > Generation > Summarization
Mathematics & Optimization > Optimization > Combinatorial Optimization
Mathematics & Optimization > Optimization > Discrete Optimization
Natural Language Processing > Applications > Summarization