ESTIME: Estimation of Summary-to-Text Inconsistency by Mismatched Embeddings

Oleg Vasilyev; John Bohannon

2021 EMNLP EMNLP 2021

ESTIME: Estimation of Summary-to-Text Inconsistency by Mismatched Embeddings

Abstract

AbstractWe propose a new reference-free summary quality evaluation measure, with emphasis on the faithfulness. The measure is based on finding and counting all probable potential inconsistencies of the summary with respect to the source document. The proposed ESTIME, Estimator of Summary-to-Text Inconsistency by Mismatched Embeddings, correlates with expert scores in summary-level SummEval dataset stronger than other common evaluation measures not only in Consistency but also in Fluency. We also introduce a method of generating subtle factual errors in human summaries. We show that ESTIME is more sensitive to subtle errors than other common evaluation measures.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

📈 Trend Setter — Evaluation

🧭 Keyword Pioneer — text inconsistency

🐣 Hot Topic Early Bird — summarization evaluation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Oleg Vasilyev , John Bohannon

Topics

Natural Language Processing > Generation > Summarization Natural Language Processing > Applications > Text Classification Natural Language Processing > Applications > Summarization Deep Learning > Techniques > Representation Learning Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Learning Types > Evaluation

Keywords

embedding learning text generation summarization evaluation semantic embedding reference-free evaluation semantic inconsistency factual error summary evaluation text inconsistency

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021