Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

George Kour; Samuel Ackerman; Eitan Daniel Farchi; Orna Raz; Boaz Carmeli; Ateret Anaby Tavor

2022 EMNLP EMNLP 2022

Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

Abstract

AbstractSimilarity metrics for text corpora are becoming critical due to the tremendous growth in the number of generative models. These similarity metrics measure the semantic gap between human and machine-generated text on the corpus level. However, standard methods for evaluating the characteristics of these metrics have yet to be established. We propose a set of automatic measures for evaluating the characteristics of semantic similarity metrics for text corpora. Our measures allow us to sensibly compare and identify the strengths and weaknesses of these metrics. We demonstrate the effectiveness of our evaluation measures in capturing fundamental characteristics by comparing it to a collection of classical and state-of-the-art metrics. Our measures revealed that recent metrics are becoming better in identifying semantic distributional mismatch while classical metrics are more sensitive to perturbations in the surface text levels.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — semantic similarity metrics

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

George Kour , Samuel Ackerman , Eitan Daniel Farchi , Orna Raz , Boaz Carmeli , Ateret Anaby Tavor

Topics

Machine Learning > Core Methods > Metric Learning Machine Learning > Optimization & Theory > Optimization Natural Language Processing > Applications > Text Generation Machine Learning > Optimization & Theory > Evaluation

Keywords

Download PDF

Generative Entity Typing with Curriculum Learning 2022

Towards Reinterpreting Neural Topic Models via Composite Activations 2022

Weakly Supervised Headline Dependency Parsing 2022

Cross-modal Transfer Between Vision and Language for Protest Detection 2022

Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

Abstract

Authors

Topics

Keywords

Related papers