Perception Score: A Learned Metric for Open-ended Text Generation Evaluation

Jing Gu; Qingyang Wu; Zhou Yu

2021 AAAI AAAI 2021

Perception Score: A Learned Metric for Open-ended Text Generation Evaluation

Abstract

Abstract Automatic evaluation for open-ended natural language generation tasks remains a challenge. We propose a learned evaluation metric: Perception Score. It utilizes a pre-trained model and considers context information for conditional generation. Perception Score assigns a holistic score along with the uncertainty measurement. We conduct experiments on three open-ended conditional generation tasks and two open-ended unconditional generation tasks. Perception Score achieves state-of-the-art results on all the tasks consistently in terms of correlation with human evaluation scores.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Evaluation

🧭 Keyword Pioneer — open-ended generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Jing Gu , Qingyang Wu , Zhou Yu

Topics

Natural Language Processing > Generation > Language Modeling Natural Language Processing > Generation > Text Generation Machine Learning > Learning Types > Evaluation Deep Learning > Learning Types > Evaluation

Keywords

conditional generation human evaluation text generation evaluation open-ended generation learned metric uncertainty measurement perception score

Download PDF

Related papers

Contextual Conditional Reasoning 2021

Attention Beam: An Image Captioning Approach (Student Abstract) 2021

Movie Summarization via Sparse Graph Construction 2021

Text Analysis for Understanding Symptoms of Social Anxiety in Student Veterans 2021

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs 2021