Team NLLG submission for Eval4NLP 2023 Shared Task: Retrieval-Augmented In-Context Learning for NLG Evaluation

Daniil Larionov; Vasiliy Viskov; George Kokush; Alexander Panchenko; Steffen Eger

IJCNLP 2023

Abstract

In this paper, we propose retrieval-augmented in-context learning for natural language generation (NLG) evaluation. This method allows practitioners to use large language models (LLMs) for various NLG evaluation tasks without any fine-tuning. We apply our approach to the translation evaluation and summarization evaluation subtasks of the Eval4NLP 2023 Shared Task. Our findings suggest that retrieval-augmented in-context learning is a promising approach for building LLM-based evaluation metrics for NLG. Directions for further research include comparing the performance of various publicly available LLMs and identifying which LLM properties help improve the quality of the metric.
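The general idea of retrieval-augmented in-context learning for evaluation can be sketched as follows: for each item to be scored, retrieve similar human-scored examples and place them in the prompt as demonstrations. This is only a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not specify the retriever, the prompt format, or the LLM, so the bag-of-words cosine retriever, the prompt wording, the 0 to 100 scale, and the example pool below are all hypothetical, and the call to the LLM itself is left out.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased token counts; a stand-in for a real sentence embedding."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, pool, k=2):
    """Return the k scored examples whose source text is most similar to the query."""
    q = bag_of_words(query)
    ranked = sorted(pool, key=lambda ex: cosine(q, bag_of_words(ex["source"])), reverse=True)
    return ranked[:k]


def build_prompt(source, hypothesis, demos):
    """Format the retrieved demonstrations, followed by the unscored test item."""
    parts = ["Rate the translation quality from 0 to 100."]  # hypothetical instruction
    for ex in demos:
        parts.append(
            f"Source: {ex['source']}\nTranslation: {ex['hypothesis']}\nScore: {ex['score']}"
        )
    parts.append(f"Source: {source}\nTranslation: {hypothesis}\nScore:")
    return "\n\n".join(parts)


# Hypothetical pool of human-scored examples, invented purely for illustration.
POOL = [
    {"source": "The cat sits on the mat.", "hypothesis": "Die Katze sitzt auf der Matte.", "score": 95},
    {"source": "Stock prices fell sharply today.", "hypothesis": "Aktien heute.", "score": 30},
    {"source": "The dog sits on the sofa.", "hypothesis": "Der Hund sitzt auf dem Sofa.", "score": 92},
]

query_source = "The cat sits on the sofa."
demos = retrieve(query_source, POOL, k=2)
prompt = build_prompt(query_source, "Die Katze sitzt auf dem Sofa.", demos)
```

The resulting `prompt` string would then be sent to an LLM, whose completion after the final `Score:` is read as the metric value. No fine-tuning is involved; only the demonstrations change from one test item to the next.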

Topics

Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Large Language Models
Artificial Intelligence > Core AI > Large Language Models
Machine Learning > Learning Types > In-Context Learning

Keywords

in-context learning; natural language generation; text generation; retrieval-augmented generation; retrieval augmentation; text evaluation; natural language generation evaluation; large language model
