Predicting generalization performance with correctness discriminators

Yuekun Yao; Alexander Koller

2024 EMNLP EMNLP 2024

Predicting generalization performance with correctness discriminators

Abstract

AbstractThe ability to predict an NLP model’s accuracy on unseen, potentially out-of-distribution data is a prerequisite for trustworthiness. We present a novel model that establishes upper and lower bounds on the accuracy, without requiring gold labels for the unseen data. We achieve this by training a *discriminator* which predicts whether the output of a given sequence-to-sequence model is correct or not. We show across a variety of tagging, parsing, and semantic parsing tasks that the gold accuracy is reliably between the predicted upper and lower bounds, and that these bounds are remarkably close together.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yuekun Yao , Alexander Koller

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Embedding Learning Machine Learning > Optimization & Theory > Theory Artificial Intelligence > Core AI > Large Language Models Machine Learning > Optimization & Theory > Evaluation Machine Learning > Learning Types > Evaluation Natural Language Processing > Applications > Natural Language Understanding Machine Learning > Learning Types > Generalization

Keywords

out-of-distribution detection generalization performance sequence-to-sequence model out-of-distribution datum accuracy prediction correctness discriminator accuracy bound

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024