2025
EMNLP
EMNLP 2025
DA-Pred: Performance Prediction for Text Summarization under Domain-Shift and Instruct-Tuning
Abstract
AbstractLarge Language Models (LLMs) often donโt perform as expected under Domain Shift or after Instruct-tuning. A reliable indicator of LLM performance in these settings could assist in decision-making. We present a method that uses the known performance in high-resource domains and fine-tuning settings to predict performance in low-resource domains or base models, respectively. In our paper, we formulate the task of performance prediction, construct a dataset for it, and train regression models to predict the said change in performance. Our proposed methodology is lightweight and, in practice, can help researchers & practitioners decide if resources should be allocated for data labeling and LLM Instruct-tuning.
๐
Interdisciplinary Bridge
โ Artificial Intelligence and Machine Learning and Natural Language Processing
๐
Cross-Pollinator
โ Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Regression
Machine Learning > Application Areas > Domain Adaptation
Machine Learning > Application Areas > Domain Generalization
Machine Learning > Learning Types > Transfer Learning
Artificial Intelligence > Core AI > Large Language Models
Machine Learning > Learning Types > Domain Adaptation
Natural Language Processing > Applications > Summarization