DA-Pred: Performance Prediction for Text Summarization under Domain-Shift and Instruct-Tuning

Anum Afzal; Florian Matthes; Alexander Fabbri

2025 EMNLP EMNLP 2025

DA-Pred: Performance Prediction for Text Summarization under Domain-Shift and Instruct-Tuning

Abstract

AbstractLarge Language Models (LLMs) often don’t perform as expected under Domain Shift or after Instruct-tuning. A reliable indicator of LLM performance in these settings could assist in decision-making. We present a method that uses the known performance in high-resource domains and fine-tuning settings to predict performance in low-resource domains or base models, respectively. In our paper, we formulate the task of performance prediction, construct a dataset for it, and train regression models to predict the said change in performance. Our proposed methodology is lightweight and, in practice, can help researchers & practitioners decide if resources should be allocated for data labeling and LLM Instruct-tuning.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Anum Afzal , Florian Matthes , Alexander Fabbri

Topics

Machine Learning > Core Methods > Regression Machine Learning > Application Areas > Domain Adaptation Machine Learning > Application Areas > Domain Generalization Machine Learning > Learning Types > Transfer Learning Artificial Intelligence > Core AI > Large Language Models Machine Learning > Learning Types > Domain Adaptation Natural Language Processing > Applications > Summarization

Keywords

domain adaptation text summarization instruction tuning domain shift regression model performance prediction large language model

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025