RankMean: Module-Level Importance Score for Merging Fine-tuned LLM Models

Gabriel Perin; Xuxi Chen; Shusen Liu; Bhavya Kailkhura; Zhangyang Wang; Brian Gallagher

2024 ACL ACL 2024

RankMean: Module-Level Importance Score for Merging Fine-tuned LLM Models

Abstract

AbstractTraditionally, developing new language models (LMs) capable of addressing multiple tasks involves fine-tuning pre-trained LMs using a wide collection of datasets, a process that often incurs significant computational expenses. Model merging emerges as a cost-effective alternative, allowing the integration of existing models fine-tuned on different tasks into a single model that performs well across all tasks, eliminating the need for additional training. In this paper, we propose RankMean, an algorithm for merging fine-tuned LMs without requiring any downstream data. RankMean determines merging coefficients based on the relative rankings of weight change magnitudes and applies these coefficients for module-wise integration of various fine-tuned models. Our experimental results demonstrate that RankMean outperforms existing baseline methods on multiple benchmarks. The code is available at https://github.com/VITA-Group/RankMean.

🧭 Keyword Pioneer — weight change magnitude

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Gabriel Perin , Xuxi Chen , Shusen Liu , Bhavya Kailkhura , Zhangyang Wang , Brian Gallagher

Topics

Machine Learning > Application Areas > Model Merging

Keywords

model merging fine-tuned model weight change magnitude module-wise integration

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024