LM-Cocktail: Resilient Tuning of Language Models via Model Merging

Shitao Xiao; Zheng Liu; Peitian Zhang; Xingrun Xing

2024 ACL ACL 2024

LM-Cocktail: Resilient Tuning of Language Models via Model Merging

Abstract

AbstractThe pre-trained language models are continually fine-tuned to better support downstream applications. However, this operation may result in significant performance degeneration on general tasks beyond the targeted domain. To overcome this problem, we propose LM-Cocktail which enables the fine-tuned model to stay resilient in general perspectives. Our method is conducted in the form of model merging, where the fine-tuned language model is merged with the pre-trained base model or the peer models from other domains through weighted average. Despite simplicity, LM-Cocktail is surprisingly effective: the resulted model is able to achieve a strong empirical performance in the whole scope of general tasks while preserving a superior capacity in its targeted domain.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shitao Xiao , Zheng Liu , Peitian Zhang , Xingrun Xing

Topics

Machine Learning > Application Areas > Model Merging Natural Language Processing > Resources & Methods > Large Language Models Deep Learning > Optimization & Theory > Model Compression Deep Learning > Learning Types > Transfer Learning

Keywords

transfer learning model merging language model model fine-tuning model resilience weighted average

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024