Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge

Yan-Lun Chen; Yi-Ru Wei; Chia-Yi Hsu; Chia-Mu Yu; Chun-Ying Huang; Ying-Dar Lin; Yu-Sung Wu; Wei-Bin Lee

2025 EMNLP EMNLP 2025

Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge

Abstract

AbstractLarge language models (LLMs) demonstrate strong task-specific capabilities through fine-tuning, but merging multiple fine-tuned models often leads to degraded performance due to overlapping instruction-following components. Task Arithmetic (TA), which combines task vectors derived from fine-tuning, enables multi-task learning and task forgetting but struggles to isolate task-specific knowledge from general instruction-following behavior. To address this, we propose Layer-Aware Task Arithmetic (LATA), a novel approach that assigns layer-specific weights to task vectors based on their alignment with instruction-following or task-specific components. By amplifying task-relevant layers and attenuating instruction-following layers, LATA improves task learning and forgetting performance while preserving overall model utility. Experiments on multiple benchmarks, including WikiText-2, GSM8K, and HumanEval, demonstrate that LATA outperforms existing methods in both multi-task learning and selective task forgetting, achieving higher task accuracy and alignment with minimal degradation in output quality. Our findings highlight the importance of layer-wise analysis in disentangling task-specific and general-purpose knowledge, offering a robust framework for efficient model merging and editing.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yan-Lun Chen , Yi-Ru Wei , Chia-Yi Hsu , Chia-Mu Yu , Chun-Ying Huang , Ying-Dar Lin , Yu-Sung Wu , Wei-Bin Lee

Topics

Artificial Intelligence > Core AI > Model Compression Machine Learning > Application Areas > Model Merging Machine Learning > Learning Types > Transfer Learning Artificial Intelligence > Core AI > Knowledge Editing Deep Learning > Techniques > Knowledge Distillation

Keywords

knowledge editing instruction following model merging layer-wise analysis task vector task arithmetic

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025