SWIFT: A Scalable Lightweight Infrastructure for Fine-Tuning

Yuze Zhao; Jintao Huang; Jinghan Hu; Xingjun Wang; Yunlin Mao; Daoze Zhang; Zeyinzi Jiang; Zhikai Wu; Baole Ai; Ang Wang; Wenmeng Zhou; Yingda Chen

2025 AAAI AAAI 2025

SWIFT: A Scalable Lightweight Infrastructure for Fine-Tuning

Abstract

Abstract Recent development in Large Language Models (LLMs) and Multi-modal Large Language Models (MLLMs) have achieved superior performance and generalization capabilities, covered extensive areas of traditional tasks. However, existing large model training frameworks support only a limited number of models and techniques, particularly lacking in support for new models, which makes fine-tuning LLMs challenging for most developers. Therefore, we develop SWIFT, a customizable one-stop infrastructure for large models. With support of over 350+ LLMs and 80+ MLLMs, SWIFT stands as the open-source framework that provide the most comprehensive support for fine-tuning large models. In particular, it is the first training framework that provides systematic support for MLLMs. Moreover, SWIFT integrates post-training processes such as inference, evaluation, and quantization, to facilitate fast adoptions of large models in various application scenarios, offering helpful utilities like benchmark comparisons among different training techniques.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yuze Zhao , Jintao Huang , Jinghan Hu , Xingjun Wang , Yunlin Mao , Daoze Zhang , Zeyinzi Jiang , Zhikai Wu , Baole Ai , Ang Wang , Wenmeng Zhou , Yingda Chen

Topics

Machine Learning > Application Areas > Efficient Computing Natural Language Processing > Resources & Methods > Large Language Models Deep Learning > Models > Large Language Models Deep Learning > Techniques > Transfer Learning

Keywords

model quantization multi-modal learning model training large language model

Download PDF

Related papers

BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving 2025

APIRL: Deep Reinforcement Learning for REST API Fuzzing 2025

Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation 2025

3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly Detection 2025

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics 2025