Personalized Federated Learning for Text Classification with Gradient-Free Prompt Tuning

Rui Wang; Tong Yu; Ruiyi Zhang; Sungchul Kim; Ryan Rossi; Handong Zhao; Junda Wu; Subrata Mitra; Lina Yao; Ricardo Henao

2024 NAACL NAACL 2024

Personalized Federated Learning for Text Classification with Gradient-Free Prompt Tuning

Abstract

AbstractIn this paper, we study personalized federated learning for text classification with Pretrained Language Models (PLMs). We identify two challenges in efficiently leveraging PLMs for personalized federated learning: 1) Communication. PLMs are usually large in size, e.g., with hundreds of millions of parameters, inducing huge communication cost in a federated setting. 2) Local Training. Training with PLMs generally requires back-propagation, during which memory consumption can be several times that of the forward-propagation. This may not be affordable when the PLMs are trained locally on the clients that are resource constrained, e.g., mobile devices with limited access to memory resources. Additionally, the proprietary PLMs can be provided as concealed APIs, for which the back-propagation operations may not be available. In solving these, we propose a training framework that includes an approach of discrete local search for gradient-free local training, along with a compression mechanism inspired from the linear word analogy that allows communicating with discretely indexed tokens, thus significantly reducing the communication cost. Experiments show that our gradient-free framework achieves superior performance compared with baselines.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rui Wang , Tong Yu , Ruiyi Zhang , Sungchul Kim , Ryan Rossi , Handong Zhao , Junda Wu , Subrata Mitra , Lina Yao , Ricardo Henao

Topics

Artificial Intelligence > Learning Paradigms > Federated Learning Machine Learning > Application Areas > Efficient Computing Natural Language Processing > Applications > Text Classification

Keywords

federated learning text classification communication efficiency prompt tuning gradient-free optimization pretrained language model

Download PDF

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024