ReFT: Representation Finetuning for Language Models

Zhengxuan Wu; Aryaman Arora; Zheng Wang; Atticus Geiger; Dan Jurafsky; Christopher D. Manning; Christopher Potts

NeurIPS 2024

Abstract

Parameter-efficient finetuning (PEFT) methods seek to adapt large neural models via updates to a small number of weights. However, much prior interpretability work has shown that representations encode rich semantic information, suggesting that editing representations might be a more powerful alternative. We pursue this hypothesis by developing a family of Representation Finetuning (ReFT) methods. ReFT methods operate on a frozen base model and learn task-specific interventions on hidden representations. We define a strong instance of the ReFT family, Low-rank Linear Subspace ReFT (LoReFT), and we identify an ablation of this method that trades some performance for increased efficiency. Both are drop-in replacements for existing PEFTs and learn interventions that are 15x to 65x more parameter-efficient than LoRA. We showcase LoReFT on eight commonsense reasoning tasks, four arithmetic reasoning tasks, instruction-tuning, and GLUE. In all these evaluations, our ReFTs deliver the best balance of efficiency and performance, and almost always outperform state-of-the-art PEFTs. Upon publication, we will publicly release our generic ReFT training library.
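The abstract does not spell out what a learned intervention on a hidden representation looks like. As a minimal sketch, assuming the low-rank subspace edit usually associated with LoReFT, h + R^T(Wh + b - Rh), where R has orthonormal rows, the NumPy code below applies one such edit to a single hidden vector. The function name, the dimensions, and the random stand-ins for learned parameters are illustrative assumptions, not the authors' library.

```python
import numpy as np

def loreft_intervention(h, R, W, b):
    """Edit h only inside the r-dimensional subspace spanned by the rows of R.

    h: (d,) hidden representation produced by the frozen base model
    R: (r, d) low-rank projection with orthonormal rows (assumed form)
    W: (r, d) linear map and b: (r,) bias; random stand-ins for learned values
    """
    # Replace the subspace coordinates R @ h with the target W @ h + b,
    # leaving the component of h orthogonal to the subspace unchanged.
    return h + R.T @ (W @ h + b - R @ h)

rng = np.random.default_rng(0)
d, r = 16, 4  # hypothetical hidden size and intervention rank

# Build R with orthonormal rows from a reduced QR factorization.
Q, _ = np.linalg.qr(rng.standard_normal((d, r)))  # Q has shape (d, r)
R = Q.T

W = rng.standard_normal((r, d))
b = rng.standard_normal(r)
h = rng.standard_normal(d)

h_new = loreft_intervention(h, R, W, b)
```

Under this assumed form, only R, W, and b would be trained (2rd + r values per intervention) while the base model's weights stay frozen, which is consistent with the abstract's description of ReFT as operating on a frozen model.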

Topics

Machine Learning > Core Methods > Representation Learning
Natural Language Processing > Resources & Methods > Large Language Models
Machine Learning > Learning Types > Representation Learning
Deep Learning > Models > Large Language Models
Machine Learning > Learning Types > Fine-Tuning
Deep Learning > Techniques > Transfer Learning
Deep Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Fine-Tuning

Keywords

representation learning; transfer learning; instruction tuning; language model; parameter-efficient fine-tuning; hidden representation; parameter-efficient finetuning; intervention mechanism; commonsense reasoning; large language model; parameter efficient finetuning; representation finetuning; low-rank linear subspace
