Meta-Adapters: Parameter Efficient Few-shot Fine-tuning through Meta-Learning

Trapit Bansal; Salaheddin Alzubi; Tong Wang; Andrew McCallum; Jay Yoon Lee; Jay-Yoon Lee

2022 AUTOML AutoML 2022

Meta-Adapters: Parameter Efficient Few-shot Fine-tuning through Meta-Learning

Abstract

Consistent improvements in the representational capacity of large pre-trained transformers has made it increasingly viable to serve these models as shared priors that can be fine-tuned on a large number of downstream tasks. However, fine-tuning the entire model for every task of interest makes a copy of all the model parameters, rendering such scenarios highly impractical. Recently introduced Adapter methods propose a promising alternative, one where only a small number of additional parameters are introduced per task specifically for fine-tuning. However, Adapters often require large amounts of task-specific data for good performance and don’t work well in data-scarce few-shot scenarios. In this paper, we approach parameter-efficient fine-tuning in few-shot settings from a meta-learning perspective. We introduce Meta-Adapters, which are small blocks of meta-learned adapter layers inserted in a pre-trained model that re-purpose a frozen pre-trained model into a parameter-efficient few-shot learner. Meta-Adapters perform competitively with state-of-the-art few-shot learning methods that require full fine-tuning, while only fine-tuning 0.6% of the parameters. We evaluate Meta-Adapters along with multiple transfer learning baselines on an evaluation suite of 17 classification tasks and find that they improve few-shot accuracy by a large margin over competitive parameter-efficient methods, while requiring significantly lesser parameters for fine-tuning. Moreover, when comparing few-shot prompting of GPT-3 against few-shot fine-tuning with Meta-Adapters, we find that Meta-Adapters perform competitively while working with pre-trained transformers that are many orders of magnitude (1590{\texttimes}) smaller in size than GPT-3.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Trapit Bansal , Salaheddin Alzubi , Tong Wang , Jay Yoon Lee , Jay-Yoon Lee , Andrew McCallum

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Artificial Intelligence > Learning Paradigms > Meta-Learning Machine Learning > Learning Types > Few-Shot Learning

Keywords

few-shot learning transfer learning pretrained transformer parameter-efficient training

Download PDF

Related papers

A Tree-Structured Multi-Task Model Recommender 2022

Syne Tune: A Library for Large Scale Hyperparameter Tuning and Reproducible Research 2022

DIFER: Differentiable Automated Feature Engineering 2022

Differentiable Architecture Search for Reinforcement Learning 2022

AutoCoG: A Unified Data-Model Co-Search Framework for Graph Neural Networks 2022