AdapterDrop: On the Efficiency of Adapters in Transformers

EMNLP 2021

Abstract
Massively pre-trained transformer models are computationally expensive to fine-tune, slow for inference, and have large storage requirements. Recent approaches tackle these shortcomings by training smaller models, dynamically reducing the model size, and by training light-weight adapters. In this paper, we propose AdapterDrop, removing adapters from lower transformer layers during training and inference, which incorporates concepts from all three directions. We show that AdapterDrop can dynamically reduce the computational overhead when performing inference over multiple tasks simultaneously, with minimal decrease in task performance. We further prune adapters from AdapterFusion, which improves inference efficiency while fully maintaining task performance.
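To make the core idea concrete, the following is a minimal PyTorch sketch (not the authors' implementation) of an encoder whose lower layers simply omit their adapter modules, in the spirit of AdapterDrop. All names here (`Adapter`, `EncoderLayer`, `AdapterDropEncoder`, `n_drop`) are illustrative assumptions; the paper's actual setup builds on full pre-trained transformers and the adapter framework rather than this toy stack.

```python
# Illustrative sketch: lower layers carry no adapter (AdapterDrop idea).
import torch
import torch.nn as nn


class Adapter(nn.Module):
    """Bottleneck adapter: down-project, non-linearity, up-project, residual."""

    def __init__(self, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.up = nn.Linear(bottleneck, hidden_size)
        self.act = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))


class EncoderLayer(nn.Module):
    """Simplified transformer layer with an optional adapter after the FFN."""

    def __init__(self, hidden_size: int, n_heads: int, use_adapter: bool):
        super().__init__()
        self.attn = nn.MultiheadAttention(hidden_size, n_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(hidden_size, 4 * hidden_size),
            nn.GELU(),
            nn.Linear(4 * hidden_size, hidden_size),
        )
        self.norm1 = nn.LayerNorm(hidden_size)
        self.norm2 = nn.LayerNorm(hidden_size)
        # Lower layers may omit the adapter entirely, saving compute.
        self.adapter = Adapter(hidden_size) if use_adapter else None

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attn_out, _ = self.attn(x, x, x)
        x = self.norm1(x + attn_out)
        h = self.ffn(x)
        if self.adapter is not None:
            h = self.adapter(h)
        return self.norm2(x + h)


class AdapterDropEncoder(nn.Module):
    """Stack of layers where the first `n_drop` layers have no adapter."""

    def __init__(self, n_layers=12, n_drop=5, hidden_size=256, n_heads=4):
        super().__init__()
        self.layers = nn.ModuleList(
            EncoderLayer(hidden_size, n_heads, use_adapter=(i >= n_drop))
            for i in range(n_layers)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            x = layer(x)
        return x


if __name__ == "__main__":
    model = AdapterDropEncoder()
    tokens = torch.randn(2, 16, 256)  # (batch, sequence, hidden)
    print(model(tokens).shape)  # torch.Size([2, 16, 256])
```

Because the lower, adapter-free layers are shared and task-agnostic, their activations can in principle be computed once and reused when running inference for several tasks at the same time, which is where the paper reports the main speedup.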
Keywords
adapters, efficiency