TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Music Question Answering	MusicQA	LLaMA Adapter	BLEU	0.273	# 2
Music Question Answering	MusicQA	LLaMA Adapter	METEOR	0.334	# 2
Music Question Answering	MusicQA	LLaMA Adapter	ROUGE	0.413	# 2
Music Question Answering	MusicQA	LLaMA Adapter	BERT Score	0.895	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/llama-adapter-efficient-fine-tuning-of/music-question-answering-on-musicqa)](https://paperswithcode.com/sota/music-question-answering-on-musicqa?p=llama-adapter-efficient-fine-tuning-of)`

LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention

28 Mar 2023 · Renrui Zhang, Jiaming Han, Chris Liu, Peng Gao, Aojun Zhou, Xiangfei Hu, Shilin Yan, Pan Lu, Hongsheng Li, Yu Qiao ·

We present LLaMA-Adapter, a lightweight adaption method to efficiently fine-tune LLaMA into an instruction-following model. Using 52K self-instruct demonstrations, LLaMA-Adapter only introduces 1.2M learnable parameters upon the frozen LLaMA 7B model, and costs less than one hour for fine-tuning on 8 A100 GPUs. Specifically, we adopt a set of learnable adaption prompts, and prepend them to the word tokens at higher transformer layers. Then, a zero-initialized attention mechanism with zero gating is proposed, which adaptively injects the new instructional cues into LLaMA, while effectively preserves its pre-trained knowledge. With our efficient training, LLaMA-Adapter can generate high-quality responses, comparable to Alpaca with fully fine-tuned 7B parameters. Besides language commands, our approach can be simply extended to multi-modal instructions for learning image-conditioned LLaMA model, which achieves superior reasoning performance on ScienceQA and COCO Caption benchmarks. Furthermore, we also evaluate the zero-initialized attention mechanism for fine-tuning other pre-trained models (ViT, RoBERTa) on traditional vision and language tasks, demonstrating the superior generalization capacity of our approach. Code is released at https://github.com/OpenGVLab/LLaMA-Adapter.

PDF Abstract