Computer Science > Computation and Language

arXiv:2410.10075 (cs)

[Submitted on 14 Oct 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates

Authors:Md Kowsher, Tara Esmaeilbeig, Chun-Nam Yu, Mojtaba Soltanalian, Niloofar Yousefi

Abstract:We propose RoCoFT, a parameter-efficient fine-tuning method for large-scale language models (LMs) based on updating only a few rows and columns of the weight matrices in transformers. Through extensive experiments with medium-size LMs like BERT and RoBERTa, and larger LMs like Bloom-7B, Llama2-7B, and Llama2-13B, we show that our method gives comparable or better accuracies than state-of-art PEFT methods while also being more memory and computation-efficient. We also study the reason behind the effectiveness of our method with tools from neural tangent kernel theory. We empirically demonstrate that our kernel, constructed using a restricted set of row and column parameters, are numerically close to the full-parameter kernel and gives comparable classification performance. Ablation studies are conducted to investigate the impact of different algorithmic choices, including the selection strategy for rows and columns as well as the optimal rank for effective implementation of our method.

Comments:	RoCoFT is a parameter-efficient method
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2410.10075 [cs.CL]
	(or arXiv:2410.10075v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.10075

Submission history

From: Md Kowsher [view email]
[v1] Mon, 14 Oct 2024 01:36:24 UTC (3,332 KB)
[v2] Tue, 15 Oct 2024 04:00:27 UTC (3,332 KB)

Computer Science > Computation and Language

Title:RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators