Computer Science > Computation and Language

arXiv:2410.04060 (cs)

[Submitted on 5 Oct 2024 (v1), last revised 2 Feb 2025 (this version, v3)]

Title:LoRTA: Low Rank Tensor Adaptation of Large Language Models

Authors:Ignacio Hounie, Charilaos Kanatsoulis, Arnuv Tandon, Alejandro Ribeiro

Abstract:Low Rank Adaptation (LoRA) is a popular Parameter Efficient Fine Tuning (PEFT) method that effectively adapts large pre-trained models for downstream tasks. LoRA parameterizes model updates using low-rank matrices at each layer, significantly reducing the number of trainable parameters and, consequently, resource requirements during fine-tuning. However, the lower bound on the number of trainable parameters remains high due to the use of the low-rank matrix model. Recent works have addressed this limitation by proposing low rank tensor parameterizations for model updates. However, they only exploit redundancy across layers, or tensorize individual matrices using ad-hoc schemes that introduce additional hyperparameters. In this work, we propose a higher-order Candecomp/Parafac (CP) decomposition, enabling a more compact and flexible representation compared to existing matrix and tensor based PEFT methods. Our experiments on Natural Language Understanding, Instruction Tuning, Preference Optimization and Protein Folding benchmarks demonstrate that our method can achieve a reduction in the number of parameters while maintaining comparable performance.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.04060 [cs.CL]
	(or arXiv:2410.04060v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.04060

Submission history

From: Ignacio Hounie [view email]
[v1] Sat, 5 Oct 2024 06:59:50 UTC (614 KB)
[v2] Tue, 15 Oct 2024 16:03:20 UTC (614 KB)
[v3] Sun, 2 Feb 2025 17:56:53 UTC (471 KB)

Computer Science > Computation and Language

Title:LoRTA: Low Rank Tensor Adaptation of Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LoRTA: Low Rank Tensor Adaptation of Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators