2505.00582

Total: 1

#1 Block Circulant Adapter for Large Language Models [PDF²] [Copy] [Kimi²] [REL]

Authors: Xinyu Ding, Meiqi Wang, Siyu Liao, Zhongfeng Wang

Fine-tuning large language models (LLMs) is difficult due to their huge model size. Recent Fourier domain-based methods show potential for reducing fine-tuning costs. We propose a block circulant matrix-based fine-tuning method with a stable training heuristic to leverage the properties of circulant matrices and one-dimensional Fourier transforms to reduce storage and computation costs. Experiments show that our method uses $14\times$ less number of parameters than VeRA, $16\times$ smaller than LoRA and $32\times$ less FLOPs than FourierFT, while maintaining close or better task performance. Our approach presents a promising way in frequency domain to fine-tune large models on downstream tasks.

Subjects: Computation and Language , Machine Learning

Publish: 2025-05-01 15:14:32 UTC

2505.00582

#1 Block Circulant Adapter for Large Language Models [PDF2] [Copy] [Kimi2] [REL]

#1 Block Circulant Adapter for Large Language Models [PDF²] [Copy] [Kimi²] [REL]