IniLoRA: Optimizing Fine-Tuning through Advanced Initialization Strategies for Low-Rank Adaptation

The rapid development of parameter-efficient fine-tuning methods has noticeably improved the efficiency of adapting large language models. Among these, LoRA has gained widespread popularity due to its strong balance of effectiveness and parameter efficiency. However, LoRA relies on initializing two low-rank matrices whose product is zero, which limits its ability to effectively activate and leverage the original model weights, creating a potential bottleneck for optimal performance. To address this limitation, we propose IniLoRA, a novel initialization strategy that initializes the low-rank matrices to closely approximate the original model weights. Experimental results indicate that IniLoRA achieves better performance than LoRA across a range of models and tasks. Additionally, we introduce two variants, IniLoRA-α and IniLoRA-β, both leveraging distinct initialization methods to enhance performance further.

[Figures: IniLoRA method overview; performance on GLUE; performance on GSM8K and MMLU; IniLoRA-α and IniLoRA-β variants]
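To make the initialization difference concrete, here is a minimal sketch in plain PyTorch contrasting the standard LoRA initialization, where one factor is zero so the product starts at zero, with an IniLoRA-style initialization in which the factor product is chosen to approximate the pretrained weight. The tensor names and shapes are illustrative assumptions, not the repository's implementation (which lives in peft/tuners/lora/layer.py).

```python
import torch

d_out, d_in, r = 768, 768, 8            # illustrative shapes; rank 8 matches the runs below
W = torch.randn(d_out, d_in) * 0.02     # stand-in for a pretrained q/v projection weight

# Standard LoRA: A is random, B is zero, so B @ A = 0 at the start of fine-tuning.
A_lora = torch.randn(r, d_in) * 0.01
B_lora = torch.zeros(d_out, r)
assert torch.equal(B_lora @ A_lora, torch.zeros(d_out, d_in))

# IniLoRA-style: choose A and B so that B @ A approximates W itself.
# (Truncated SVD is used here purely for illustration; the repository fits the
#  factors numerically, see the decomposition sketch further below.)
U, S, Vh = torch.linalg.svd(W, full_matrices=False)
B_ini = U[:, :r] * S[:r]                # shape (d_out, r)
A_ini = Vh[:r, :]                       # shape (r, d_in)
rel_err = torch.norm(W - B_ini @ A_ini) / torch.norm(W)
print(f"relative approximation error at rank {r}: {rel_err:.3f}")
```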

IniLoRA Method

Weight approximation experiments

  • Approximate the weights of the q and v modules of Llama-2-7b:
cd matrix_decomposition && python model_weight_decomposition_llm.py --model meta-llama/Llama-2-7b-hf
  • Or, approximate the weights of the q and v modules of RoBERTa:
cd matrix_decomposition && python model_weight_decomposition_roberta.py --model FacebookAI/roberta-base
  • The approximated weights are saved to /work/Codes/IniLoRA/matrix_decomposition/init_weights/Llama-2-7b-hf/rank-8-iterNum-20000-lr-0.0005/, referred to below as weight_init_path.

  • Modify /work/Codes/IniLoRA/peft/tuners/lora/layer.py so that the root_path variable in the sgd_svd function points to weight_init_path. A minimal sketch of what the decomposition step computes follows this list.
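For reference, the decomposition step amounts to fitting rank-r factors so that their product approximates the original weight matrix, then saving those factors under weight_init_path to be picked up via the root_path hook above. The sketch below shows that idea in plain PyTorch; the default hyperparameters mirror the output directory name (rank 8, 20000 iterations, learning rate 5e-4), while the optimizer, loss, and file format are assumptions rather than the exact contents of model_weight_decomposition_llm.py.

```python
import torch

def approximate_weight(W: torch.Tensor, rank: int = 8,
                       iters: int = 20000, lr: float = 5e-4):
    """Fit B (d_out x rank) and A (rank x d_in) so that B @ A approximates W."""
    d_out, d_in = W.shape
    B = (0.01 * torch.randn(d_out, rank)).requires_grad_()
    A = (0.01 * torch.randn(rank, d_in)).requires_grad_()
    opt = torch.optim.Adam([B, A], lr=lr)   # optimizer choice is an assumption
    for _ in range(iters):
        loss = ((W - B @ A) ** 2).mean()    # mean squared reconstruction error
        opt.zero_grad()
        loss.backward()
        opt.step()
    return B.detach(), A.detach()

# Illustrative usage on a random stand-in for one q/v projection weight;
# the real script iterates over the model's q_proj/v_proj matrices instead.
W = torch.randn(768, 768) * 0.02
B, A = approximate_weight(W)
torch.save({"lora_B": B, "lora_A": A}, "q_proj_init.pt")  # hypothetical file name and format
```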

Training and Evaluation

Performance on GSM8K/MATH

  • Download the training set MetaMathQA-395K.json from https://huggingface.co/datasets/meta-math/MetaMathQA/tree/main and place it in the data folder.

  • Execute bash scripts/train_gsm8k_math.sh to start training.

  • Run bash scripts/test_gsm8k.sh to evaluate on the GSM8K test set (see the scoring sketch after this list).

  • Run bash scripts/test_math.sh to evaluate on the MATH test set.
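The test scripts above handle generation and scoring end to end. Purely for orientation, the sketch below shows the usual way GSM8K accuracy is computed: pull the final number out of each model generation and compare it to the reference answer, where GSM8K references end in "#### <number>". The parsing rules here are assumptions, not the exact logic of test_gsm8k.sh.

```python
import re

def extract_final_number(text: str):
    """Return the last number appearing in a generation, or None."""
    matches = re.findall(r"-?\d+(?:\.\d+)?", text.replace(",", ""))
    return matches[-1] if matches else None

def gsm8k_accuracy(generations, references):
    """references are raw GSM8K answers of the form '...#### 42'."""
    correct = 0
    for gen, ref in zip(generations, references):
        gold = ref.split("####")[-1].strip().replace(",", "")
        pred = extract_final_number(gen)
        correct += int(pred is not None and float(pred) == float(gold))
    return correct / len(references)
```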

Performance on HumanEval

  • Download the training set code_alpaca_20k.json from https://huggingface.co/datasets/sahil2801/CodeAlpaca-20k/tree/main and place it in the data folder.

  • Run bash scripts/train_code.sh to start training.

  • Run cd scripts && bash test_humaneval.sh to evaluate on the HumanEval benchmark (see the note after this list on the expected samples format).
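HumanEval scoring ultimately needs the model's completions in the JSONL format expected by the official human-eval toolkit: one record per sample with task_id and completion, scored afterwards with its evaluate_functional_correctness entry point. The sketch below only illustrates that hand-off; generate_completion is a hypothetical placeholder for however the fine-tuned model is actually sampled.

```python
from human_eval.data import read_problems, write_jsonl

def generate_completion(prompt: str) -> str:
    """Hypothetical stand-in for sampling a completion from the fine-tuned model."""
    raise NotImplementedError

problems = read_problems()  # dict: task_id -> problem dict, including the function "prompt"
samples = [
    {"task_id": task_id, "completion": generate_completion(problem["prompt"])}
    for task_id, problem in problems.items()
]
write_jsonl("samples.jsonl", samples)
# Score with the toolkit afterwards:
#   evaluate_functional_correctness samples.jsonl
```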

Performance on MMLU

  • Data processing: cd scripts && bash prepare_data.sh

  • Run bash scripts/train_mmlu.sh to start training.

  • Run cd scripts && bash test_mmlu.sh to evaluate on the MMLU benchmark.

Performance on GLUE

  • Run bash scripts/train_glue_tasks.sh to perform training and evaluation (an illustrative metric sketch follows).
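train_glue_tasks.sh computes the per-task metrics itself; if you want to recompute them outside the script, the Hugging Face evaluate library exposes the standard GLUE metrics (accuracy, F1, Matthews correlation, and so on, depending on the task). The task name and the prediction/label arrays below are placeholders.

```python
import evaluate

# Load the metric for a specific GLUE task; "sst2" is a placeholder task name.
metric = evaluate.load("glue", "sst2")
predictions = [1, 0, 1]   # placeholder model predictions
references = [1, 0, 0]    # placeholder gold labels
print(metric.compute(predictions=predictions, references=references))
```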

IniLoRA-alpha Method

  • Execute bash scripts/train_with_IniLoRA_alpha.sh to start training.

IniLoRA-beta Method

  • Execute bash scripts/train_with_IniLoRA_beta.sh to start training.

About

Paper: Optimizing Fine-Tuning through Advanced Initialization Strategies for Low-Rank Adaptation
