The base structure of the project is adapted from Carnegie Mellon University's CS11-711 Advanced NLP assignment.
Pretrained weights for the language model are loaded from `stories42M.pt`, an 8-layer, 42M-parameter language model pretrained on the TinyStories dataset.
The project implements a small version of the Llama2 model and performs sentence classification on the SST and CFIMDB datasets. The `documentation.md` file
contains descriptions of the classes and each function within them.
- Implemented multi-head self-attention with RoPE relative positional embeddings.
- Implemented a Transformer block.
- Implemented rotary positional embeddings (a minimal sketch appears after this list).
- Implemented a classification head for sentence classification.
- Added implementations of a LoRA layer and a LoRA wrapper (see the LoRA sketch after this list).
- Implemented the AdamW optimizer (see the AdamW sketch after this list).
- Generated text completions from a given prefix.
- Performed zero-shot, prompt-based sentiment analysis on both datasets (see the zero-shot sketch after this list).
- Performed fine-tuning of the model along with a classification head.
- Performed parameter-efficient fine-tuning using our own implementation of LoRA.
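
The sketch below shows one way rotary embeddings can be applied to the query and key tensors before attention scores are computed. It is a minimal, self-contained example: the function name `apply_rotary_emb` and the assumed tensor layout `(batch, seq_len, n_heads, head_dim)` are illustrative and may differ from the project's actual code.

```python
import torch

def apply_rotary_emb(x: torch.Tensor, theta: float = 10000.0) -> torch.Tensor:
    """Apply rotary positional embeddings to a tensor of shape
    (batch, seq_len, n_heads, head_dim). Illustrative sketch only."""
    _, seqlen, _, head_dim = x.shape
    # Rotation frequency for each pair of dimensions: theta^(-2i / head_dim).
    exponents = torch.arange(0, head_dim, 2, device=x.device).float() / head_dim
    freqs = 1.0 / (theta ** exponents)
    positions = torch.arange(seqlen, device=x.device).float()
    angles = torch.outer(positions, freqs)   # (seq_len, head_dim / 2)
    cos = angles.cos()[None, :, None, :]     # broadcast over batch and heads
    sin = angles.sin()[None, :, None, :]
    x1, x2 = x[..., 0::2], x[..., 1::2]      # even/odd dimension pairs
    out = torch.empty_like(x)
    # Rotate each 2-D pair by its position-dependent angle.
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out
```

Applying this to both the query and key tensors before the attention dot product makes each score depend only on the relative distance between positions, which is what makes RoPE a relative positional embedding.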
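
Below is a minimal sketch of how a LoRA wrapper around a frozen linear layer can be structured: the pretrained weight is frozen, and a trainable low-rank update scaled by `alpha / rank` is added to the layer's output. The class name `LoRALinear` and its interface are illustrative, not the project's actual classes.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Freezes a pretrained nn.Linear and adds a trainable
    low-rank update. Illustrative sketch only."""
    def __init__(self, base: nn.Linear, rank: int = 4, alpha: float = 1.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():   # freeze the pretrained weights
            p.requires_grad = False
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # lora_b is zero-initialized, so the update starts at zero and
        # the wrapped layer initially matches the frozen base layer.
        return self.base(x) + self.scaling * (x @ self.lora_a.T @ self.lora_b.T)
```

Only the `rank * (in_features + out_features)` extra parameters are trained, which is what makes this fine-tuning parameter efficient.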
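
The following is a minimal sketch of the AdamW update (Adam with decoupled weight decay). It omits options such as AMSGrad and is not necessarily line-for-line identical to the project's implementation.

```python
import torch
from torch.optim import Optimizer

class AdamW(Optimizer):
    """Minimal AdamW sketch: Adam moment estimates plus weight decay
    applied directly to the parameters instead of the gradients."""
    def __init__(self, params, lr=1e-3, betas=(0.9, 0.999),
                 eps=1e-8, weight_decay=0.01):
        defaults = dict(lr=lr, betas=betas, eps=eps, weight_decay=weight_decay)
        super().__init__(params, defaults)

    @torch.no_grad()
    def step(self):
        for group in self.param_groups:
            beta1, beta2 = group["betas"]
            for p in group["params"]:
                if p.grad is None:
                    continue
                state = self.state[p]
                if not state:   # lazily initialize the moment buffers
                    state["step"] = 0
                    state["exp_avg"] = torch.zeros_like(p)
                    state["exp_avg_sq"] = torch.zeros_like(p)
                state["step"] += 1
                m, v = state["exp_avg"], state["exp_avg_sq"]
                # First and second moment estimates of the gradient.
                m.mul_(beta1).add_(p.grad, alpha=1 - beta1)
                v.mul_(beta2).addcmul_(p.grad, p.grad, value=1 - beta2)
                # Fold the bias corrections into the step size.
                bc1 = 1 - beta1 ** state["step"]
                bc2 = 1 - beta2 ** state["step"]
                step_size = group["lr"] * (bc2 ** 0.5) / bc1
                p.addcdiv_(m, v.sqrt().add_(group["eps"]), value=-step_size)
                # Decoupled weight decay: shrink the weights directly.
                p.mul_(1 - group["lr"] * group["weight_decay"])
```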
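
Finally, a sketch of how zero-shot, prompt-based sentiment classification can work: the language model scores candidate label words as continuations of a prompt, and the higher-scoring label wins. `model`, `tokenizer`, and the prompt template here are hypothetical stand-ins for the project's actual objects.

```python
import torch

@torch.no_grad()
def zero_shot_sentiment(model, tokenizer, sentence: str) -> int:
    """Return 1 for positive, 0 for negative. `model` is assumed to map
    token ids of shape (batch, seq_len) to logits of shape
    (batch, seq_len, vocab_size); `tokenizer.encode` is assumed to
    return a list of token ids. Illustrative sketch only."""
    prompt = f"{sentence} Overall, this review is"   # illustrative template
    ids = torch.tensor([tokenizer.encode(prompt)])
    next_token_logits = model(ids)[:, -1, :]
    # Heuristic: compare the two label words (last subword token of
    # each) as candidate next tokens after the prompt.
    pos_id = tokenizer.encode(" positive")[-1]
    neg_id = tokenizer.encode(" negative")[-1]
    return int(next_token_logits[0, pos_id] > next_token_logits[0, neg_id])
```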
- Run `setup.sh` for initial configuration.
- The commands for running each feature are listed in `commands.txt`.
- Detailed explanations of the result files are given in `results.txt`.
- The best hyperparameters are shown in the table below.
| Dataset | Method | Epochs | Batch Size | Learning Rate | LoRA Alpha | LoRA Rank |
|---|---|---|---|---|---|---|
| sst | Fine-tuning | 5 | 80 | 2e-5 | - | - |
| sst | LoRA | 5 | 80 | 2e-3 | 1 | 4 |
| cfimdb | Fine-tuning | 5 | 10 | 2e-5 | - | - |
| cfimdb | LoRA | 5 | 10 | 2e-3 | 1 | 4 |
- The table below shows the best sentiment classification accuracies for both datasets.
| Dataset | Fine-tuning (Train/Dev/Test) | LoRA (Train/Dev/Test) |
|---|---|---|
| sst | 0.77 / 0.41 / 0.41 | 0.45 / 0.43 / 0.41 |
| cfimdb | 0.85 / 0.83 / 0.45 | 0.879 / 0.869 / 0.52 |
This code is based on llama2.c by Andrej Karpathy. Parts of the code are also adapted from the Hugging Face transformers library (Apache License 2.0).