Small LLM Project

This project demonstrates how to build, train, and interact with a small language model (a scaled-down LLM) using a custom dataset and tokenizer.

Overview

Custom Training: Train a language model on your own text data with a custom tokenizer.
Model Checkpoints: Save and reuse model checkpoints for evaluation or further training.
Web Demo: Interact with the trained model through a simple web interface.
Evaluation: Test and explore the model’s outputs using scripts or notebooks.
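
The checkpoint workflow described above can be sketched roughly as follows. This is a minimal illustration, assuming the `.pth` files are ordinary PyTorch checkpoints. `TinyLM`, `save_checkpoint`, `load_checkpoint`, and the dictionary keys are hypothetical names chosen for this example, not the project's actual code.

```python
import os
import tempfile

import torch
import torch.nn as nn


class TinyLM(nn.Module):
    """Hypothetical stand-in model: an embedding followed by a linear output layer."""

    def __init__(self, vocab_size: int = 32, dim: int = 16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # Returns one score per vocabulary entry for every input position.
        return self.out(self.embed(token_ids))


def save_checkpoint(model: nn.Module, path: str, step: int) -> None:
    """Store the model weights together with the training step (assumed layout)."""
    torch.save({"model_state": model.state_dict(), "step": step}, path)


def load_checkpoint(model: nn.Module, path: str) -> int:
    """Restore weights into an already constructed model and return the saved step."""
    checkpoint = torch.load(path, map_location="cpu")
    model.load_state_dict(checkpoint["model_state"])
    return checkpoint["step"]
```

Saving the `state_dict` rather than the whole model object keeps the file independent of how the model class is imported, at the cost of having to construct the model before loading. The restored model can then be switched to `model.eval()` for evaluation or passed back to a training loop.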

Workflow

Prepare Data: Process your text data and set up a tokenizer.
Train Model: Run the training script (train.py) to build your LLM.
Demo & Inference: Use the web app (app.py) or the demo scripts (demo.py, demo.ipynb) to interact with the model and see results.
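
To make the data preparation step more concrete, here is a minimal sketch of how raw text can be turned into training examples. It assumes a character-level tokenizer purely for illustration. The tokenizer stored in local_tokenizer is likely different, and every function name below is hypothetical.

```python
def build_vocab(text: str) -> dict[str, int]:
    """Map each distinct character to an integer id, in sorted order."""
    return {ch: i for i, ch in enumerate(sorted(set(text)))}


def encode(text: str, vocab: dict[str, int]) -> list[int]:
    """Convert text to a list of token ids."""
    return [vocab[ch] for ch in text]


def decode(ids: list[int], vocab: dict[str, int]) -> str:
    """Convert token ids back to text."""
    inverse = {i: ch for ch, i in vocab.items()}
    return "".join(inverse[i] for i in ids)


def make_examples(ids: list[int], context_len: int) -> list[tuple[list[int], list[int]]]:
    """Slide a window over the ids; each target is the input shifted by one token."""
    return [
        (ids[i : i + context_len], ids[i + 1 : i + context_len + 1])
        for i in range(len(ids) - context_len)
    ]
```

A typical use would be `ids = encode(text, build_vocab(text))` followed by `make_examples(ids, context_len)`. A training script then feeds each input window to the model and compares its predictions against the shifted target window.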

Repository Files

__pycache__
local_tokenizer
static
templates
wikitext_dataset
README.md
app.py
dataset_save.py
demo.ipynb
demo.py
model.py
model_2.py
small_llm_1.pth
small_llm_2.pth
small_llm_3.pth
small_llm_4.pth
small_llm_5.pth
test.py
train.py
train_data.txt