A collection of Colab notebooks that implement a Bigram language model and a GPT (as per the "Attention Is All You Need" paper).
Both models were trained on a small text corpus (`wizard-of-oz.txt`) and are used at the end to predict the next words for a given prompt.
Note: this approach could be extended to larger text corpora for real-world LLMs, but this project uses a smaller corpus for faster training.
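For intuition, here is a minimal character-level bigram language model in PyTorch, a sketch of the kind of model `bigram.ipynb` builds. The class and method names are illustrative assumptions; the notebook's actual code may differ.

```python
import torch
import torch.nn as nn
from torch.nn import functional as F

class BigramLanguageModel(nn.Module):
    def __init__(self, vocab_size):
        super().__init__()
        # Each token directly looks up the logits for the next token.
        self.token_embedding = nn.Embedding(vocab_size, vocab_size)

    def forward(self, idx, targets=None):
        logits = self.token_embedding(idx)  # (B, T, vocab_size)
        loss = None
        if targets is not None:
            B, T, C = logits.shape
            loss = F.cross_entropy(logits.view(B * T, C), targets.view(B * T))
        return logits, loss

    @torch.no_grad()
    def generate(self, idx, max_new_tokens):
        # Autoregressively sample the next token from the bigram distribution.
        for _ in range(max_new_tokens):
            logits, _ = self(idx)
            probs = F.softmax(logits[:, -1, :], dim=-1)
            idx_next = torch.multinomial(probs, num_samples=1)
            idx = torch.cat((idx, idx_next), dim=1)
        return idx
```

Because a bigram model conditions on only the previous token, its output is mostly incoherent; the GPT notebook replaces this lookup table with a transformer that attends over the whole context window.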
The project is divided into two folders, one per model (Bigram language model and GPT). Each folder contains all the files and notebooks needed to train and run its model in their entirety.
    Root
    ├── Bigram
    │   ├── bigram.ipynb
    │   └── wizard-of-oz.txt
    └── GPT
        ├── gpt_basic.ipynb
        ├── vocab.txt
        └── wizard-of-oz.txt
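`vocab.txt` presumably holds the GPT model's vocabulary. Assuming a character-level tokenizer, a hedged sketch of how such a file might be derived from the corpus (the notebook's exact preprocessing may differ):

```python
# Assumption: vocab.txt is the sorted set of unique characters in the corpus.
with open("wizard-of-oz.txt", "r", encoding="utf-8") as f:
    text = f.read()

chars = sorted(set(text))  # unique characters in the corpus
with open("vocab.txt", "w", encoding="utf-8") as f:
    f.write("".join(chars))

# Simple char <-> integer mappings for encoding/decoding text.
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}
encode = lambda s: [stoi[c] for c in s]
decode = lambda ids: "".join(itos[i] for i in ids)
```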
Each model is run in the same way from its respective folder:
- Step 1: Open the `.ipynb` notebook in Google Colab.
- Step 2: Upload the `wizard-of-oz.txt` file from the same folder as the notebook.
- Step 3: Connect to a Colab runtime. A GPU runtime is recommended for faster results, but a CPU runtime also works (albeit at a slower pace).
- Step 4: Run each cell sequentially. This is especially important for `gpt_basic.ipynb`, which generates `train_split.txt` and `val_split.txt` (see the sketch after this list).
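A hedged sketch of the train/validation split that `gpt_basic.ipynb` is described as generating; the 90/10 ratio here is an assumption, and the notebook's actual split may differ.

```python
# Assumption: a simple contiguous 90/10 split of the corpus into two files.
with open("wizard-of-oz.txt", "r", encoding="utf-8") as f:
    text = f.read()

n = int(0.9 * len(text))  # assumed 90/10 train/validation boundary
with open("train_split.txt", "w", encoding="utf-8") as f:
    f.write(text[:n])
with open("val_split.txt", "w", encoding="utf-8") as f:
    f.write(text[n:])
```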