Skip to content

smoke-y/tamilGPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 

Repository files navigation

  __                    .__ .__     ________ __________ ___________ 
_/  |_ _____     _____  |__||  |   /  _____/ \______   \\__    ___/ 
\   __\\__  \   /     \ |  ||  |  /   \  ___  |     ___/  |    |    
 |  |   / __ \_|  Y Y  \|  ||  |__\    \_\  \ |    |      |    |    
 |__|  (____  /|__|_|  /|__||____/ \______  / |____|      |____|    
            \/       \/                   \/                        

Training a GPT in 4 hours on tamil tokens.

MODEL

nanoGPT

Implement Andrej Karptathy's nanoGPT

modded-nanoGPT

This is a repo trying to train nanoGPT under 3 mins from scratch. Using this repo as a reference, we apply these changes to nanoGPT

Now you can train a GPT on a cheap NVIDIA chip.

GETTING STARTED

Download ai4bharat's dataset(ta.txt) and place it under data/. Run src/clean.py and finally src/train.py. Modify batch size based on your VRAM(src/model.py).

You can find the weights here.

FAILED EXPERIMENT

About

GPT-2 emitting tamil tokens

Topics

Resources

License

Stars

Watchers

Forks

Languages