kevin-v96/decoder-from-scratch

decoder-from-scratch

I've previously written nanoGPT by following Andrej Karpathy's tutorial, but the jump from that character-level GPT to the models we pull from HuggingFace is too big.

I wrote this to get my hands dirty implementing a decoder-only Transformer architecture from scratch in PyTorch.

There are a couple of resources I followed to do this:
