Skip to content

lizardp1/transformer-addition

Repository files navigation

transformer-addition

Training a small transformer to learn 3-digit addition with carry balanced data.

Files

  • data_gen.py — Tokenizer and carry-balanced data generation
  • train.py — Model config and training loop (logs to wandb)
  • eval.py — Evaluation on in-distribution and out-of-distribution digit lengths
  • Learning_Addition_Elizabeth_Pavlova.pdf — Full write-up with results and discussion

Usage

pip install -r requirements.txt
python train.py
python eval.py

Environment

Tested on Google Colab (T4 GPU)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages