vision_transformer_numpy

NumPy Implementation of the Vision Transformer (ViT) on Num/Cu-py

In order to gain a deeper understanding of Vision Transformers (ViT) and also I didnt see any previous work has demonstrated backward propagation in conjunction with forward propagation, Therefore, I come up with implementing vision transformer in numpy (cpu)/ cupy(gpu)

Here are the main benefits of implementing ViT in NumPy:

It aids in comprehending the underlying mathematics, preventing the abstraction of the learning process.
It eliminates the need for the pytorch framework.

Dataset

For sake of simplicity the code uses MNIST dataset as from here.

Training

The model trained in the code is currently not saved. Loss and metrics are provided.

Need to add/implement

Resolve bugs of overflow errors and occurance of nan values
save model weights
unit tests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vision_transformer_numpy

Dataset

Training

Need to add/implement

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

vision_transformer_numpy

Dataset

Training

Need to add/implement