This is a naive implementation of the Transformer architecture in PyTorch. Transformers address natural language processing tasks through self-attention, which lets the model relate tokens anywhere in a sequence and learn long-range patterns in text.
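For intuition, here is a minimal sketch of scaled dot-product self-attention in PyTorch. The function name, single-head setup, and weight shapes are illustrative simplifications, not this repo's actual modules:

```python
import math
import torch
import torch.nn.functional as F

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a batch of sequences.

    x: (batch, seq_len, d_model); w_q/w_k/w_v: (d_model, d_model) projections.
    """
    q = x @ w_q                                                # queries
    k = x @ w_k                                                # keys
    v = x @ w_v                                                # values
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))   # pairwise token similarity
    weights = F.softmax(scores, dim=-1)                        # attention distribution per token
    return weights @ v                                         # weighted sum of values

# Example: a batch of 2 sequences, 5 tokens each, model width 16.
d_model = 16
x = torch.randn(2, 5, d_model)
w_q, w_k, w_v = (torch.randn(d_model, d_model) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)   # shape (2, 5, 16)
```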
How to use
The code-generation experiment is implemented in train_code.py.
Code
To run the NMT-style code-generation experiment, run train_code.py.
To test your model, run run_inference.py, updating the model weights path on this line:
transformer.load_state_dict(torch.load(f"weights/transformer_code_50.pth"))
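A rough sketch of what testing could look like end to end. The Transformer class, its import path, its constructor, and its forward signature below are assumptions for illustration, not the actual contents of run_inference.py:

```python
import torch
import tiktoken

# Illustrative sketch: the model class, its constructor arguments, and its
# forward signature are assumptions, not this repo's actual API.
from model import Transformer  # hypothetical module/class name

enc = tiktoken.get_encoding("gpt2")                 # tokenizer; the actual encoding may differ
transformer = Transformer()                         # build with the same config used in training
transformer.load_state_dict(torch.load("weights/transformer_code_50.pth"))
transformer.eval()

prompt = torch.tensor([enc.encode("def add(a, b):")])   # shape (1, seq_len)
with torch.no_grad():
    logits = transformer(prompt)                         # assumed to return (1, seq_len, vocab_size)
next_id = logits[0, -1].argmax().item()                  # greedy choice of the next token
print(enc.decode([next_id]))
```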
Examples:
Preliminary Plan
- Read the following reference material on GitHub for implementation specifics by 9/17
  - nanoGPT, Andrej Karpathy's GPT-2 Transformer implementation with a custom vocabulary
  - X-Transformers, a transformer library that collects custom tooling from a variety of papers
  - HuggingFace Transformers' BERT, Hugging Face's implementation of BERT
- Write a system design document for the classes and methods needed for the implementation by 9/20
- Build a custom text vocabulary dataset for training and validation by 10/1
  - Use the tiktoken library to tokenize sentences and phrases (see the sketch after this list).
- Complete the core transformer classes and methods by 10/10
- Build a dataset loader module for running experiments that train our transformer on the custom dataset by 10/20 (also sketched below)
- Wrap up the experiment and document experimental results such as training performance by 11/1
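As a minimal sketch of the tokenization and data-loading items above (the corpus path, block size, and gpt2 encoding are assumptions, not settings from this repo), a tiktoken-backed dataset and loader could look like:

```python
import tiktoken
import torch
from torch.utils.data import Dataset, DataLoader

class CodeDataset(Dataset):
    """Tokenizes a text corpus with tiktoken and serves fixed-length training blocks."""

    def __init__(self, path, block_size=128):
        enc = tiktoken.get_encoding("gpt2")      # BPE tokenizer; encoding choice is an assumption
        with open(path, encoding="utf-8") as f:
            tokens = enc.encode(f.read())
        self.data = torch.tensor(tokens, dtype=torch.long)
        self.block_size = block_size

    def __len__(self):
        return len(self.data) - self.block_size

    def __getitem__(self, idx):
        # Input is a block of tokens; target is the same block shifted by one (next-token prediction).
        chunk = self.data[idx : idx + self.block_size + 1]
        return chunk[:-1], chunk[1:]

# Usage (hypothetical corpus path):
# dataset = CodeDataset("data/code_corpus.txt", block_size=128)
# loader = DataLoader(dataset, batch_size=32, shuffle=True)
# x, y = next(iter(loader))   # x, y: (32, 128) tensors of token ids
```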
Papers
- Attention Is All You Need
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Language Models are Unsupervised Multitask Learners
- Evaluating Large Language Models Trained on Code
- StarCoder
- CodeLlama
- CodeT5
- Autoformalization with Large Language Models
Adapting for Code-Generation
Online Resources

