This project implements Neural Machine Translation (NMT) models from scratch for German → English translation using the Multi30k dataset.
The focus is on understanding sequence modeling, attention mechanisms, decoding strategies, and evaluation, rather than relying on pre-built libraries.
Model 1: LSTM Encoder–Decoder with Attention
- Encoder–decoder architecture built on LSTMs
- Luong-style attention (see the sketch after this list)
- Teacher forcing during training
- Greedy decoding and beam search at inference time (see the decoding sketch below)
- Successfully trained and evaluated
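The attention bullet above refers to Luong-style multiplicative scoring. Below is a minimal PyTorch sketch of one attention step under assumed shapes (batch-first encoder outputs, a single decoder state); the class name, interface, and the choice of the "general" score variant are illustrative, not the repo's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LuongAttention(nn.Module):
    """Luong 'general' score: score(h_t, h_s) = h_t^T W h_s (assumed variant)."""
    def __init__(self, hidden_size: int):
        super().__init__()
        self.W = nn.Linear(hidden_size, hidden_size, bias=False)

    def forward(self, dec_hidden, enc_outputs, pad_mask=None):
        # dec_hidden:  (batch, hidden)          current decoder state h_t
        # enc_outputs: (batch, src_len, hidden) all encoder states h_s
        scores = torch.bmm(self.W(enc_outputs), dec_hidden.unsqueeze(2)).squeeze(2)
        if pad_mask is not None:                # True where the source is padding
            scores = scores.masked_fill(pad_mask, float("-inf"))
        weights = F.softmax(scores, dim=1)      # attention over source positions
        context = torch.bmm(weights.unsqueeze(1), enc_outputs).squeeze(1)
        return context, weights                 # context feeds the next decoder step
```

During training, teacher forcing feeds the gold previous target token (rather than the model's own prediction) into each decoder step, which stabilizes early learning.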
Performance
- BLEU score (test set): ~27
- Produces fluent and semantically meaningful translations
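The decoding strategies listed for this model can be summarized with a schematic greedy loop. The `step` callable below, mapping the previous token and decoder state to next-token logits, is a hypothetical interface, not the repo's API:

```python
import torch

def greedy_decode(step, init_state, bos_id: int, eos_id: int, max_len: int = 50):
    """Pick the argmax token at every step until EOS or max_len."""
    tokens, state = [bos_id], init_state
    for _ in range(max_len):
        logits, state = step(torch.tensor([tokens[-1]]), state)
        next_id = int(logits.argmax(dim=-1))
        if next_id == eos_id:
            break
        tokens.append(next_id)
    return tokens[1:]  # drop the BOS token
```

Beam search generalizes this loop by keeping the k highest-scoring partial hypotheses at each step instead of a single argmax path.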
Model 2: Transformer (from scratch)
- Multi-head self-attention
- Positional embeddings
- Encoder–decoder stack
- Causal and padding masks (see the mask sketch after this list)
- Noam learning rate scheduler (see the schedule sketch under Status below)
- Greedy and beam search decoding
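A minimal sketch of the two mask types listed above, in PyTorch. The convention here (True means "do not attend") and the broadcast shape are assumptions; implementations differ on both:

```python
import torch

def causal_mask(size: int) -> torch.Tensor:
    # True above the diagonal: position i may not attend to positions j > i.
    return torch.triu(torch.ones(size, size), diagonal=1).bool()

def padding_mask(token_ids: torch.Tensor, pad_id: int) -> torch.Tensor:
    # True where the source token is padding; shaped (batch, 1, 1, src_len)
    # so it broadcasts across attention heads and query positions.
    return (token_ids == pad_id)[:, None, None, :]
```

In the decoder, the causal mask and the target-side padding mask are typically combined with a logical OR before being applied to the attention scores.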
Status
- Training loss decreases as expected
- Suffers from token repetition during decoding
- Very low BLEU score
- Included to demonstrate the practical difficulty of training Transformers from scratch on small datasets
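One common culprit behind exactly this failure pattern (loss falls, but decoding degenerates) is the learning-rate warmup. For reference, a sketch of the Noam schedule listed above; the `d_model` and `warmup` defaults are the original paper's values, not necessarily this repo's:

```python
def noam_lr(step: int, d_model: int = 512, warmup: int = 4000) -> float:
    """lr = d_model**-0.5 * min(step**-0.5, step * warmup**-1.5)."""
    step = max(step, 1)  # avoid division by zero on the first step
    return d_model ** -0.5 * min(step ** -0.5, step * warmup ** -1.5)

# Assumed wiring with PyTorch: set the optimizer's base lr to 1.0 so the
# lambda below fully determines the rate.
# scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda=noam_lr)
```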
Evaluation
- BLEU scores computed with NLTK's corpus-level BLEU (see the example below)
- Smoothing applied so the score does not collapse to zero when some n-gram order has no matches
- Evaluated on both the training and test sets

```python
from nltk.translate.bleu_score import corpus_bleu
```
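Expanding on that import, a minimal sketch of corpus-level BLEU with smoothing; the toy data and the choice of `method1` smoothing are illustrative assumptions, not necessarily what the repo uses:

```python
from nltk.translate.bleu_score import corpus_bleu, SmoothingFunction

# Each entry in `references` is a list of tokenized reference translations
# for one sentence; each entry in `hypotheses` is one tokenized model output.
references = [[["a", "man", "rides", "a", "bike"]]]
hypotheses = [["a", "man", "is", "riding", "a", "bike"]]

# Smoothing keeps the score nonzero when a higher n-gram order has no matches.
smooth = SmoothingFunction().method1
bleu = corpus_bleu(references, hypotheses, smoothing_function=smooth)
print(f"BLEU: {100 * bleu:.2f}")  # corpus_bleu returns [0, 1]; x100 matches the ~27 above
```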