English ↔ CodeMix (English-Hindi) translation using Seq2Seq neural models with attention. The project also examines how Language Identification (LID) tags affect translation performance.
Seq2Seq model with attention mechanism.
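As a rough illustration of what the attention step computes at each decoder time step, here is a minimal pure-Python sketch. It assumes dot-product scoring; the project's notebooks may use a different scoring function (for example additive attention), and the names `dot_product_attention`, `encoder_states`, and `decoder_state` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_attention(decoder_state, encoder_states):
    """Return (weights, context) for a single decoder step.

    Assumption: scores are plain dot products between the decoder state
    and each encoder state; this is one common choice, not necessarily
    the one used in the project.
    """
    scores = [sum(d * e for d, e in zip(decoder_state, enc))
              for enc in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    # Context vector: attention-weighted sum of the encoder states.
    context = [sum(w * enc[i] for w, enc in zip(weights, encoder_states))
               for i in range(dim)]
    return weights, context


# Toy example: three source positions, hidden size 2 (made-up values).
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [1.0, 0.0]
weights, context = dot_product_attention(decoder_state, encoder_states)
```

The weights sum to 1, and source positions whose encoder state aligns with the decoder state receive more weight. In a PyTorch model the same computation would typically be done with batched tensor operations.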
Translation directions:
- English → CodeMix
- CodeMix → English
Experimented with and without LID tags.
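One plausible way to attach LID tags to a code-mixed sentence is to append a language label to each token before building the vocabulary. The sketch below is an assumption about the format, not the project's confirmed preprocessing: the `|` separator, the `en`/`hi` tag set, the helper names, and the example sentence are all made up for illustration.

```python
def add_lid_tags(tokens, lids, sep="|"):
    """Append a language-ID tag to each token, e.g. 'movie' -> 'movie|en'.

    Assumption: tags are supplied per token by some upstream LID step.
    """
    if len(tokens) != len(lids):
        raise ValueError("tokens and lids must have the same length")
    return [f"{tok}{sep}{lid}" for tok, lid in zip(tokens, lids)]


def strip_lid_tags(tagged, sep="|"):
    """Remove the tags again, e.g. before scoring model output."""
    return [t.rsplit(sep, 1)[0] for t in tagged]


# Hypothetical code-mixed sentence with per-token language labels.
tokens = ["mujhe", "yeh", "movie", "bahut", "pasand", "hai"]
lids = ["hi", "hi", "en", "hi", "hi", "hi"]
tagged = add_lid_tags(tokens, lids)
```

With this format the tagged and untagged variants share the same pipeline; only the token strings fed to the vocabulary differ.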
Evaluated using perplexity and BLEU scores. Adding LID tags roughly doubled BLEU in both directions, although absolute scores remain low:
- Without LID tags: BLEU ≈ 0.03 (English→CodeMix), 0.02 (CodeMix→English)
- With LID tags: BLEU ≈ 0.06 (English→CodeMix), 0.04 (CodeMix→English)
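For reference, the two metrics can be sketched in a few lines of standard-library Python. This is a simplified illustration, assuming the reported BLEU values are on the 0 to 1 scale; it is a single-reference, unsmoothed sentence-level BLEU, and the project may well have used a library implementation or corpus-level scoring instead. The function names are hypothetical.

```python
import math
from collections import Counter


def perplexity(token_nlls):
    """Perplexity = exp(mean negative log-likelihood per token)."""
    return math.exp(sum(token_nlls) / len(token_nlls))


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Unsmoothed single-reference BLEU in [0, 1]."""
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        # Clipped overlap: each n-gram counts at most as often as in the reference.
        overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
        total = sum(cand.values())
        if overlap == 0 or total == 0:
            return 0.0  # without smoothing, one empty precision zeroes the score
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / max_n)


reference = "the cat sat on a mat".split()
candidate = "the cat sat on the mat".split()
score = sentence_bleu(candidate, reference)
ppl = perplexity([math.log(4)] * 3)
```

Because unsmoothed BLEU drops to zero whenever a sentence has no matching 4-gram, short or noisy outputs score very low on this metric.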
Deliverables:
- Notebooks for all translation directions (with and without LID tags)
- Final report
Python • PyTorch • spaCy • TorchText
Contributors:
- Vishal Patel
- Pawan Patidar