This repository contains SGET, a Structural GEneralization Benchmark based on English-Japanese Machine Translation.
The SGET dataset is located in data/.
data/train.txt is used for training models, and data/dev.txt is used for validation.
data/test.txt is the in-distribution test set, and data/gen.txt is the out-of-distribution generalization set.
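Each line of these files is assumed to pair an English source sentence with its Japanese translation (an assumption based on the benchmark's description; inspect the files to confirm the exact format), e.g.:

head -n 3 data/train.txt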
Install the dependencies with:

pip install -r requirements.txt
To train a vanilla Transformer, first replace [SEED] in scripts/config_transformer.yaml with a random seed.
Then run the following commands, replacing [SEED] with the same seed:
cd scripts
sh train_transformer.sh [SEED]
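For example, with seed 42 (an arbitrary value chosen for illustration), the two steps above can be done as follows. Note that the sed invocation assumes GNU sed; on macOS, use sed -i '' instead.

# Fill in the seed in the config, then launch training (seed 42 is illustrative).
sed -i 's/\[SEED\]/42/' scripts/config_transformer.yaml
cd scripts
sh train_transformer.sh 42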
You can train an LSTM (lstm) similarly.
To fine-tune Llama 2, run the following commands:
cd scripts
sh lora_llama2.sh [SEED]
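For example, with the illustrative seed 42:

cd scripts
sh lora_llama2.sh 42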
To generate translations with a trained (or fine-tuned) model, run the following commands.
The checkpoint to use is selected by [SEED] and [TRAIN_STEPS]:
cd scripts
sh translate_transformer.sh [SEED] [TRAIN_STEPS]
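For example, to translate with the model trained above, assuming a checkpoint was saved at 100000 training steps (both values are illustrative; use the seed and step count from your own run):

cd scripts
sh translate_transformer.sh 42 100000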
You can use the LSTM and Llama 2 models similarly.
Translation results are written to results/[MODEL]/[SEED]/[TRAIN_STEPS].
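Continuing the illustrative run above, and assuming the model directory is named transformer (an assumption; check the actual directory layout produced by the scripts), the outputs can be inspected with:

ls results/transformer/42/100000/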
This repository is primarily licensed under the MIT License, but scripts/lora.py is licensed under the Apache License, Version 2.0.
You can cite this work as follows:
@inproceedings{kumon-etal-2024-evaluating,
    title = "Evaluating Structural Generalization in Neural Machine Translation",
    author = "Kumon, Ryoma and
      Matsuoka, Daiki and
      Yanaka, Hitomi",
    editor = "Ku, Lun-Wei and
      Martins, Andre and
      Srikumar, Vivek",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-acl.783/",
    doi = "10.18653/v1/2024.findings-acl.783",
    pages = "13220--13239",
}
If you have any issues or questions, please contact kumoryo9@is.s.u-tokyo.ac.jp.