Data for Machine Translation: Lab Week 2

The sentences in these files originate from the Tatoeba Corpus and have been downloaded from ManyThings.

The sentences in this repository have been shuffled and split by language into separate files. They are divided into three portions: training (70%), validation (15%), and testing (15%).
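The shuffle-and-split step described above can be sketched as follows. This is a minimal illustration only: the function name `split_parallel`, the fixed seed, and the rounding of portion sizes are assumptions, not the script actually used to produce these files.

```python
import random


def split_parallel(pairs, train_frac=0.70, valid_frac=0.15, seed=0):
    """Shuffle sentence pairs and split them into train/valid/test portions.

    `pairs` is a sequence of (english, dutch) tuples. Shuffling the pairs
    together keeps each English sentence aligned with its translation.
    The seed and the rounding rule are assumptions made for this sketch.
    """
    pairs = list(pairs)
    random.Random(seed).shuffle(pairs)

    n_train = round(len(pairs) * train_frac)
    n_valid = round(len(pairs) * valid_frac)

    train = pairs[:n_train]
    valid = pairs[n_train:n_train + n_valid]
    test = pairs[n_train + n_valid:]  # whatever remains (about 15%)
    return train, valid, test


# Toy example with 20 made-up sentence pairs.
toy_pairs = [(f"english sentence {i}", f"nederlandse zin {i}") for i in range(20)]
train, valid, test = split_parallel(toy_pairs)
print(len(train), len(valid), len(test))
```

Splitting the pairs first and writing each side to its own file afterwards is one way to end up with one file per language and portion while keeping the two sides aligned.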

This data is licensed under the Creative Commons Attribution 2.0 France license (CC BY 2.0 FR).

Files:

README.md
test.en-nl.en
test.en-nl.nl
train.en-nl.en
train.en-nl.nl
valid.en-nl.en
valid.en-nl.nl
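The file names follow the pattern `<portion>.en-nl.<language>`. A minimal sketch of loading one portion as sentence pairs is given here. It assumes the two files of a portion are UTF-8 text with one sentence per line and are line-aligned, which the description above does not state explicitly. The helper name `read_parallel` is hypothetical.

```python
from pathlib import Path


def read_lines(path):
    """Return the lines of a UTF-8 text file without trailing newlines."""
    with open(path, encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f]


def read_parallel(data_dir, portion, src="en", tgt="nl"):
    """Read one portion ("train", "valid" or "test") as (source, target) pairs.

    Assumes line i of the source file is the translation of line i of the
    target file; a length mismatch is reported as an error.
    """
    base = Path(data_dir)
    src_lines = read_lines(base / f"{portion}.{src}-{tgt}.{src}")
    tgt_lines = read_lines(base / f"{portion}.{src}-{tgt}.{tgt}")
    if len(src_lines) != len(tgt_lines):
        raise ValueError(
            f"{portion}: {len(src_lines)} source lines vs {len(tgt_lines)} target lines"
        )
    return list(zip(src_lines, tgt_lines))
```

With the repository checked out in the current directory, `read_parallel(".", "train")` would return the training pairs under these assumptions.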

Repository: esther2000/MT-2022