NLP-dataset General NER dataset (English) CoNLL-2003 OntoNotes-5.0 Wikigold Twitter kaggle MUC6 MUC7 NER dataset (Chinese) RenMinRiBao MSRA Boson Weibo Machine Translation (Chinese-English) WMT 2018 AI challenger (英中翻译规模最大的口语领域英中双语对照数据集) UM-Corpus: A Large English-Chinese Parallel Corpus OpenSubtitles2016 MultiUN