Skip to content
This repository was archived by the owner on Jul 7, 2023. It is now read-only.

Commit f7f3346

Browse files
committed
Add data file urls for Macedonian-English
1 parent 0c66117 commit f7f3346

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

tensor2tensor/data_generators/generator_utils.py

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -244,6 +244,11 @@ def gunzip_file(gz_path, new_path):
244244
"http://www.statmt.org/wmt13/training-parallel-un.tgz",
245245
["un/undoc.2000.fr-en.en", "un/undoc.2000.fr-en.fr"]
246246
],
247+
# Macedonian-English
248+
[
249+
"https://github.com/stefan-it/nmt-mk-en/raw/master/data/setimes.mk-en.train.tgz", # pylint: disable=line-too-long
250+
["train.mk", "train.en"]
251+
],
247252
]
248253

249254

0 commit comments

Comments
 (0)