firefox-translations-training/utils
Greg Tatum 34b9c1d76c
Add an HPLT data importer (#837)
* Add hplt test data

* Add a way to hook into when read_lines switches locations

* Provide a way to create a test fixture file with a list of strings

* Add a min-fluency-threshold to monolingual datasets

* Add an HPLT monolingual data importer

* Add support for HPLT and OPUS monolingual data in the config generator

* Unify the hash_line uses with a WeakStringSet implementation
2024-09-16 11:43:18 -05:00
..
tasks Remove the Makefile and replace it with a Taskfile (#510) 2024-04-09 16:11:13 -05:00
build-mono-nllb.py Add a mono nllb build script (#780) 2024-08-02 10:18:36 -05:00
config_generator.py Add an HPLT data importer (#837) 2024-09-16 11:43:18 -05:00
download_hplt.py Add HPLT mono bulk importer (#645) 2024-05-29 14:25:08 -07:00
find_corpus.py Add an HPLT data importer (#837) 2024-09-16 11:43:18 -05:00
marian_client.py Add Marian server for model testing (#492) 2024-03-28 15:53:16 -07:00
preflight_check.py Add support for automatically continuing training from earlier runs of a Task (fixes #270) (#580) 2024-05-17 16:07:36 -04:00
run_model.py Remove the Makefile and replace it with a Taskfile (#510) 2024-04-09 16:11:13 -05:00
taskcluster_downloader.py Remove the Makefile and replace it with a Taskfile (#510) 2024-04-09 16:11:13 -05:00
tb_log_parser.py Add ruff and black linting to the CI (#187) 2023-09-08 09:50:24 -05:00
train.py restrict github-push taskcluster events to `main` (#777) 2024-09-06 13:53:31 -04:00