Граф коммитов

203 Коммитов

Автор SHA1 Сообщение Дата
Reuben Morais 5c8af86aca
Merge pull request #2200 from rhamnett/patch-2
Allow for different sample rate exports
2019-07-04 15:04:22 +00:00
Reuben Morais 182c405eeb
Merge pull request #2160 from mozilla/more-mandarin-importers
More mandarin importers
2019-06-27 17:43:24 +00:00
Reuben Morais 67b4f6826a Add importer for aidatatang_200zh corpus 2019-06-27 14:40:20 -03:00
Reuben Morais 67a769e0d7 Add importer for Free ST Chinese Mandarin Corpus 2019-06-27 14:40:20 -03:00
Reuben Morais ee78d471a2 Add importer for Primewords Chinese Corpus Set 1 2019-06-27 14:40:20 -03:00
Richard Hamnett c248ed0435
Allow for different sample rate exports
Sets output format to Sox transformer
2019-06-23 19:47:12 +01:00
Reuben Morais 4b305d2f5e Remove --use_seq_length flag 2019-06-18 19:38:59 -03:00
Alexandre Lissy f33ead8af9 Frame counter should be initialized on all importers
Fixes #2150
2019-06-09 07:43:58 +02:00
Alexandre Lissy 1313b51a5d Ensure frames counter is initialized
Fixes #2150
2019-06-06 10:42:26 +02:00
Alexandre Lissy 32a73b7224 Do not fail without --bogus-records 2019-06-05 16:04:45 +02:00
Alexandre Lissy 3a17896463 Do not import known bogus Lingua Libre records
Fixes #2147
2019-06-04 18:45:34 +02:00
Alexandre Lissy 17e3f284a5 Computing audio hours at import 2019-05-28 16:46:20 +02:00
Arey 8f806f7a3a fit PEP8 2019-05-22 13:05:17 +08:00
Arey fbedbbc9f9 make flag name more explicit 2019-05-22 12:47:11 +08:00
Arey 53f88f0c33 common voice mandarin 2019-05-17 16:26:22 +08:00
Alexandre Lissy af26d8b2bb Deprecate training on Python 2.7 2019-05-10 17:23:54 +02:00
Alexandre Lissy d41f98f25c Ensure TrainingSpeech is properly formatted
Fixes #2097
2019-05-06 15:58:36 +02:00
Alexandre Lissy 8402e7ac9b Update TrainingSpeech dataset
Fixes #2092

X-DeepSpeech: NOBUILD
2019-05-06 10:13:42 +02:00
Alexandre Lissy 333a175dfd Force only one GPU on LDC93S1 scripts
Fixes #2087
2019-04-30 17:35:26 +02:00
Reuben Morais feacdea4aa Add AISHELL dataset importer 2019-04-29 10:00:32 -03:00
Alexandre Lissy 664813134e LinguaLibre importer
Fixes #2067
2019-04-25 18:47:59 +02:00
Reuben Morais 9586fbbd30 Rename --train_cached_features_path to --feature_cache 2019-04-16 11:06:26 -03:00
Reuben Morais 97c36291af Rename epoch flag to epochs 2019-04-05 09:30:50 -03:00
Reuben Morais 2f3f095048 Ignore epochs in checkpoints, always start epoch count from zero 2019-04-05 00:21:04 -03:00
Reuben Morais 57450893ea
Merge pull request #1919 from mozilla/tfdatatest
Implement input pipeline with tf.data API
2019-04-05 00:13:48 -03:00
Reuben Morais ed15caf3c5 Check if train/dev/test files were passed in instead of having explicit flags 2019-04-04 22:41:38 -03:00
Kelly Davis 25a254f1fc
Merge pull request #2000 from mozilla/issue1906
Gram Vaani importer
2019-04-03 13:43:46 +02:00
kdavis-mozilla 0bc132cabe Addressed review comments 2019-04-03 12:33:30 +02:00
Reuben Morais d6babfb8f3 Speed up training tests and make sure they fully converge 2019-04-02 18:31:32 -03:00
Reuben Morais 6632504ad1 Don't overwrite exported graph from training task with the TFLite version 2019-04-02 18:31:32 -03:00
Reuben Morais 1cea2b0fe8 Rewrite input pipeline to use tf.data API 2019-04-02 18:31:32 -03:00
kdavis-mozilla 441ac5869f Gram Vaani importer 2019-04-02 19:55:58 +02:00
Tilman Kamp 94c088be87 Updated README, some code beautification 2019-04-02 19:41:33 +02:00
Tilman Kamp 7dc236bab4 Removed unnecessary default value 2019-04-02 18:02:23 +02:00
Tilman Kamp 8e78e17904 Some code beautification 2019-04-02 18:02:23 +02:00
Tilman Kamp 5645285d25 Fix #1991 - Additional import options for import_cv2.py 2019-04-02 18:02:23 +02:00
Tilman Kamp a179a2389f Fix #1986 - Remove distributed training support 2019-04-01 18:43:22 +02:00
Tilman Kamp 192dfcad8d Added missing import 2019-03-19 15:42:41 +01:00
Tilman Kamp 6c6a4e08ca Fix #1962 2019-03-18 18:53:04 +01:00
Tilman Kamp 238987a70f Fix #1955 2019-03-14 12:25:08 +01:00
Reuben Morais a7367d4b40 Make bin/run-ldc93s1.sh run faster 2019-03-13 10:18:16 -03:00
Quentin Brunet 179ba1b533 Fix import_cv2.py binary file + permissions 2019-03-04 13:30:30 +01:00
Nicolas Panel 881e55e842
Use latest TS dataset
see https://github.com/nicolaspanel/TrainingSpeech#last-releases--download for more info
2019-02-20 10:08:56 +01:00
Reuben Morais c1212ffbb2 Fix shape of loaded preprocessed features 2019-02-12 20:31:24 -02:00
josh 7bee1cf445 train/test/dev checking now made explicit 2019-02-01 15:28:32 +01:00
josh 9810946035 usage 2019-02-01 13:36:19 +01:00
josh e26aefc39b better handling of empty TSV dir / comments 2019-02-01 13:32:27 +01:00
josh e6e33eb3a7 import script for new CV 2019-02-01 00:38:02 +01:00
Nicolas Panel 08bea8480a
use TrainingSpeech release 2018-11-24_fr_FR
`2018-11-24_fr_FR` fixes many annotation issues (~500) such as empty audio etc.
2018-11-25 08:56:03 +01:00
kdavis-mozilla 81b16002b7 Fixed #1740 (Switchboard Importer Creates WAV's at 8KHz not 16KHz) 2018-11-24 08:22:38 +01:00