Tilman Kamp
|
9d36852f0e
|
Skipping samples also on fragment level problems
|
2020-08-11 11:02:04 +02:00 |
Tilman Kamp
|
85cfe7624f
|
Added missing catalogs; documentation fixes
|
2020-07-21 17:22:35 +02:00 |
Tilman Kamp
|
bcdbfee856
|
Fixes around custom scorer generation
|
2020-07-03 17:30:04 +02:00 |
Bias
|
e9ae8b7903
|
Enabled multithreaded transcribing again as I'm not sure if it's only my env which OOMs
|
2020-07-03 16:21:11 +02:00 |
Bias
|
6b7270c96a
|
Remove scorer after it's been used but only if it was generated
|
2020-07-03 16:21:11 +02:00 |
Bias
|
3dc0fbb44c
|
Cleaned up some things and fetching new en model for testing
|
2020-07-03 16:21:11 +02:00 |
Bias
|
5cfa401ff1
|
Making sure files are closed
|
2020-07-03 16:21:11 +02:00 |
Bias
|
9c09da471a
|
Fixed unicode in output
|
2020-07-03 16:21:11 +02:00 |
Bias
|
05c96731c4
|
Removing temp lm files
|
2020-07-03 16:21:11 +02:00 |
Bias
|
f6cf144382
|
Initial support for DeepSpeech 0.7.1
|
2020-07-03 16:21:11 +02:00 |
Tilman Kamp
|
39a633a434
|
Updated documentation and minor tool fixes
|
2020-07-01 17:59:46 +02:00 |
Tilman Kamp
|
f6a16d92a0
|
Fix empty set-assignments
|
2020-05-04 18:18:52 +02:00 |
Tilman Kamp
|
f3e594f566
|
Better progress logging in SDB tool
|
2020-03-09 12:08:53 +01:00 |
Tilman Kamp
|
e06a702756
|
Fix missing utf-8 decoding on SDB meta data reading
|
2020-03-05 15:37:33 +01:00 |
Tilman Kamp
|
ebb2e9721f
|
Simplified main routine
|
2020-02-27 16:05:19 +01:00 |
Tilman Kamp
|
3dac0f8db3
|
Progress reporting on meta file writing; filenames in log messages
|
2020-02-27 16:03:59 +01:00 |
Tilman Kamp
|
149805990e
|
Closing od SDB reader at end of SortingSDBWriter finalization
|
2020-02-27 16:02:26 +01:00 |
Tilman Kamp
|
d41c0b1ff7
|
Exporter: Second chance conversion and ability to skip samples on audio errors
|
2020-02-26 11:50:52 +01:00 |
Tilman Kamp
|
f8cd176b8d
|
Post-refactoring fix of exporter's de-biasing
|
2020-02-25 18:53:40 +01:00 |
Tilman Kamp
|
0c2ea1b983
|
Export plan as a cache for export preparation steps
|
2020-02-25 16:54:53 +01:00 |
Tilman Kamp
|
e03e830685
|
Better split-field checking
|
2020-02-25 16:10:41 +01:00 |
Tilman Kamp
|
ecd2a74906
|
Better checking for existing target paths
|
2020-02-25 16:09:41 +01:00 |
Tilman Kamp
|
8897bb6cc7
|
Refactored exporter for better maintenance and lower memory footprint; --tmp-dir option; CSV meta files
|
2020-02-25 15:44:37 +01:00 |
Tilman Kamp
|
536dc6b006
|
Exporter argument parsing as own function
|
2020-02-24 11:19:59 +01:00 |
Tilman Kamp
|
23a4569ba5
|
Using heapq.merge for interleaving
|
2020-02-21 19:08:37 +01:00 |
Tilman Kamp
|
061f77bb62
|
Updated custom mime-types
|
2020-02-21 19:07:45 +01:00 |
Tilman Kamp
|
3a05b79285
|
Removed tqdm from stats.py
|
2020-02-21 13:10:28 +01:00 |
Tilman Kamp
|
e788cee6dc
|
Fix #25
|
2020-02-21 13:02:35 +01:00 |
Tilman Kamp
|
e8fb5895ca
|
Additional parameter for SDB finalization sample buffer size
|
2020-02-20 17:16:55 +01:00 |
Tilman Kamp
|
0d45123b29
|
Progress logging: Prevent division by 0
|
2020-02-20 17:16:03 +01:00 |
Tilman Kamp
|
842d50a950
|
Fix for meta fields that are single values instead of lists
|
2020-02-20 13:57:05 +01:00 |
Tilman Kamp
|
4e296d4011
|
Changed some debug log-messages to info ones
|
2020-02-20 11:44:09 +01:00 |
Tilman Kamp
|
7993f214fd
|
Progress logging to stderr
|
2020-02-19 17:55:43 +01:00 |
Tilman Kamp
|
82d40d82f4
|
Better progress logging; Output of fragment meta data on sample cutting problem
|
2020-02-19 17:29:10 +01:00 |
Tilman Kamp
|
02305706d0
|
Remove incompatible stty sane call
|
2020-02-18 17:28:11 +01:00 |
Tilman Kamp
|
f7e2f7f0ab
|
Refactored CollectionSample to LabeledSample
|
2020-02-18 15:53:28 +01:00 |
Tilman Kamp
|
fe8588565f
|
Better progress logging in catalog tool
|
2020-02-18 14:08:27 +01:00 |
Tilman Kamp
|
7fa6773bc4
|
Fixes for logging and argument parsing
|
2020-02-18 14:07:08 +01:00 |
Tilman Kamp
|
28806dfff6
|
Fix #16
|
2020-02-18 14:05:53 +01:00 |
Tilman Kamp
|
b4a0e3bd85
|
Catalog tool
|
2020-02-17 18:27:41 +01:00 |
Tilman Kamp
|
06ab3a2d17
|
Meta data output independent of sample output format
|
2020-02-17 13:47:11 +01:00 |
Tilman Kamp
|
1383951191
|
Fix --no-progress option; better info logs
|
2020-02-17 12:34:56 +01:00 |
Tilman Kamp
|
aacbda1676
|
Ability to drop samples with unknown and/or multi-instance meta data
|
2020-02-17 11:41:55 +01:00 |
Tilman Kamp
|
3342067a0b
|
Meta data output on SDB export
|
2020-02-14 18:29:50 +01:00 |
Tilman Kamp
|
42556e1988
|
Late binding of opuslib, nicer progress bars on export
|
2020-02-13 12:21:25 +01:00 |
Tilman Kamp
|
dfc565190d
|
Progress indication during SDB finalization
|
2020-02-12 17:30:27 +01:00 |
Tilman Kamp
|
836b87f4a5
|
Removed debugging artifact
|
2020-02-12 14:46:13 +01:00 |
Tilman Kamp
|
bf9fcd15ad
|
Wave and dry-run support for SDB export
|
2020-02-12 13:55:49 +01:00 |
Tilman Kamp
|
bb51dbf193
|
Updated SDB support
|
2020-02-12 11:48:08 +01:00 |
Tilman Kamp
|
a8aa0ae2d5
|
Fix: Initializing Opus frame remainders with zeros
|
2020-02-10 11:43:32 +01:00 |