kaldi/egs/sprakbanken

README.txt

About the sprakbanken corpus:
    This is a free corpus originally collected by NST for ASR purposes and currently
    hosted by the Norwegian libraries. The corpus is multilingual and contains Swedish,
    Norwegian (Bokmål) and Danish speech. The current setup works for Danish. The vocabulary
    is large and there are approx. 350 hours of read-aloud speech with associated text scripts.



  s5: This is the current recommended recipe. (Danish)