kaldi/egs/timit/README.txt

About TIMIT:

  TIMIT is available from the LDC as corpus LDC93S1 and is one of the
  original clean-speech databases. The LDC catalog description
  (http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC93S1) reads:

  "The TIMIT corpus of read speech is designed to provide speech data 
   for acoustic-phonetic studies and for the development and evaluation
   of automatic speech recognition systems. TIMIT contains broadband
   recordings of 630 speakers of eight major dialects of American English,
   each reading ten phonetically rich sentences. The TIMIT corpus includes
   time-aligned orthographic, phonetic and word transcriptions as well as 
   a 16-bit, 16kHz speech waveform file for each utterance."

  Note: please do not use this TIMIT setup as a generic example of how to run
  Kaldi, because TIMIT has a very nonstandard structure.  Any of the other
  setups would serve better: for example, librispeech/s5 is quite nice and is
  free; yesno is tiny, fast to run, and also free; and wsj/s5 has an unusually
  complete set of example scripts, which may, however, be confusing.

Each subdirectory of this directory contains the scripts for a sequence
of experiments.

  s5: Monophone and triphone GMM/HMM systems trained with maximum likelihood,
      followed by SGMM and DNN recipes.
      Training uses 48 phonemes (see Lee and Hon, "Speaker-Independent
      Phone Recognition Using Hidden Markov Models", IEEE Transactions on
      Acoustics, Speech, and Signal Processing, vol. 37, no. 11,
      pp. 1641-1648, November 1989). For scoring we map to 39 phonemes, as
      is usually done in conference papers.
      Earlier versions of the TIMIT scripts were implemented by Navdeep Jaitly
      and Arnab Ghoshal. The current version was developed by Bagher BabaAli
      and is maintained by Karel Vesely (vesis84@gmail.com).
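
Sketch of the 48-to-39 scoring map:

  The folding step described for s5 can be illustrated with a short,
  standalone Python sketch. It is not part of the recipe and does not
  reproduce the recipe's scripts. The folding table is an assumption based on
  the merges commonly attributed to Lee and Hon (for example ao -> aa and
  ix -> ih); the mapping file shipped with the recipe is the authority. The
  names FOLD_48_TO_39, fold_phones, edit_distance and phone_error_rate are
  hypothetical.

```python
# Assumed folding table: nine of the 48 training phones are merged into
# other phones, leaving 39 scoring phones. Check it against the recipe's
# own mapping file before relying on it.
FOLD_48_TO_39 = {
    "ao": "aa", "ax": "ah", "cl": "sil", "el": "l", "en": "n",
    "epi": "sil", "ix": "ih", "vcl": "sil", "zh": "sh",
}


def fold_phones(phones):
    """Map each phone to its scoring label; unlisted phones are unchanged."""
    return [FOLD_48_TO_39.get(p, p) for p in phones]


def edit_distance(ref, hyp):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def phone_error_rate(ref, hyp):
    """Phone error rate computed after folding both sequences."""
    ref39, hyp39 = fold_phones(ref), fold_phones(hyp)
    return edit_distance(ref39, hyp39) / len(ref39)


# Made-up example sequences, for illustration only.
ref = "sil dh ix s cl k ao r sil".split()
hyp = "sil dh ih s sil k aa r sil".split()

print(edit_distance(ref, hyp) / len(ref))  # error rate on unfolded labels
print(phone_error_rate(ref, hyp))          # error rate after folding
```

  In this made-up example the three mismatches (ix/ih, cl/sil, ao/aa) all
  disappear once both sequences are folded, so confusions within a merged
  class are not counted as errors at scoring time.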