cntkcognitive-toolkitc-plus-plusc-sharpdeep-learningdeep-neural-networksdistributedjavamachine-learningneural-networkpython
3f0edb836f | ||
---|---|---|
BrainScript | ||
Common | ||
DataReader | ||
Demos | ||
Documentation | ||
ExampleSetups | ||
MachineLearning | ||
Math | ||
Scripts | ||
Tests | ||
license | ||
.gitattributes | ||
.gitignore | ||
CNTK.sln | ||
CppCntk.vssettings | ||
Makefile | ||
Makefile_kaldi.cpu | ||
Makefile_kaldi.gpu | ||
Makefile_kaldi2.cpu | ||
Makefile_kaldi2.gpu | ||
README | ||
ThirdPartyNotices.txt | ||
configure | ||
kaldi_vars.mk |
README
== Dev branch == This branch contains some features that are not yet checked into main branch. To enlist this branch, run git checkout origin/Dev == To-do == Add descriptions to LSTMnode Add descriptions to 0/1 mask segmentation in feature reader, delay node, and crossentropywithsoftmax node Change criterion node to use the 0/1 mask, following example in crossentropywithsoftmax node Add description of encoder-decoder simple network builder Add description of time-reverse node, simple network builder and NDL builder for bi-directional models == Author of the README == Kaisheng Yao Microsoft Research email: kaisheny@microsoft.com Wengong Jin, Shanghai Jiao Tong University email: acmgokun@gmail.com Yu Zhang, Leo Liu CSAIL, Massachusetts Institute of Technology email: yzhang87@csail.mit.edu email: leoliu_cu@sbcglobal.net Guoguo Chen CLSP, Johns Hopkins University email: guoguo@jhu.edu == Preeliminaries == To build the cpu version, you have to install intel MKL blas library or ACML library first. Note that ACML is free, where MKL may not be. for MKL: 1. Download from https://software.intel.com/en-us/intel-mkl for ACML: 1. Download from http://developer.amd.com/tools-and-sdks/cpu-development/amd-core-math-library-acml/ for Kaldi: 1. In kaldi-trunk/tools/Makefile, uncomment # OPENFST_VERSION = 1.4.1, and re-install OpenFst using the makefile. 2. In kaldi-trunk/src/, do ./configure --shared; make depend -j 8; make -j 8; and re-compile Kaldi (the -j option is for parallelization). To build the gpu version, you have to install NIVIDIA CUDA first == Build Preparation == Let $CNTK be the CNTK directory. >mkdir build >$CNTK/configure -h You will see various options for configure, as well as their default values. CNTK needs a CPU math directory, either acml or mkl. If you do not specify one and both are available, acml will be used. For GPU use, a cuda and gdk directory are also required. Similary, to build the kaldi plugin a kaldi directory is required. You may also specify whether you want a debug or release build. Rerun configure with the desired options. >$CNTK/configure ... This will create a Config.make and a Makefile (if you are in the $CNTK directory, a Makefile will not be created). The Config.make file records the configuration parameters and the Makefile reinvokes the $CNTK/Makefile, passing it the build directory where it can find the Config.make. After make completes, you will have the following directories: .build will contain object files, and can be deleted bin contains the cntk program lib contains libraries and plugins The bin and lib directories can safely be moved as long as they remain siblings. To clean >make clean == Run == All executables are in bin directory: cntk: The main executable for CNTK *.so: shared library for corresponding reader, these readers will be linked and loaded dynamically at runtime. ./cntk configFile=${your cntk config file} == Kaldi Reader == This is a HTKMLF reader and kaldi writer (for decode) To build, set KALDI_PATH in your Config.make The feature section is like: writer=[ writerType=KaldiReader readMethod=blockRandomize frameMode=false miniBatchMode=Partial randomize=Auto verbosity=1 ScaledLogLikelihood=[ dim=$labelDim$ Kaldicmd="ark:-" # will pipe to the Kaldi decoder latgen-faster-mapped scpFile=$outputSCP$ # the file key of the features ] ] == Kaldi2 Reader == This is a kaldi reader and kaldi writer (for decode) To build, set KALDI_PATH in your Config.make The features section is different: features=[ dim= rx= scpFile= featureTransform= ] rx is a text file which contains: one Kaldi feature rxspecifier readable by RandomAccessBaseFloatMatrixReader. 'ark:' specifiers don't work; only 'scp:' specifiers work. scpFile is a text file generated by running: feat-to-len FEATURE_RXSPECIFIER_FROM_ABOVE ark,t:- > TEXT_FILE_NAME scpFile should contain one line per utterance. If you want to run with fewer utterances, just shorten this file. (It will load the feature rxspecifier but ignore utterances not present in scpFile). featureTransform is the name of a Kaldi feature transform file: Kaldi feature transform files are used for stacking / applying transforms to features. An empty string (if permitted by the config file reader?) or the special string: NO_FEATURE_TRANSFORM says to ignore this option. ********** Labels ********** The labels section is also different. labels=[ mlfFile= labelDim= labelMappingFile= ] Only difference is mlfFile. mlfFile is a different format now. It is a text file which contains: one Kaldi label rxspecifier readable by Kaldi's copy-post binary.