Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
Перейти к файлу
Guoguo Chen ae694834b2 Moving -DHAVE_OPENFST_GE_10400 option from makefile to kaldi_vars.mk 2015-06-04 18:32:39 +00:00
CheckInSuites change cn.exe to cntk in examples setups, demos, tutorials, and check-in suites. 2015-05-19 14:10:45 +08:00
Common Adding code for smbr/mpfe training on Kaldi lattices; this is the initial working version, more will follow 2015-05-31 19:03:44 -04:00
DataReader Further fixes to sequence training on Kaldi lattices (move data from GPU to CPU before lattice computation) 2015-06-02 16:25:45 -04:00
Demos Merge branch 'master' of https://git.codeplex.com/cntk into linux-gcc 2015-05-19 16:04:47 -07:00
Documentation rename CNTKbook to include date. 2015-05-19 14:19:05 +08:00
ExampleSetups Merge branch 'master' of https://git.codeplex.com/cntk into linux-gcc 2015-05-19 16:04:47 -07:00
MachineLearning fix small issues in sequence training; undo one MPI related change in SGD.h that was accidently checked in yesterday. 2015-06-01 17:50:28 -04:00
Math Remove the redefinition of some function to make it can be build in linux side. 2015-05-24 13:42:58 -04:00
license
.gitignore The commits 32391e1..2a7ab60 should not be pushed to public (since it enables MKL and MPI) revert them in this commit. 2015-05-19 14:08:16 -07:00
CNTK.sln removed the Setups Solution folder for now, since it cannot be easily populated with VS 2015-05-18 23:51:46 -07:00
Makefile.cpu Merge branch 'master' of https://git.codeplex.com/cntk into linux-gcc 2015-05-19 16:04:47 -07:00
Makefile.gpu Add a comment about how to get nvml.h 2015-05-24 13:34:53 -04:00
Makefile_kaldi.cpu Visual Studio Solution cleanup: 2015-05-18 20:32:39 -07:00
Makefile_kaldi.gpu Visual Studio Solution cleanup: 2015-05-18 20:32:39 -07:00
Makefile_kaldi2.cpu Moving -DHAVE_OPENFST_GE_10400 option from makefile to kaldi_vars.mk 2015-06-04 18:32:39 +00:00
Makefile_kaldi2.gpu Moving -DHAVE_OPENFST_GE_10400 option from makefile to kaldi_vars.mk 2015-06-04 18:32:39 +00:00
README Merge branch 'master' of https://git.codeplex.com/cntk into linux-gcc 2015-05-19 16:04:47 -07:00
kaldi_vars.mk Moving -DHAVE_OPENFST_GE_10400 option from makefile to kaldi_vars.mk 2015-06-04 18:32:39 +00:00

README

== Author of the README ==
    Wengong Jin,
    Shanghai Jiao Tong University
    email: acmgokun@gmail.com

    Hakan Erdogan
    MERL, Sabanci University
    email: haerdogan@sabanciuniv.edu

    Yu Zhang, Leo Liu
    CSAIL, Massachusetts Institute of Technology
    email: yzhang87@csail.mit.edu
    email: leoliu_cu@sbcglobal.net

== Preeliminaries ==
To build the cpu version, you have to install intel MKL blas library or ACML library first. Note that ACML is free, where MKL may not be.

for MKL:
1. Download from https://software.intel.com/en-us/intel-mkl
2. You can modify variable MKL_PATH in makefile.cpu to change your mkl path.
Then add ${MKL_PATH}/mkl/lib/intel64, ${MKL_PATH}/mkl/lib/mic, ${MKL_PATH}/compiler/lib/intel64. ${MKL_PATH}/compiler/lib/mic to your ${LD_LIBRARY_PATH} to make sure the program links the library correctly.

for ACML:
1. Download from http://developer.amd.com/tools-and-sdks/cpu-development/amd-core-math-library-acml/
2. Modify ACML_PATH in the makefile.cpu and makefile.gpu to provide your ACML library path.
You need to add ${ACML_PATH}/lib to your ${LD_LIBRARY_PATH}.

To build the gpu version, you have to install NIVIDIA CUDA first
You can modify the path CUDA_PATH in makefile.cpu to change your cuda path
We use cuda-7.0 as default.
Then add ${CUDA_PATH}/lib, ${CUDA_PATH}/lib64 to your ${LD_LIBRARY_PATH} to make sure the program links to the library correctly.

... TODO: add documentation on nvml lib

== Build ==
To build the cpu version, run
	make -f Makefile.cpu 
To build the gpu version, run
	make -f Makefile.gpu
To clean the compile, just run
	make -f Makefile.cpu clean
or
	make -f Makefile.gpu clean

== Run ==
All executables are in bin/ directory:
	cn.exe: The main executable for CNTK
	*.so: shared library for corresponding reader, these readers will be linked and loaded dynamically at runtime.

To run the executable, make sure bin/ is in your ${LD_LIBRARY_PATH}, if not, running cn.exe will fail when cn.exe tries to link the corresponding reader. Once it's done, run in command line:
	./cn.exe configFile=${your config file}

== Kaldi Reader ==
This is a HTKMLF reader and kaldi writer (for decode)

To build the cpu/gpu version, run
    make -f Makefile_kaldi.cpu/gpu

The feature section is like:

writer=[
    writerType=KaldiReader
    readMethod=blockRandomize
    frameMode=false
    miniBatchMode=Partial
    randomize=Auto
    verbosity=1
    ScaledLogLikelihood=[
        dim=$labelDim$
        Kaldicmd="ark:-" # will pipe to the Kaldi decoder latgen-faster-mapped
        scpFile=$outputSCP$ # the file key of the features
    ]
]

== Kaldi2 Reader ==
This is a kaldi reader and kaldi writer (for decode)

To build the cpu/gpu version, run
    make -f Makefile_kaldi2.cpu/gpu


The features section is different:

features=[
    dim=
    rx=
    scpFile=
    featureTransform=
]

rx is a text file which contains:

    one Kaldi feature rxspecifier readable by RandomAccessBaseFloatMatrixReader.
    'ark:' specifiers don't work; only 'scp:' specifiers work.

scpFile is a text file generated by running:

    feat-to-len FEATURE_RXSPECIFIER_FROM_ABOVE ark,t:- > TEXT_FILE_NAME

    scpFile should contain one line per utterance.

    If you want to run with fewer utterances, just shorten this file.
    (It will load the feature rxspecifier but ignore utterances not present in scpFile).

featureTransform is the name of a Kaldi feature transform file:
    
    Kaldi feature transform files are used for stacking / applying transforms to features.

    An empty string (if permitted by the config file reader?) or the special string: NO_FEATURE_TRANSFORM
    says to ignore this option.

********** Labels **********

The labels section is also different.

labels=[
    mlfFile=
    labelDim=
    labelMappingFile=
]

Only difference is mlfFile. mlfFile is a different format now. It is a text file which contains:

    one Kaldi label rxspecifier readable by Kaldi's copy-post binary.