CNTK/Scripts
Alexey Orlov dff98bd28f Addressed Review comments 2016-07-14 09:50:07 +02:00
..
README.md Addressed Review comments 2016-07-14 09:50:07 +02:00
pytest.ini
txt2ctf.py
uci2ctf.py

README.md

This directory contains different script helping using different components of CNTK.

CNTK Text format Converters

Two Python Scripts for converting Data to CNTK Text format for using as an input for CNTK Text Format Reader (see https://github.com/microsoft/cnTK/wiki/CNTKTextFormat-Reader).

txt2ctf.py 

Converts a set of dictionary files and a plain text file to CNTK Text format. Run python txt2ctf.py -h to see usage instructions. See the comments in the beginning of the script file for the specific usage example.

uci2ctf.py

Converts data stored in a text file in UCI format to CNTK Text format. Run python uci2ctf.py -h to see usage instructions and example. Also see a usage example below:

python Scripts/uci2ctf.py --input_file Examples/Image/MNIST/Data/Train-28x28.txt --features_start 1 --features_dim 784 --labels_start 0 --labels_dim 1 --num_labels 10  --output_file Examples/Image/MNIST/Data/Train-28x28_cntk_text.txt

input_file – original dataset in the (columnar) UCI format features_start – index of the first feature column (start parameter in the UCIFastReader config, see https://github.com/Microsoft/CNTK/wiki/UCI-Fast-Reader) features_dim – number of feature columns (dim parameter in the UCIFastReader config) labels_start - index of the first label column labels_dim – number of label columns num_labels – number of possible label values (labelDim parameter in the UCIFastReader config) output_file – path and filename of the resulting dataset.