CNTK/Scripts
Joannes Vermorel 9c6a1692b8 ctf2bin.py upgraded to Python 3.3+ (#2053)
The script `ctf2bin.py` was broken under Python 3.
The fix provides minimal adjustments for the Python
version that is presently installed by CNTK deployment
scripts.

Fixes Microsoft/CNTK#2052
2017-06-27 10:15:20 -07:00
install updating links to old wiki - referencing now the doc site 2017-06-07 15:55:34 +02:00
README.md addressed CR comments 2017-06-07 16:50:37 +02:00
ctf2bin.py ctf2bin.py upgraded to Python 3.3+ (#2053) 2017-06-27 10:15:20 -07:00
pytest.ini Adding tests 2016-06-13 10:16:23 +02:00
txt2ctf.py txt2ctf.py: escape pipe symbol 2016-10-28 15:44:26 +02:00
uci2ctf.py updating links to old wiki - referencing now the doc site 2017-06-07 15:55:34 +02:00

README.md

This directory contains various scripts that support CNTK.

CNTK Binary Installers

The install directory contains the scripts that the CNTK binary download uses to install CNTK on a user's system. They are NOT intended to be run from this location in the repository.

  • install/windows - A script for installing a Windows CNTK binary drop, cf. here.
  • install/linux - A script for installing a Linux CNTK binary drop, cf. here.

CNTK Text format Converters

Two Python scripts convert data to CNTK Text format, for use as input to the CNTK Text Format Reader (https://docs.microsoft.com/en-us/cognitive-toolkit/BrainScript-CNTKTextFormat-Reader).

Convert Dictionary to Text

txt2ctf.py converts a set of dictionary files and a plain text file to CNTK Text format.

Run python txt2ctf.py -h to see usage instructions. See the comments at the beginning of the script file for a specific usage example.
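To illustrate the kind of conversion involved, here is a minimal sketch of mapping tokens to dictionary indices and writing them as sparse CTF samples. It is not the code of txt2ctf.py. The function names `load_dictionary` and `sequence_to_ctf`, the stream names `S0`, `S1`, ..., and the conventions used (one token per dictionary line, zero-based line number as index, one dictionary per input column) are assumptions made for this example; check the comments in the script for its actual behavior.

```python
def load_dictionary(lines):
    """Map each token to its zero-based line number (assumed convention)."""
    return {line.strip(): index for index, line in enumerate(lines)}


def sequence_to_ctf(seq_id, columns, dictionaries):
    """Convert one sequence (one text per stream) to a list of CTF lines.

    columns      -- list of strings, one space-separated text per stream
    dictionaries -- list of token-to-index dicts, one per stream
    """
    token_lists = [text.split() for text in columns]
    n_rows = max(len(tokens) for tokens in token_lists)
    ctf_lines = []
    for row in range(n_rows):
        parts = [str(seq_id)]  # every line of a sequence repeats its ID
        for stream, (tokens, vocab) in enumerate(zip(token_lists, dictionaries)):
            if row < len(tokens):
                # Sparse one-hot sample, written as index:value
                parts.append("|S{} {}:1".format(stream, vocab[tokens[row]]))
        ctf_lines.append("\t".join(parts))
    return ctf_lines


if __name__ == "__main__":
    # Hypothetical two-stream example with tiny in-memory dictionaries
    en = load_dictionary(["the", "cat", "sat"])
    fr = load_dictionary(["le", "chat"])
    for ctf_line in sequence_to_ctf(0, ["the cat sat", "le chat"], [en, fr]):
        print(ctf_line)
```

Streams of different lengths simply stop contributing samples once they run out of tokens, so the shorter stream is absent from the trailing lines of the sequence.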

Convert UCI Format to Text

uci2ctf.py converts data stored in a text file in UCI format to CNTK Text format.

Run python uci2ctf.py -h to see usage instructions and an example.

For example:

python Scripts/uci2ctf.py --input_file Examples/Image/MNIST/Data/Train-28x28.txt --features_start 1 --features_dim 784 --labels_start 0 --labels_dim 1 --num_labels 10 --output_file Examples/Image/MNIST/Data/Train-28x28_cntk_text.txt
  • input_file - original dataset in the (columnar) UCI format
  • features_start - index of the first feature column (start parameter in the UCIFastReader config, see here)
  • features_dim - number of feature columns (dim parameter in the UCIFastReader config)
  • labels_start - index of the first label column
  • labels_dim - number of label columns
  • num_labels - number of possible label values (labelDim parameter in the UCIFastReader config)
  • output_file - path and filename of the resulting dataset.
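
The way these parameters select columns can be sketched as follows. This is a simplified illustration, not the code of uci2ctf.py. The function name `uci_row_to_ctf`, the stream names `labels` and `features`, and the one-hot expansion of each label column to `num_labels` values are assumptions made for this example.

```python
def uci_row_to_ctf(line, features_start, features_dim,
                   labels_start=0, labels_dim=0, num_labels=0):
    """Convert one whitespace-separated UCI row to a single CTF line."""
    values = line.split()
    parts = []
    if labels_dim:
        labels = values[labels_start:labels_start + labels_dim]
        if num_labels:
            # Assumption: each label column holds a class index that is
            # expanded to a one-hot vector of length num_labels.
            encoded = []
            for label in labels:
                one_hot = ["0"] * num_labels
                one_hot[int(float(label))] = "1"
                encoded.extend(one_hot)
            labels = encoded
        parts.append("|labels " + " ".join(labels))
    features = values[features_start:features_start + features_dim]
    parts.append("|features " + " ".join(features))
    return " ".join(parts)


if __name__ == "__main__":
    # Toy row: label in column 0, two feature columns starting at column 1
    print(uci_row_to_ctf("3 0.5 0.25", features_start=1, features_dim=2,
                         labels_start=0, labels_dim=1, num_labels=4))
```

With the MNIST settings from the command above (labels_start 0, labels_dim 1, num_labels 10, features_start 1, features_dim 784), this sketch would treat the first column of each row as the digit class and the following 784 columns as pixel values.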