Amit Agarwal
64fae7a0bc
Implementation of data-parallel SGD with 1bit quantization of gradients for reduced communication costs
2015-08-05 09:23:33 -07:00
UnderdogGeek
8794397ab9
DiagonalNode and DebugUtil.
2015-08-05 10:44:09 +10:00
chenguoguo
d7c06ef221
Small fix to sequence training
2015-08-04 19:45:16 -04:00
chenguoguo
21d35a0552
Bug fix: fixing dimensionality mismatch after using MEL for sequence training model
2015-08-03 18:30:51 -04:00
Scott Cyphers
ad5b8a20f7
Fix g++ compilation warnings
...
sprintf_s is snprintf, not sprintf
Need to use more symbols from SGD
Add externs for missing file name generators that seem to be not reachable
and not defined.
2015-08-03 14:36:33 -04:00
chenguoguo
d79e00090c
Fixing mixing LSTM state when buffering minibatches
2015-08-01 01:11:53 +00:00
chenguoguo
5996b45890
Some cosmetic change in Kaldi2Reader
2015-07-30 18:00:05 +00:00
chenguoguo
b417ea8caf
Limit number of overflow warnings in Kaldi2Reader; Add configuration for setting overflow value
2015-07-30 17:16:32 +00:00
Hakan Erdogan
d8c5334b41
comment out unnecessary output check and print in Kaldi2Reader that causes problems with non-probability outputs
2015-07-30 04:31:33 -04:00
Dong Yu
0000f40a58
Merge branch 'linux-gcc' of https://git01.codeplex.com/cntk into linux-gcc
2015-07-29 10:40:57 -07:00
Dong Yu
61514b108a
refine PastValueNode and FutureValueNode. Fixed a bug in FutureValueNode When it's not inside a loop.
2015-07-29 10:40:45 -07:00
chenguoguo
95776f8818
Bug fixing regarding m_uttDerivBuffer is NULL
2015-07-29 16:04:21 +00:00
Dong Yu
2489646708
change the pastValueNode and FutureValueNode to support NO_FEATURE and NO_LABEL cases better.
2015-07-28 23:55:21 -07:00
chenguoguo
a31304adf8
Adding interface for utterance derivative computation in the Kaldi2Reader
2015-07-29 03:30:56 +00:00
chenguoguo
94e040a723
Fixing end-of-epoch issue for sequence training
2015-07-27 08:59:16 +00:00
chenguoguo
3cdea4e670
Skipping utterances without lattice or alignment in sequence training
2015-07-26 22:14:24 +00:00
chenguoguo
c1925bb606
adding verbosity support in DataReader/Kaldi2Reader/HTKMLFWriter.cpp
2015-07-25 09:53:30 +00:00
chenguoguo
d98c21f3b3
Changing NO_LABEL to NO_INPUT in Kaldi2Reader
2015-07-25 00:14:48 +00:00
chenguoguo
a0bb630616
updating sentence boundary mark for Kaldi2Reader, HTKMLFReader and KaldiReader
2015-07-24 23:04:31 +00:00
chenguoguo
4b9c8b25fc
Merge branch 'linux-gcc' of https://git.codeplex.com/cntk into linux-gcc
2015-07-24 16:55:29 +00:00
chenguoguo
f0107498aa
Adding suppport for parallelized sequence training in Kaldi2Reader
2015-07-24 16:55:21 +00:00
Yu
fb02f1509a
Merge remote-tracking branch 'origin/master' into linux-gcc
2015-07-23 10:17:26 -04:00
Yu
a839c118b0
Fix some minor bug in Kaldi2Reader.
2015-07-23 10:07:54 -04:00
Dong Yu
fce1212959
Merge branch 'master' of https://git01.codeplex.com/cntk into linux-gcc
...
Conflicts:
DataReader/LMSequenceReader/SequenceReader.cpp
2015-07-22 09:53:02 -07:00
UnderdogGeek
c437f852b6
Rename NDRMParser to SparsePCParser.
2015-07-23 00:15:09 +10:00
UnderdogGeek
41d9515814
Converting NDRM reader to SparsePC reader.
2015-07-23 00:02:39 +10:00
Yinggong ZHAO
a48278ae08
Remove useless merged code
2015-07-21 21:30:10 -07:00
Yinggong ZHAO
8c77b0cbe0
Merge branch 'master' of https://git01.codeplex.com/cntk
...
Conflicts:
DataReader/LMSequenceReader/SequenceReader.cpp
modified: DataReader/LMSequenceReader/SequenceReader.cpp
2015-07-21 21:26:24 -07:00
Yinggong ZHAO
94058b109f
set both for Label Outputs in SequenceReader
2015-07-21 21:24:53 -07:00
ksyao
a567807aa2
Use CPUdevice only for class-based cross entropy for LM reader
2015-07-21 08:54:52 -07:00
ksyao
54e9809e42
LM Sequence reader to use CPU device for labels. Also updated LSTM LM training setup.
2015-07-21 08:07:59 -07:00
Yinggong ZHAO
9ffa9df9c6
Add transfer in getoutput
2015-07-21 06:01:10 -07:00
Yinggong ZHAO
ae76821812
Set shuffle always true, set m_evalMode under old version
2015-07-21 05:35:16 -07:00
Yinggong ZHAO
3e46e6e78d
fix dataend issue, and support old-version model for test
...
modified: DataReader/LMSequenceReader/SequenceReader.cpp
modified: MachineLearning/CNTK/TrainingCriterionNodes.h
2015-07-20 20:11:52 -07:00
Dong Yu
c74cdb58f9
Merge branch 'master' of https://git01.codeplex.com/cntk into linux-gcc
2015-07-20 16:57:26 -07:00
Mike Seltzer
9bb8d4df3a
minor changes based on code review
2015-07-20 12:50:47 -07:00
Mike Seltzer
45e79ad6d8
fix problems in reader when files have duration mismatch
2015-07-20 12:50:45 -07:00
Dong Yu
d1e14783b6
changed feature packing flag names from utteranceXYZ and sentence_XYZ to SequenceXYZ to make them consistent and be understood for both speech and text people.
...
Added flag to indicate the condition of no feature (with assumption that there is also no label) to support efficient bi-directional model training.
Updated the windows version of HTKMLFReader, LM and LU Sequence readers.
Caution: this change will not allow for using past and future label information in the recurrent networks. The ultimate solution is to have a flag for each feature read-in. However, this will require huge change and we will do this after reader refactorization to reduce effort.
2015-07-18 17:28:14 -07:00
chenguoguo
bbf49009d1
Merge branch 'master' to 'linux-gcc'
2015-07-17 23:42:22 +00:00
chenguoguo
d54e2d30fe
Adding multi-utterance support in Kalsi2Reader for Truncated == false
2015-07-17 21:25:10 +00:00
chenguoguo
abc106dd2f
Merge branch 'master' into linux-gcc
2015-07-14 21:52:08 +00:00
Yu
8515a8d2f9
Merge branch 'master' of https://git.codeplex.com/cntk
2015-07-13 22:10:34 -04:00
Yu
fda6fb62ce
Fix multi-io bug for Kaldi2Reader.
2015-07-13 22:09:43 -04:00
Vladimir Ivanov
016f112b21
Bugfix for linux reader: 'no labels for xxx.mfc'
2015-07-13 15:03:48 -07:00
Yu
c944e0da24
Fix bug when m_truncated=false.
2015-07-13 05:49:16 -04:00
Yu
d75b3392ca
Fix build and fix assertion error when we do multi-io training.
2015-07-12 01:17:31 -04:00
Dong Yu
6ae4115b4d
Merge branch 'master' of https://git01.codeplex.com/cntk into linux-gcc
...
Conflicts:
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/NetworkDescriptionLanguage.cpp
MachineLearning/CNTK/SynchronousExecutionEngine.h
2015-07-11 14:58:18 -07:00
chenguoguo
94e54dac66
fixing sentence boundary issues for sequence training
2015-07-11 08:26:15 +00:00
UnderdogGeek
02b36c822d
First implementation of NDRMReader.
2015-07-09 23:26:00 +10:00
Yinggong ZHAO
9be47d5562
Fix potential issue in SequenceReader
2015-07-09 02:12:08 -07:00
Vladimir Ivanov
2c6c940a8f
Build braek fix
2015-07-08 15:49:16 -07:00
Yu
f96ac201a6
Fix non-sequence reader.
2015-07-08 16:52:44 -04:00
chenguoguo
631857bb94
Fixing feature and labels names in Kaldi2Reader; also adding some explanation in README of how to compile CNTK with Kaldi
2015-07-08 01:56:50 +00:00
Yu Zhang
776c20b20a
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
Common/Include/DataReader.h
Documentation/CNTK-TechReport/lyx/CNTKBook_CNTK_Chapter.lyx
MachineLearning/CNTK/TrainingCriterionNodes.h
Math/Math/Matrix.cpp
2015-07-07 17:00:17 -07:00
Yu Zhang
e37ddb8787
Fix Windows builds.
2015-07-07 16:33:14 -07:00
Yu
ba1a969e8e
Fix multi-IO.
2015-07-07 16:11:23 -04:00
Yinggong ZHAO
204b879dfe
Finish GPU NCE training
2015-07-06 20:56:16 -07:00
Yu
471f710f49
Add sentence end information to the reader.
2015-07-06 04:04:26 -04:00
Yu Zhang
f9245c2acc
Merge remote-tracking branch 'origin/dongyu/dev' into linux-gcc
...
Conflicts:
DataReader/HTKMLFReader/HTKMLFReader.cpp
DataReader/HTKMLFReader_linux/HTKMLFReader.cpp
DataReader/KaldiReader/HTKMLFReader.cpp
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/TrainingCriterionNodes.h
Makefile_kaldi2.gpu
2015-07-05 22:53:04 -07:00
Yu Zhang
f0ceb4b395
Merge with master branch.
2015-07-05 22:19:18 -07:00
Yu Zhang
b86990e638
Merge remote-tracking branch 'origin/master' into dongyu/dev
...
Conflicts:
CNTK.sln
Common/Include/Basics.h
DataReader/HTKMLFReader/HTKMLFReader.cpp
DataReader/HTKMLFReader_linux/HTKMLFReader.cpp
DataReader/KaldiReader/HTKMLFReader.cpp
Documentation/CNTK-TechReport/lyx/CNTKBook_CNTK_Adv_Chapter.lyx
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/LinearAlgebraNodes.h
MachineLearning/CNTK/NetworkDescriptionLanguage.cpp
MachineLearning/CNTK/NonlinearityNodes.h
MachineLearning/CNTK/SynchronousExecutionEngine.h
MachineLearning/CNTK/TrainingCriterionNodes.h
Makefile_kaldi2.gpu
2015-07-05 22:16:57 -07:00
Dong Yu
c2b8eceb4e
change definition of SENTENCE_BEGIN etc.
...
Add operator |= to MinibatchPackingFlag
Add support of UtteranceEnd in MinibatchPackingFlag to support bi-directional model in later
2015-07-05 16:57:34 -07:00
Yinggong ZHAO
90be0b8d90
Update NCE GPU Code, forward phase is ready
2015-07-02 05:44:19 -07:00
Yu
df1d881b1d
Fix linux build (use popen in linux instead of _wpopen).
2015-07-02 04:21:36 -04:00
Yu Zhang
04c81b434d
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
CNTK.sln
Common/Include/Basics.h
DataReader/HTKMLFReader/HTKMLFReader.cpp
DataReader/HTKMLFReader_linux/HTKMLFReader.cpp
DataReader/KaldiReader/HTKMLFReader.cpp
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/NetworkDescriptionLanguage.cpp
MachineLearning/CNTK/NonlinearityNodes.h
MachineLearning/CNTK/SynchronousExecutionEngine.h
2015-07-01 23:35:59 -07:00
Yu
9db9bb2604
Feed several frame into the minibatch if there is no new utterance. not the best way, just make it consistent with other readers.
2015-07-02 00:46:14 -04:00
Frank Seide
3aab1d267f
File now supports pipes ("cmd|", "|cmd") and "-" (stdin/stdout) for input and output;
...
stderr redirection now uses File class instead of auto_file_ptr, allowing e.g. stderr="|tee path.log";
new method File::Flush();
class File implementation now uses RuntimeError() and LogicError() instead of throw std::runtime_error() and logic_error();
moved RuntimeError() and class Plugin from basetypes.h to Basics.h;
new VS++ CRT emulation function _wpopen() (Linux only_
2015-07-01 16:17:29 -07:00
chenguoguo
e613a79915
Fix Kaldi2Reader for LSTM -- the previous commit fails after couple of epoch due to a bug
2015-07-01 21:13:22 +00:00
Vladimir Ivanov
f5ce4c8315
Windows->Linux: finalizing reconciliation of Linux vs. Windows version of HTKMLFReader
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
5c590f7417
Windows->Linux: 85f7553 fix bug in multi utterance reader when mlf and feature file are mismatched in duration
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
566c71d796
Windows->Linux: 51daad8 moved nocase_compare to Basics.h, and removed....
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
fe3bcfd1fb
Windows->Linux: 743c5a9 replaced printf format %ws by %ls which is portable between Visual Studio and gcc
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
c6e2d7147d
Windows->Linux: db7b574 Further depress the message in HKTMLFReader
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
61ea159c67
Windows->Linux: bc092c4 Disable redundant information output from releaserandomizedchunk when verbosity =0
2015-07-01 14:02:14 -07:00
Vladimir Ivanov
19926adf49
Windows->Linux - 826ec09 1) Added the missing CNTKMath project dependency...
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
7da17823be
Windows->Linux fd314f1 Modify HTKMLFReader/fileutil.cpp to support load in MLF file larger than 2 GB.
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
120e150617
Windows->Linux: 5444c25 bug fixes in HTKMLFReader: randomizationNone for rollingwindow and context window setting for write
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
36bae006dc
Windows->Linux: a93a2a1 Eliminate "requiredata:..." message when verbosity == 0
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
20045cd5ea
Windows->Linux: 9e741ec Further remove message in HTKMLFReader when verbosity == 0
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
56e992b183
Windows->Linux: c3aad5b Using verbosity in HTKMLFReader to control the output.
2015-07-01 14:02:13 -07:00
Vladimir Ivanov
c88e4cf20d
Whitespace-only change: undoing some whitespace changes
...
Those changes made simulatenously with porting of HTKMLFReader to linux.
Reverting those changes allows better code comparison and helps to
reconcile windows vs. linux versions of the reader
2015-07-01 14:02:13 -07:00
Yu
dc6c7db1c9
Clean HTKMLFReader_linux
2015-06-30 16:21:34 -04:00
chenguoguo
a3450a7d36
Merge remote-tracking branch 'origin/dongyu/dev' into linux-gcc
2015-06-30 15:00:58 +00:00
Yu
a7c5de7b52
Fix all the reader to use the new interface
2015-06-29 02:16:23 -04:00
chenguoguo
27b00ea842
linux-gcc: merging master
2015-06-29 02:17:21 +00:00
Dong Yu
72326cb325
use vector of flags to indicate each column's packing state.
...
fixed bugs in the original ResetBounds call in the delay node.
2015-06-27 00:02:21 -07:00
Yu
8adff3546b
Make Kaldi2Reader to consistent with the new interface
2015-06-26 00:48:34 -04:00
Yu
e980861763
Fix the reader speed issue when we have multiple cntk on one machine
2015-06-25 21:34:41 -04:00
Yu
d703a2b718
Merge branch 'Dev' of https://git.codeplex.com/cntk into Dev
...
Conflicts:
MachineLearning/CNTK/RecurrentNodes.h
2015-06-21 01:54:54 -04:00
Yu
e2f52516be
Add change to support load multiple readers
2015-06-20 16:57:58 -04:00
kaisheny
f332421b7b
Merge with new changes including RowStackNode
2015-06-16 16:01:19 -07:00
Dong Yu
99af4139a5
Merge branch 'Dev' of https://git01.codeplex.com/cntk with master
...
fix compilation bug in HTKMLFReader
Conflicts:
.gitignore
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/ComputationNode.h
MachineLearning/CNTK/LinearAlgebraNodes.h
MachineLearning/CNTK/NetworkDescriptionLanguage.cpp
MachineLearning/CNTK/NonlinearityNodes.h
MachineLearning/CNTK/SynchronousExecutionEngine.h
2015-06-16 11:49:19 -07:00
kaisheny
01468f3fb6
Merge branch 'Dev' of https://git01.codeplex.com/cntk into Dev
2015-06-16 07:28:25 -07:00
Yu Zhang
16c121df3d
Merge branch 'Dev' of https://git01.codeplex.com/cntk
2015-06-14 23:31:22 -07:00
Yu Zhang
b0fd604d1d
Fix the compile error on windows for htkmlfreader
2015-06-14 23:31:08 -07:00
Yu
31171bfc1b
Merge branch 'Dev' of https://git.codeplex.com/cntk into Dev
...
Conflicts:
MachineLearning/CNTK/RecurrentNodes.h
2015-06-15 02:28:07 -04:00
Yu
3994dab33e
Fix some compile errors on linux and also change the matrix multiply to for loop for the delay node.
2015-06-15 02:08:04 -04:00
Dong Yu
c44aba7aed
Merge branch 'master' of https://git01.codeplex.com/cntk
2015-06-10 13:45:07 -07:00
Dong Yu
346dc11cde
fix bugs in the binaryReader and UCIFastReader
2015-06-10 13:44:54 -07:00
kaisheny
32dde12e37
local change in dev branch before merging it to smt branch.
2015-06-08 22:12:44 -07:00
Yu
e4424d5697
Fix the comile on linux for kaldi reader
2015-06-07 18:19:26 -04:00
Mike Seltzer
02080fc0f6
fix bug in multi utterance reader when mlf and feature file are mismatched in duration
2015-06-04 13:12:10 -07:00
Dong Yu
4b29673fda
Fix the error throw bugs in UCIParser. Now if a file cannot be opened the error will be thrown and caught and cntk will exit gracefully.
2015-06-03 16:57:02 -07:00
kaisheny
1239acfe4c
Make reader to be a meta reader that can loads multiple readers
2015-06-02 17:45:26 -07:00
chenguoguo
d436bf2ac8
Further fixes to sequence training on Kaldi lattices (move data from GPU to CPU before lattice computation)
2015-06-02 16:25:45 -04:00
chenguoguo
a42229b8a7
bug fix: frameMode was always false in Kaldi2Reader due to my previous commit
2015-06-02 02:46:11 -04:00
Yinggong ZHAO
026426ad08
In sequencereader move labelOutput to DeviceId and remove useless code in NCE-LSTM, which will move data from GPU to CPU
2015-06-01 23:29:25 -07:00
chenguoguo
12e4543373
fix small issues in sequence training; undo one MPI related change in SGD.h that was accidently checked in yesterday.
2015-06-01 17:50:28 -04:00
chenguoguo
99d99b3887
Adding code for smbr/mpfe training on Kaldi lattices; this is the initial working version, more will follow
2015-05-31 19:03:44 -04:00
chenguoguo
b87775a353
Minor changes to make CNTK compile with gcc-4.7
2015-05-31 18:19:39 -04:00
kaisheny
ebe575481e
LUSequenceReader : if obs is larger than its dimension, check if no-label is set.
2015-05-29 22:44:35 -07:00
Yu
f9a851b485
Replace some window specific function
2015-05-28 20:42:19 -04:00
kaisheny
4b12277e0a
merge with master branch code
2015-05-28 11:15:03 -07:00
kaisheny
d3b2007147
Remove OrganizeClass function from sequence reader.
2015-05-28 10:15:07 -07:00
kaisheny
0eddd57f8d
Use CPU matrix in Sequence reader. Specify unk in reader.
2015-05-27 17:12:59 -07:00
kaisheny
a78f80d5b5
serialization of NCE evalMode for NoiseContrastiveEstimationNode. Seperate training and evaluation nodes for NCE because they require different mode. Report error when run NCE in GPU because that is not supported yet.
2015-05-26 16:55:35 -07:00
kaisheny
088ed7c066
LM Sequence Reader uses the new way to reset RNN state
2015-05-26 13:54:01 -07:00
Yinggong ZHAO
7cc257b45c
add unnormalized evaluation for NCE
2015-05-26 02:27:23 -07:00
kaisheny
0ba2bc05c0
LUSequenceReader changes: use unk word if a word is not i no observed list. Support read word class information from LU Sequence Reader (mode=class)
2015-05-24 23:14:58 -07:00
Amit Agarwal
5863e2e0f5
Minor format specifier cross-platform fixes
2015-05-22 00:25:44 -07:00
kaisheny
862d1f04cc
small comment change for LUSequenceReader. Add to-do in recurrentnodes.h for delaynode.
2015-05-20 17:15:39 -07:00
kaisheny
b1f7263326
Merge from master branch
2015-05-20 17:11:52 -07:00
kaisheny
7014ed24ba
Reader changes to use new way of reseting RNN states and criterion nodes. Use 0/1 to denote sentence begining and middle. -1 to denote no label. Reader needs to pass two matrices of segmentation informatoin. One is a matrix for saving -1/0/1 information and the other is a one-row matrix saving 0/1 mask. In the latter case ,0 means existence of sentence begining or no labels; 1 means no such case.
2015-05-20 14:02:25 -07:00
Yinggong ZHAO
848104dcda
add comments for BatchSequenceReader::GetLabelOutput
2015-05-20 02:23:31 -07:00
amitaga
c8666ba3a8
Fix a couple of Linux build issues
2015-05-19 17:00:51 -07:00
Amit Agarwal
5afa3d73e5
Merge branch 'linux-gcc' of https://git01.codeplex.com/cntk
2015-05-19 16:21:56 -07:00
amitaga
91a27cd9fc
Merge branch 'master' of https://git.codeplex.com/cntk into linux-gcc
...
Conflicts:
Common/Include/minibatchsourcehelpers.h
DataReader/DSSMReader/DSSMReader.cpp
DataReader/HTKMLFReader/HTKMLFReader.cpp
DataReader/LMSequenceReader/SequenceReader.cpp
DataReader/LMSequenceReader/minibatchsourcehelpers.h
DataReader/LUSequenceReader/LUSequenceReader.cpp
DataReader/LUSequenceReader/minibatchsourcehelpers.h
DataReader/LibSVMBinaryReader/LibSVMBinaryReader.cpp
DataReader/UCIFastReader/UCIFastReader.cpp
MachineLearning/CNTK/ComputationNetwork.h
MachineLearning/CNTK/SGD.h
MachineLearning/CNTK/SimpleNetworkBuilder.h
MachineLearning/CNTK/SynchronousExecutionEngine.h
MachineLearning/cn/PTaskGraphBuilder.cpp
Makefile.cpu
Makefile.gpu
Math/Math/NoGPU.cpp
2015-05-19 16:04:47 -07:00
Frank Seide
c31fa8bf28
added ... expansion to HTKMLFReader, allows pathnames in SCP files thata re relative to the SCP file itself (... stands for the directory of the SCP file)
2015-05-19 15:45:26 -07:00
Frank Seide
a059635386
moved nocase_compare to Basics.h, and removed unnecessary dependencies caused by this on commandArgUtil.h, hoping for less recompilation;
...
renamed PreprocessConfigLine() to StripComments();
updated CNTKBook PDF
2015-05-19 14:32:21 -07:00
erw
0ff74f0405
The commits 32391e1..2a7ab60 should not be pushed to public (since it enables MKL and MPI) revert them in this commit.
...
Reverted commit:
8404e9c8de
"Enable MKL"
64a37301f9
"Enable MPI_SUPPORT"
61c6d59e04
"add a unit test for CNTKEval"
2015-05-19 14:08:16 -07:00
erw
61c6d59e04
add a unit test for CNTKEval
...
This unit test is used privately by erw.
Conflicts:
CNTKSolution/EvalUnitTest/CNTKEValTest.cpp
CNTKSolution/EvalUnitTest/EvalUnitTest.vcxproj
CNTKSolution/EvalUnitTest/EvalUnitTest.vcxproj.filters
2015-05-19 12:31:46 -07:00
Frank Seide
89ea3f3448
replaced printf format %ws by %ls which is portable between Visual Studio and gcc
2015-05-19 00:24:08 -07:00
kaisheny
3490c66615
Rename to NO_LABELS and ResetForNoLabels
2015-05-18 23:18:49 -07:00
Frank Seide
5445952422
(removed a defunct reference from the UCIFastReader VS Project)
2015-05-18 20:37:41 -07:00
Frank Seide
178555d0a4
Visual Studio Solution cleanup:
...
moved minibatchsourcehelpers.h to Common/Include, as it is shared amongst readers (exception: the HTK reader, which has a different version);
fixed some incorrect #include of basetypes.h;
copying of DLLs now does not copy if the file already exists and is not out of date;
prebuild.bat now only updates builddate.h if it has changed, to avoid CNTK.cpp to be recompiled each time;
fixed a few build warnings in the CUDA kernels (uninitialized variables);
removed unused PTasks references in Linux makefiles
2015-05-18 20:32:39 -07:00
Frank Seide
08625d482d
created a new header file Basics.h, to take on the whole of the legacy basetypes.h (which was to contain a very limited set of generally useful core functions, like UTF8 conversion)
2015-05-18 19:36:04 -07:00
Frank Seide
8856fa40e9
Merge branch 'master' of https://git.codeplex.com/cntk
2015-05-18 18:51:31 -07:00
Frank Seide
392d64da08
various cleanups:
...
renamed cn.exe to CNTK.exe on Windows and "cntk" (no more .exe!) on Linux;
renamed 'cn' folder to 'CNTK';
moved CNTK.sln out from CNTKSolution folder, which now no longer needs to exist;
removed Win32 build target from Visual Studio Solution and all Projects;
added are documentation and Linux makefiles to the Visual Studio Solution for easy access;
removed unused PTasks files
2015-05-18 18:51:20 -07:00
Yinggong ZHAO
3dbed2fc89
use default seed for sampler
2015-05-18 05:18:00 -07:00
unknown
dad030381e
modified: SequenceReader.cpp
...
modified: SequenceReader.h
2015-05-18 00:46:27 -07:00
Yinggong ZHAO
0462453a1b
modified: SequenceReader.h
2015-05-16 22:07:38 +08:00
Yu
d4e025e35f
Repeat the first couple of labels when set delay targets
2015-05-08 08:34:52 -04:00
Yu
204419ba80
Add delay output option to the Kaldi2Reader
2015-05-07 21:03:29 -04:00
kaisheny
edd8c08040
LUSequenceREader use number of samples instead of number of sentences so that it is consistent to what is expected in SGD.h.
2015-05-06 11:04:07 -07:00
kaisheny
9beea90918
deal with no-blank lines at the end of file in LU Sequence Reader
2015-05-06 00:05:20 -07:00
kaisheny
a93ada346b
Fix LU sequence reader bug to get the id of end of sentence symbol
2015-05-05 13:50:38 -07:00
kaisheny
be0f8aaf24
Removed unused code in LUSequenceReader. update getframe function to use sparse input.
2015-05-01 11:55:51 -07:00
erw
0b72e35836
Further depress the message in HKTMLFReader
2015-04-30 14:14:51 -07:00
ksyao
ef0113ce0d
LUSequenceReader : Use local CPUsparsematrix to read data and then pass data to GPU/CPU sparsematrix as features
2015-04-30 12:02:42 -07:00
ksyao
04b0b0e1a1
Chanage LU Sequence reader to work on sparse input
2015-04-29 09:31:31 -07:00
kaisheny
3a89639d40
Add nogpu functions for RCRF related functions
2015-04-27 22:22:17 -07:00
kaisheny
93c895ed0e
Merge with master branch
2015-04-22 23:20:34 -07:00
kaisheny
cd42428e07
Fix bug in LUSequence Reader, otherwise read from begining even for single pass of data.
2015-04-21 22:37:06 -07:00
kaisheny
0f74cd8dd2
reset to doc begining when read to end.
2015-04-21 18:09:16 -07:00
kaisheny
fad6322a29
Simplified LU sequence by removing unnecessary code. Also use wstring to support reading wchar strings. Need to make sure that large amount of data can be read, because reader reads at most CACHE_BLOG_SIZE data.
2015-04-20 16:04:39 -07:00
Chiyuan Zhang
94327e2a18
Fix compile error for old gcc version
2015-04-15 23:06:50 -04:00
Chiyuan Zhang
ff80ec1b7b
Fix some compile error for old gcc
2015-04-15 23:05:04 -04:00
kaisheny
0ece27ccc5
LUSequenceWriter outputs results to stderr
2015-04-13 15:48:33 -07:00
kaisheny
f694eda4c1
Set label uses CPU Device
2015-04-10 15:43:55 -07:00
kaisheny
43cb5e0e25
Add InitProposals, GetSentencendIdFromOutputLabel and GetProposals in dataread. Update DataEnd in LUSequenceReader so that every minibatch works as sentence-level minibatch.
2015-04-10 14:19:14 -07:00
kaisheny
b6610615a9
Add ignoresentencebegin tag to treat sentence begining frames as in-the-middle-of-sentence frame. Useful for telling BPTT to use state from external, such as states from another network, e.g., encoder network.
2015-04-09 14:49:55 -07:00
Yu
9b0b06ec6f
Merge remote-tracking branch 'origin/master' into linux-gcc
2015-04-09 14:35:50 -04:00
Dong Yu
9ed33db20b
fixed several warning that happens under gcc.
2015-04-09 02:10:38 -07:00
Yu
319ea82d56
Clean the warnings.
2015-04-09 01:09:58 -04:00
kaisheny
14ce7f4f2f
Use -1,0,1 to denote no observation, sentence begining and in-the-middle-of-sentence. changed LU sequence reader. But other readers haven't changed accordingly.
2015-04-08 21:20:12 -07:00
Yu
0e5a54bc44
Fix compile error
2015-04-08 13:25:35 -04:00
Yu
5ef4bcd790
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
MachineLearning/cn/CompositeComputationNode.h
Math/Math/CPUMatrix.cpp
2015-04-07 00:35:17 -04:00
erw
4683c541b6
Disable redundant information output from releaserandomizedchunk when verbosity =0
2015-04-05 21:41:33 -07:00
Yu
66a8ea9e3d
Merge remote-tracking branch 'origin/master' into linux-gcc
2015-04-05 16:31:26 -04:00
Yu
ef4c833a4d
Check in the KaldiReader (version2)
2015-04-05 16:28:16 -04:00
kaisheny
13c49fc3ad
LUSequenceReader changes: using multiple io stream.
2015-04-03 14:03:17 -07:00
Dong Yu
f4d63f7352
change the ReadClassInfo function in the LM sequence reader so that it also works when the training set is large.
2015-04-01 20:44:02 -07:00
Dong Yu
8e5e24fbd6
fix compile errors in LibSVMBinaryReader
2015-03-30 23:12:58 -07:00
Amit Agarwal
6a1ce07561
1) Added the missing CNTKMath project dependency to project LMSequenceReader to enforce correct ordering of the build. 2) A bunch of changesto fix the Windows build. 3) Changed Linux makefiles to use CUDA 7.0
2015-03-27 10:36:32 -07:00
Yu
e1c964cfa1
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
DataReader/LMSequenceReader/SequenceParser.h
DataReader/LMSequenceReader/SequenceReader.cpp
MachineLearning/cn/ComputationNetwork.h
Math/Math/CPUMatrix.cpp
Math/Math/CPUSparseMatrix.cpp
Math/Math/GPUMatrix.cu
2015-03-25 22:15:58 -04:00
Yu
df6087ba76
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
DataReader/SequenceReader/SequenceReader.cpp
MachineLearning/cn/ComputationNode.h
MachineLearning/cn/cn.cpp
Math/Math/CPUSparseMatrix.cpp
Math/Math/GPUMatrix.cu
Math/Math/GPUSparseMatrix.cu
Math/Math/GPUSparseMatrix.h
2015-03-25 21:49:59 -04:00
Dong Yu
77416059e2
fixed bugs when MKL is turned on.
...
change sequencereader folder to LMSequenceReader.
2015-03-24 22:58:11 -07:00
Yu
34b46e6499
Add KaldiReader into the source tree. Actually, it is a HTKMLFReader with kaldi-io.
2015-03-20 16:24:46 -04:00
Yu
d35570ba6d
Fixed the error for the LM modeling in linux side. Bug: undefined behavior when use index[a] = index.size()
2015-03-20 15:57:31 -04:00
Dong Yu
a99c4636fd
Port to CUDA 7.0
...
Changed the writeWordAndClass command to produce the vocabulary, frequency, and word to class mapping information directly from the training set.
2015-03-19 01:04:49 -07:00
Dong Yu
bfb0de4764
display file, line, and function name info when a feature is not implemented.
...
display warning on dropout and temp memory for CNN only when non-default value is set.
change the name of Sequencereader project to LMSequenceReader
Change the corresponding config file that use LMSequenceReader.
2015-03-11 23:05:23 -07:00
Dong Yu
8beb5249c9
change TrainRNN to Train in all the config files since TrainRNN and Train now do the same thing.
...
Rewrite the RNNLM.NDL for class based Rnn LM.
removed dead code in SequenceReader.cpp
Change SparseLearnableParameters in the simple network builder to Learnable Parameters.
change the argument name and comment in the ClassBasedCrossEntropyWithSoftmaxNode to make it clearer the meaning of each input argument.
2015-03-09 00:28:55 -07:00
Yu
cba7c67101
Fix bug in HCIFastReader (segmentation fault because labelIdType is uninit)
2015-03-01 20:59:07 -05:00
Yu
c4a1ee1031
Merge remote-tracking branch 'origin/master' into linux-gcc
...
Conflicts:
DataReader/SequenceReader/SequenceReader.cpp
MachineLearning/cn/ComputationNetwork.h
MachineLearning/cn/NetworkDescriptionLanguage.cpp
MachineLearning/cn/SGD.h
MachineLearning/cn/cn.cpp
Math/Math/CPUMatrix.cpp
Math/Math/CPUSparseMatrix.cpp
Math/Math/CPUSparseMatrix.h
Math/Math/GPUMatrix.cu
Math/Math/GPUSparseMatrix.cu
Math/Math/GPUSparseMatrix.h
2015-03-01 20:01:26 -05:00
Yu
e33afa7b79
Add Kaldi header to cntk
2015-03-01 14:56:07 -05:00
Yu
89a9e8935f
Fix a bug in HTKMLFWriter for Linux.
2015-02-28 22:52:16 -05:00
kaisheny
8ebdd580fa
add ranomization in lu sequence reader
2015-02-24 21:30:30 -08:00
kaisheny
c9f7164ef6
Set sentence begining by looking at the input sequence instead of output sequence. Ignore Sparse matrix in gradient check (to-do to fix this)
2015-02-24 16:22:41 -08:00
thhoens
5a2ae4fd8a
Full commit of the LibSVMBinaryReader. Somehow the cpp/h files were not added before.
2015-02-17 16:05:47 -08:00
thhoens
7fe0af4446
Merge branch 'thhoens' of http://vstfpg07:8080/tfs/msrspeech/msrspeech/_git/cntk
2015-02-16 19:05:49 -08:00
Dong Yu
5ecc77cb06
Merge branch 'master' of https://git01.codeplex.com/cntk into DSSM-Internal
...
Conflicts:
CNTKSolution/CNTKSolution.sln
MachineLearning/cn/ComputationNetwork.h
MachineLearning/cn/TrainingCriterionNode.h
Math/Math/CPUMatrix.cpp
Math/Math/CPUMatrix.h
Math/Math/GPUMatrixCUDAKernels.cu
Math/Math/Matrix.h
2015-02-16 13:35:59 -08:00
Dong Yu
22d7769b04
make Resize and SwitchToMatrixType to use explicit specification of arguments to avoid unexpected behavior.
...
Completely remove PTask. This makes code clean and much easier to understand.
Added SetMatrixFromCSCFormat to CPU sparse matrix.
Reimplemented Set(Get)MatrixFromCSC(CSR)Format functions to take CPUSPARSE_INDEX_TYPE. changed GPU sparse matrices' persistence code accordingly.
2015-02-16 13:18:57 -08:00
kaisheny
5651eeb210
Update sequence reader to use CPUDEVICE for label always.
2015-02-15 22:25:44 -08:00
kaisheny
775c0e501a
Update sequence reader to assign class information to output label.
2015-02-14 22:36:21 -08:00
kaisheny
41a1338092
Update ClassBasedCrossEntropyNode. Modified sequence reader to use dense matrix to represent labels at each time. Cleaned SGD.h to remove SetOtherInfo function that calls class-based LM setup functions, which should be done in reader. Updated simple network builder for class-based LM. Added MinusOneAt function for Matrix. Updated CNTK book chapter.
2015-02-14 21:38:23 -08:00
thhoens
25674402db
Merge branch 'thhoens' of http://vstfpg07:8080/tfs/msrspeech/msrspeech/_git/cntk
2015-02-13 12:22:46 -08:00
kaisheny
f2004053a3
Output missing label information when the label is not found.
2015-02-13 11:17:16 -08:00
Dong Yu
da76314061
Merge branch 'thhoens' of http://vstfpg07:8080/tfs/msrspeech/msrspeech/_git/cntk into DSSM-Internal
2015-02-12 18:02:30 -08:00
Dong Yu
b67ca20a9f
Merge branch 'master' of http://vstfpg07:8080/tfs/msrspeech/msrspeech/_git/cntk into DSSM-Internal
...
Conflicts:
Common/ConfigFile.cpp
Common/File.cpp
Common/fileutil.cpp
MachineLearning/cn/cn.cpp
Math/Math/GPUMatrix.cu
Math/Math/GPUMatrixCUDAKernels.cu
Math/Math/GPUSparseMatrix.cu
2015-02-11 17:35:08 -08:00
kaisheny
c1e94a0ce0
Update LUSequence Parser to suport blank lines as sentence ending
2015-02-11 10:10:15 -08:00