CNTK/Math at 71c5aaba063ab82c745bcc4288a93ce1ea0685f5 - CNTK

KeDengMS 40d0b79451 Add trace for nodes with abs sum to make compare easier, with trace level 1000	2017-01-05 13:09:12 -08:00
BatchNormalizationEngine.cpp	CNTK v2 library: Several bug fixes	2016-09-20 22:54:07 -07:00
BatchNormalizationEngine.h	minor change in definition of BatchNormEngine to avoid warnings during compilation with cuda8 compiler	2016-10-18 18:46:15 +02:00
BlockHandlerAVX.cpp	Implement code review feedback from clemensm	2016-06-24 08:40:48 -07:00
BlockHandlerAVX.h	Fix path differing only in case. Fix hard tabs.	2016-09-19 09:13:31 +02:00
BlockHandlerSSE.cpp	enabling cross-compile of the cntk sources for arm64 devices.	2016-10-14 10:09:40 +02:00
BlockHandlerSSE.h	Implement code review feedback from clemensm	2016-06-24 08:40:48 -07:00
BlockMultiplier.h	Add math unit tests and adapt them for Linux	2016-07-13 16:03:06 +02:00
BlockMultiplierMatrixUtil.h	Implement code review feedback from clemensm	2016-06-24 08:40:48 -07:00
BlockMultiplierPlatform.h	Implement comments from code review. Have not tried to compile.	2016-06-16 16:17:21 -07:00
CPUMatrix.cpp	Align randomization across platforms	2016-12-21 17:31:22 +01:00
CPUMatrix.h	Align randomization across platforms	2016-12-21 17:31:22 +01:00
CPURNGHandle.cpp	Align randomization across platforms	2016-12-21 17:31:22 +01:00
CPURNGHandle.h	Align randomization across platforms	2016-12-21 17:31:22 +01:00
CPUSparseMatrix.cpp	Add trace for nodes with abs sum to make compare easier, with trace level 1000	2017-01-05 13:09:12 -08:00
CPUSparseMatrix.h	Fix sparse matrix MultiplyAndAdd bug that old value may not be kept	2016-12-06 23:49:01 -08:00
CUDAPageLockedMemAllocator.cpp	Fix path differing only in case. Fix hard tabs.	2016-09-19 09:13:31 +02:00
CUDAPageLockedMemAllocator.h	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
CntkBatchNormalization.cuh	CntkBatchNormalization.cuh: fix for batchSize == 1	2016-08-25 15:37:23 +02:00
ColumnQuantizer.h	tuned >1-bit SGD: odd #quantization levels, range now 4 stddevs (before: 5)	2016-08-21 01:33:01 -07:00
CommonMatrix.h	added TempDeviceBuffer to CommonMatrix.h	2016-11-19 21:53:24 +01:00
Convolution.cuh	Addressed BS CR comments	2016-10-04 14:55:10 +02:00
ConvolutionEngine.cpp	CNTK core: Implemented an optimization to elide the initial zeroing and subsequent accumulation into gradients for nodes with just one parent/accestor node	2016-12-12 12:06:17 -08:00
ConvolutionEngine.h	CNTK core: Implemented an optimization to elide the initial zeroing and subsequent accumulation into gradients for nodes with just one parent/accestor node	2016-12-12 12:06:17 -08:00
ConvolveGeometry.h	Fix autoPad issue using GetAutoPad().	2017-01-04 16:26:33 -08:00
CuDnnBatchNormalization.cu	CntkBatchNormalization.cuh: fix for batchSize == 1	2016-08-25 15:37:23 +02:00
CuDnnCommon.cpp	Changes Addressing Code Review for CUDNN RNNStack Node	2016-08-23 10:40:34 -07:00
CuDnnCommon.h	Address CR comments	2016-08-23 20:32:03 +02:00
CuDnnConvolutionEngine.cu	Fix autoPad issue using GetAutoPad().	2017-01-04 16:26:33 -08:00
CuDnnFactories.h	Introducing global flags	2016-09-01 10:26:18 +02:00
CuDnnRNN.cpp	Changes Addressing Code Review for CUDNN RNNStack Node	2016-08-23 10:40:34 -07:00
CuDnnRNN.h	(made gcc happy)	2016-08-24 19:51:02 -07:00
DataTransferer.cpp	First version of GPU prefetch	2016-09-21 12:00:04 +02:00
DataTransferer.h	First version of GPU prefetch	2016-09-21 12:00:04 +02:00
GPUDataTransferer.cpp	Remove statics from prefetch transferer	2016-12-07 11:57:48 +01:00
GPUDataTransferer.h	Remove statics from prefetch transferer	2016-12-07 11:57:48 +01:00
GPUMatrix.cu	cuda related vs2015 changes	2016-11-29 11:57:01 +01:00
GPUMatrix.h	Merge branch 'master' into v-yuxgu/memory-optv2	2016-11-08 16:14:12 +08:00
GPUMatrixCUDAKernels.cuh	Fix sparse matrix MultiplyAndAdd bug that old value may not be kept	2016-12-06 23:49:01 -08:00
GPURNGHandle.cu	Use 8 bytes for RNG(Handle) seed on both platforms.	2016-12-13 12:37:18 +01:00
GPURNGHandle.h	Use 8 bytes for RNG(Handle) seed on both platforms.	2016-12-13 12:37:18 +01:00
GPUSparseMatrix.cu	Add trace for nodes with abs sum to make compare easier, with trace level 1000	2017-01-05 13:09:12 -08:00
GPUSparseMatrix.h	fixes for dense times sparse	2016-11-19 21:53:24 +01:00
GPUTensor.cu	optimized DelayedValueNode by using non-syncing mask	2016-09-05 16:46:16 -07:00
GPUTensor.h	merged from master	2016-07-26 16:51:19 -07:00
GPUWatcher.cu	normalized inconsistent commenting style from "//text" to "// text"	2016-01-22 13:58:47 -08:00
GPUWatcher.h	License change	2016-01-18 09:36:17 +01:00
Helpers.h	Inlining checks	2016-06-21 17:38:54 +02:00
Math.vcxproj	Align randomization across platforms	2016-12-21 17:31:22 +01:00
Math.vcxproj.filters	Quantized multiplier v1	2016-11-27 21:25:26 -08:00
MathCUDA.vcxproj	.vcxproj: remove <BrowseInformation/> and <Bscmake/>	2016-10-18 10:29:54 +02:00
MathCUDA.vcxproj.filters	merge	2016-10-12 13:25:17 +02:00
Matrix.cpp	Add trace for nodes with abs sum to make compare easier, with trace level 1000	2017-01-05 13:09:12 -08:00
Matrix.h	CNTK V2 library: Added Value::Create method to batch sparse sequences	2016-12-16 01:27:22 -08:00
MatrixQuantizerCPU.cpp	tuned >1-bit SGD: odd #quantization levels, range now 4 stddevs (before: 5)	2016-08-21 01:33:01 -07:00
MatrixQuantizerCPU.h	White-space (remove CRs that crept in)	2016-01-25 10:24:10 +01:00
MatrixQuantizerGPU.cu	First version of distributed aggregator for v2	2016-10-19 13:49:10 +02:00
MatrixQuantizerGPU.h	normalized inconsistent commenting style from "//text" to "// text"	2016-01-22 13:58:47 -08:00
MatrixQuantizerImpl.cpp	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
MatrixQuantizerImpl.h	First version of all reduce aggregator	2016-10-24 19:25:36 +02:00
MatrixQuantizer_kernel.cu	normalized inconsistent commenting style from "//text" to "// text"	2016-01-22 13:58:47 -08:00
MemAllocator.h	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
NcclComm.cpp	Source/Math/NcclComm.cpp: tune logging	2016-11-09 14:00:53 +00:00
NcclComm.h	Added NCCL support to SimpleDistGradAggregator	2016-11-08 18:15:01 -08:00
NoGPU.cpp	Use 8 bytes for RNG(Handle) seed on both platforms.	2016-12-13 12:37:18 +01:00
QuantizedMatrix.cpp	addressing CR comments.	2016-11-23 23:12:32 +01:00
QuantizedMatrix.h	First version of all reduce aggregator	2016-10-24 19:25:36 +02:00
QuantizedOperations.h	Quantized multiplier v1	2016-11-27 21:25:26 -08:00
Quantizers.h	Quantized multiplier v1	2016-11-27 21:25:26 -08:00
RNGHandle.cpp	Use 8 bytes for RNG(Handle) seed on both platforms.	2016-12-13 12:37:18 +01:00
RNGHandle.h	Use 8 bytes for RNG(Handle) seed on both platforms.	2016-12-13 12:37:18 +01:00
RNNCommon.h	Incorporated CR feedback	2016-11-17 17:14:34 -08:00
TensorOps.h	ReduceLogSum: beautifications	2016-08-05 14:08:54 +02:00
TensorView.cpp	Quantized multiplier v1	2016-11-27 21:25:26 -08:00
TensorView.h	Quantized multiplier v1	2016-11-27 21:25:26 -08:00
ValueQuantizer.h	fix more warning descriptions	2016-11-23 12:01:31 +01:00
cudabasetypes.h	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
cudadevice.h	Removing unused dependencies	2016-06-21 17:38:54 +02:00
cudalattice.cpp	cuda related vs2015 changes	2016-11-29 11:57:01 +01:00
cudalattice.h	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
cudalatticeops.cu	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
cudalatticeops.cu.h	cuda related vs2015 changes	2016-11-29 11:57:01 +01:00
cudalatticeops.h	make parameter explicit by adding namespace	2016-11-23 10:39:35 +01:00
cudalib.cpp	normalized inconsistent commenting style from "//text" to "// text"	2016-01-22 13:58:47 -08:00
cudalib.h	Re-format code using clang-format (plus some post-processing)	2016-01-18 09:36:14 +01:00
dllmain.cpp	License change	2016-01-18 09:36:17 +01:00
latticefunctionskernels.h	reviewed uses of atomicAdd(),;	2016-05-05 11:02:21 -07:00
stdafx.cpp	License change	2016-01-18 09:36:17 +01:00
stdafx.h	renamed EsotericNodes.h to SpecialPurposeNodes.h, and CompositeComputationNodes.h to PreComputeNodes.h;	2016-01-22 10:45:53 -08:00
targetver.h	License change	2016-01-18 09:36:17 +01:00
