# CNTK Current Iteration

## Efficient group convolution
The implementation of group convolution in CNTK has been updated. The updated implementation no longer creates a sub-graph for group convolution (using slicing and splicing); instead, it calls the cuDNN7 and MKL2017 APIs directly. This improves both performance and model size.
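
The difference between the two approaches can be sketched in plain Python. This is an illustration only, not CNTK, cuDNN or MKL code: it uses a 1D convolution over nested lists to stay short, and the function names `conv1d`, `group_conv_slice_splice` and `group_conv_direct` are hypothetical. The first variant mimics the sub-graph approach (slice per group, convolve, splice), while the second computes the same result as a single operation.

```python
def conv1d(x, w):
    """Valid cross-correlation. x: [C_in][L], w: [C_out][C_in][K]."""
    k = len(w[0][0])
    out_len = len(x[0]) - k + 1
    return [
        [
            sum(w_o[c][j] * x[c][t + j] for c in range(len(x)) for j in range(k))
            for t in range(out_len)
        ]
        for w_o in w
    ]


def group_conv_slice_splice(x, w, groups):
    """Sub-graph style: slice input per group, convolve each slice, splice outputs."""
    cin_g = len(x) // groups
    cout_g = len(w) // groups
    out = []
    for g in range(groups):
        x_slice = x[g * cin_g:(g + 1) * cin_g]
        w_slice = w[g * cout_g:(g + 1) * cout_g]
        out.extend(conv1d(x_slice, w_slice))  # splice along the channel axis
    return out


def group_conv_direct(x, w, groups):
    """Single-op style: each output channel indexes straight into its group's inputs."""
    cin_g = len(x) // groups
    cout_g = len(w) // groups
    k = len(w[0][0])
    out_len = len(x[0]) - k + 1
    out = []
    for o, w_o in enumerate(w):
        base = (o // cout_g) * cin_g  # first input channel of this output's group
        out.append([
            sum(w_o[c][j] * x[base + c][t + j] for c in range(cin_g) for j in range(k))
            for t in range(out_len)
        ])
    return out


# Toy depthwise case: 4 channels of length 4, 4 groups, kernel size 2.
x = [[1, 2, 3, 4], [0, 1, 0, 1], [2, 2, 2, 2], [1, 0, -1, 0]]
w = [[[1, 0]], [[0, 1]], [[1, 1]], [[1, -1]]]  # shape [C_out][C_in / groups][K]

spliced = group_conv_slice_splice(x, w, groups=4)
direct = group_conv_direct(x, w, groups=4)
print(spliced == direct)
```

Both functions return the same values; the sub-graph variant simply needs one slice and one convolution per group plus a final splice, which is the overhead the updated implementation avoids.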

As an example, for a single group convolution op with the following attributes:

- Input tensor (C, H, W) = (32, 128, 128)
- Number of output channels = 32 (channel multiplier is 1)
- Groups = 32 (depthwise convolution)
- Kernel size = (5, 5)
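
To make these attributes concrete, the sketch below counts the weights of such a layer with the standard grouped-convolution formula, output channels × (input channels / groups) × kernel height × kernel width. The helper `group_conv_weight_count` is a hypothetical name used for illustration, and bias terms are ignored.

```python
def group_conv_weight_count(in_channels, out_channels, groups, kernel_size):
    """Number of weights in a grouped 2D convolution, ignoring any bias."""
    if in_channels % groups or out_channels % groups:
        raise ValueError("groups must divide both channel counts")
    k_h, k_w = kernel_size
    # Each output channel only sees the in_channels // groups inputs of its group.
    return out_channels * (in_channels // groups) * k_h * k_w


# Attributes from the example: C = 32, 32 output channels, 32 groups, 5x5 kernel.
depthwise = group_conv_weight_count(32, 32, groups=32, kernel_size=(5, 5))
# The same layer as an ordinary (groups = 1) convolution, for comparison only.
dense = group_conv_weight_count(32, 32, groups=1, kernel_size=(5, 5))

print(depthwise)  # 800
print(dense)      # 25600
```

With 32 groups, each filter covers a single input channel, so the layer holds 32 times fewer weights than the ungrouped equivalent.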

The comparison numbers for this single node are as follows:

| Implementation | GPU exec. time (ms, avg. of 1000 runs) | CPU exec. time (ms, avg. of 1000 runs) | Model size (KB, CNTK format) |
| ------------- | ------------- | ------------- | ------------- |
| Old implementation | 9.349 | 41.921 | 38 |
| New implementation | 6.581 | 9.963 | 5 |
| Speedup/savings | Approx. 30% | Approx. 65-75% | Approx. 87% |
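
The savings row can be recomputed from the old and new rows as a relative reduction, (old - new) / old. The snippet below is a small sketch of that arithmetic; the helper `percent_saving` and the dictionary keys are names chosen here for illustration.

```python
def percent_saving(old, new):
    """Relative reduction from old to new, as a percentage of old."""
    return (old - new) / old * 100.0


# (old, new) pairs copied from the comparison table.
measurements = {
    "GPU exec. time (ms)": (9.349, 6.581),
    "CPU exec. time (ms)": (41.921, 9.963),
    "Model size (KB)": (38, 5),
}

savings = {name: percent_saving(old, new) for name, (old, new) in measurements.items()}

for name, value in savings.items():
    print(f"{name}: {value:.1f}% lower")
```

For the averaged values in the table this gives roughly 29.6% for GPU time, 76.2% for CPU time and 86.8% for model size. The CPU point estimate lands slightly above the approximate range quoted in the table.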

## Operators


## Bug fixes


## ONNX
### Updates
- Updated CNTK's ONNX BatchNormalization op export/import to latest spec.

### Bug or minor fixes:


## Misc
Node timing and profile details are available in chrome://tracing format. A working example is in ./Examples/Image/Classification/MLP/Python/SimpleMNIST.py. Note that node timing is added to the profiler details when the profiler is enabled, i.e. with the following calls, in this order:

- `import cntk as C`
- `C.debugging.debug.set_node_timing(True)`
- `C.debugging.start_profiler()`
- `C.debugging.enable_profiler()`
- run the `trainer|evaluator|function` executions
- `trainer|evaluator|function.print_node_timing()`
- `C.debugging.stop_profiler()`