caffe/docs/performance_hardware.md

62 строки
2.0 KiB
Markdown
Исходник Обычный вид История

2014-04-03 03:59:45 +04:00
---
2014-09-04 04:44:28 +04:00
title: Performance and Hardware Configuration
2014-04-03 03:59:45 +04:00
---
# Performance and Hardware Configuration
2014-04-03 03:59:45 +04:00
To measure performance on different NVIDIA GPUs we use the Caffe reference ImageNet model.
2014-04-03 03:59:45 +04:00
For training, each time point is 20 iterations/minibatches of 256 images for 5,120 images total. For testing, a 50,000 image validation set is classified.
2014-04-03 03:59:45 +04:00
**Acknowledgements**: BVLC members are very grateful to NVIDIA for providing several GPUs to conduct this research.
2014-04-03 03:59:45 +04:00
## NVIDIA K40
2014-04-03 03:59:45 +04:00
Performance is best with ECC off and boost clock enabled. While ECC makes a negligible difference in speed, disabling it frees ~1 GB of GPU memory.
2014-04-03 03:59:45 +04:00
Best settings with ECC off and maximum clock speed:
2014-04-03 03:59:45 +04:00
* Training is 26.5 secs / 20 iterations (5,120 images)
* Testing is 100 secs / validation set (50,000 images)
2014-04-03 03:59:45 +04:00
Other settings:
2014-04-03 03:59:45 +04:00
* ECC on, max speed: training 26.7 secs / 20 iterations, test 101 secs / validation set
* ECC on, default speed: training 31 secs / 20 iterations, test 117 secs / validation set
* ECC off, default speed: training 31 secs / 20 iterations, test 118 secs / validation set
2014-04-03 03:59:45 +04:00
### K40 configuration tips
2014-04-03 03:59:45 +04:00
For maximum K40 performance, turn off ECC and boost the clock speed (at your own risk).
2014-04-03 03:59:45 +04:00
To turn off ECC, do
2014-04-03 03:59:45 +04:00
sudo nvidia-smi -i 0 --ecc-config=0 # repeat with -i x for each GPU ID
2014-04-03 03:59:45 +04:00
then reboot.
Set the "persistence" mode of the GPU settings by
sudo nvidia-smi -pm 1
and then set the clock speed with
sudo nvidia-smi -i 0 -ac 3004,875 # repeat with -i x for each GPU ID
but note that this configuration resets across driver reloading / rebooting. Include these commands in a boot script to intialize these settings. For a simple fix, add these commands to `/etc/rc.local` (on Ubuntu).
## NVIDIA Titan
Training: 26.26 secs / 20 iterations (5,120 images).
Testing: 100 secs / validation set (50,000 images).
## NVIDIA K20
Training: 36.0 secs / 20 iterations (5,120 images).
Testing: 133 secs / validation set (50,000 images)
## NVIDIA GTX 770
Training: 33.0 secs / 20 iterations (5,120 images).
Testing: 129 secs / validation set (50,000 images)