David Addison
f40ce73e89
NCCL 2.4.6-1
...
Added detection of IBM/Power NVLink bridge device.
Add NUMA support to PCI distance calculations.
Added NCCL_IGNORE_CPU_AFFINITY env var.
Fix memory leaks; GithubIssue#180
Compiler warning fix; GithubIssue#178
Replace non-standard variable length arrays. GithubIssue#171
Fix Tree+Shared Memory crash. GithubPR#185
Fix LL cleanup hang during long running DL jobs.
Fix NCCL_RINGS environment variable handling.
Added extra checks to catch repeat calls to ncclCommDestroy(). GithubIssue#191
Improve bootstrap socket connection reliability at scale.
Fix hostname hashing issue. GithubIssue#187
Code cleanup to rename all non device files from *.cu to *.cc
2019-04-05 13:05:45 -07:00
Sylvain Jeaugey
bed43524cc
Add install target
...
Fix issue #145
2018-11-05 09:53:59 -08:00
Sylvain Jeaugey
f93fe9bfd9
2.3.5-5
...
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests.
2018-09-25 14:12:01 -07:00
Sylvain Jeaugey
29a1a916dc
Add support for CUDA9 half semantics
2017-06-14 11:20:24 -07:00
Boris Fomitchev
649f04d077
Added Pascal nvcc flags, bumped version
2017-03-24 11:58:14 -07:00
Sylvain Jeaugey
7fef264bfa
Bumping version to 1.3.3
2017-03-01 16:44:27 -08:00
Sylvain Jeaugey
34d27771c6
1.3.2 release
...
Broadcast tuning
Better checking of inputs
Copy/reduce code simplification
2016-12-01 15:17:50 -08:00
Peter Jin
5765d608cc
Add a static library target "staticlib" to the Makefile.
...
Rename the static library "libnccl_static.a" to disambiguate from the
dynamic libraries.
2016-11-24 11:31:03 -08:00
Kyle Fernandes, ne Jacobs
5f2b32e45b
Add Fortran bindings
2016-11-17 15:33:34 -08:00
Sylvain Jeaugey
534b9a1697
Bump to 1.3.1
2016-10-13 10:33:05 -07:00
Sylvain Jeaugey
ca330b110a
Add scan tests
2016-09-22 11:58:33 -07:00
Sylvain Jeaugey
cabd6848e4
Heavy code refactoring to remove a lot of code in collectives (~1000 lines).
...
Have all collectives use the same args, the same ring, and the same primitives for synchronization between threads with the same pattern.
2016-09-22 11:57:56 -07:00
Sylvain Jeaugey
9ee6189bf9
Merge pull request #41 from jia-kai/master
...
Some minor fixes for compile/usage
2016-09-15 09:45:52 -07:00
Sylvain Jeaugey
75bad643bd
Updated LICENCE.txt
2016-08-26 15:08:20 -07:00
jiakai
ed401cc29b
link library with -lrt; otherwise there is undefined reference to shm_open
2016-08-19 18:58:56 +08:00
Sylvain Jeaugey
7a1aa6b563
Improved Deb generation
2016-07-07 16:31:57 +02:00
Sylvain Jeaugey
9ae84f5d6b
Fix version number
2016-06-16 17:07:42 -07:00
Sylvain Jeaugey
e51e922924
Add a debug level to NCCL and CUDA versions at init
2016-06-16 17:04:41 -07:00
Sylvain Jeaugey
9fcc523485
Increased version to 1.2.3
2016-06-15 19:18:13 -07:00
Sylvain Jeaugey
67d1ab9106
Packaging : Generate shlibs.local
2016-06-15 19:03:08 -07:00
Sylvain Jeaugey
da6d2009e0
Move deb to build directory
2016-06-15 18:20:10 -07:00
Sylvain Jeaugey
155132d336
Fix make install to use BUILDDIR
2016-06-15 18:20:02 -07:00
Sylvain Jeaugey
08ddfe03d2
Rework debian packaging
2016-06-15 18:18:44 -07:00
Boris Fomitchev
177505b757
Gencodes changed to NV recommended
2016-06-06 00:06:18 -07:00
Sylvain Jeaugey
9d9d8cd59f
Bump to 1.2.2
2016-06-03 17:21:53 -07:00
Sylvain Jeaugey
1657af1567
Better name for GENCODE
2016-06-03 10:25:37 -07:00
Sylvain Jeaugey
889ad3d4e6
Makefile improvements
...
- Use standard CXX env var
- Permit redefinition of more env
- Separate lib from tests
2016-06-02 15:01:03 -07:00
Boris Fomitchev
0629fb62d7
Merge pull request #21 from borisfom/master
...
Fixed install location, new .deb version
2016-04-21 14:46:41 -07:00
Boris Fomitchev
0177cf3ea4
Fixed install location, new .deb version
2016-04-21 14:10:31 -07:00
Nathan Luehr
658aca1469
Merge pull request #17 from Hopobcn/master
...
Enable compilation with specific g++
2016-04-21 13:25:18 -07:00
Nathan Luehr
03df4c7759
Moved no-as-needed flag to link rule.
...
Avoids link errors for tests linked with nvcc.
2016-04-19 14:51:03 -07:00
Adam Paszke
c0c959b1be
Add --no-as-needed to make sure that cudart library gets linked
2016-04-13 10:04:38 -04:00
Pau Farré
e30bf95989
Enable compilation with old g++ when the default g++ is not supported (+5.0)
2016-04-12 12:49:13 +02:00
Boris Fomitchev
e6f4a83da6
Removing Tegra
2016-03-17 17:25:27 -07:00
Boris Fomitchev
62b551798f
Use arch=5.3 as well
2016-03-16 23:09:36 -07:00
Boris Fomitchev
92d2123d8d
Added compute 5.3
2016-03-16 19:24:48 -07:00
Boris Fomitchev
ec3de28ae5
Preparing for pbuild
2016-03-16 19:23:49 -07:00
Boris Fomitchev
941d9da08c
Updated package version, added manpage
2016-02-29 12:10:34 -08:00
Nathan Luehr
2758353380
Added NCCL error checking to tests.
...
Also cleaned up makefile so that tests and lib are not built unnecessarily.
Change-Id: Ia0c596cc2213628de2f066be97615c09bb1bb262
Reviewed-on: http://git-master/r/999627
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-29 11:09:05 -08:00
Sylvain Jeaugey
c05312f151
Moved tests to separate dir and improved MPI test
...
test sources moved to test/ directory.
MPI test displays PASS/FAIL and returns code accordingly.
Change-Id: I058ebd1bd5202d8f38cc9787898b2480100c102b
Reviewed-on: http://git-master/r/936086
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-28 12:56:36 -08:00
Nathan Luehr
90af7c73ef
Merge pull request #6 from lukeyeager/deb
...
Deb packaging
2016-01-07 13:06:28 -08:00
Luke Yeager
c9da89254b
Update deb packaging scripts
2015-12-18 14:23:34 -08:00
Nathan Luehr
eb2d869f71
Merge pull request #5 from lukeyeager/tests-nvml
...
Don't link tests with NVML
2015-12-18 13:36:20 -08:00
Boris Fomitchev
b5400c54df
Don't link tests with NVML
2015-12-18 13:27:55 -08:00
Nathan Luehr
a4de6016f8
Merge pull request #4 from lukeyeager/build-sm50
...
Build SM 5.0 code
2015-12-18 13:23:48 -08:00
Luke Yeager
dd0884b707
Build SM 5.0 code
2015-12-18 13:19:50 -08:00
Luke Yeager
e1634ca6cb
Use semantic versioning
2015-12-18 12:02:17 -08:00
Nathan Luehr
0673d5f44f
Initial release.
2015-11-17 11:30:40 -08:00