Граф коммитов

48 Коммитов

Автор SHA1 Сообщение Дата
David Addison f40ce73e89 NCCL 2.4.6-1
Added detection of IBM/Power NVLink bridge device.
    Add NUMA support to PCI distance calculations.
    Added NCCL_IGNORE_CPU_AFFINITY env var.
    Fix memory leaks; GithubIssue#180
    Compiler warning fix; GithubIssue#178
    Replace non-standard variable length arrays. GithubIssue#171
    Fix Tree+Shared Memory crash. GithubPR#185
    Fix LL cleanup hang during long running DL jobs.
    Fix NCCL_RINGS environment variable handling.
    Added extra checks to catch repeat calls to ncclCommDestroy() GithubIssue#191
    Improve bootstrap socket connection reliability at scale.
    Fix hostname hashing issue. GithubIssue#187
    Code cleanup to rename all non device files from *.cu to *.cc
2019-04-05 13:05:45 -07:00
Sylvain Jeaugey bed43524cc Add install target
Fix issue #145
2018-11-05 09:53:59 -08:00
Sylvain Jeaugey f93fe9bfd9 2.3.5-5
Add support for inter-node communication using sockets and InfiniBand/RoCE.
Improve latency.
Add support for aggregation.
Improve LL/regular tuning.
Remove tests as those are now at github.com/nvidia/nccl-tests .
2018-09-25 14:12:01 -07:00
Sylvain Jeaugey 29a1a916dc Add support for CUDA9 half semantics 2017-06-14 11:20:24 -07:00
Boris Fomitchev 649f04d077 Added Pascal nvcc flags, bumped version 2017-03-24 11:58:14 -07:00
Sylvain Jeaugey 7fef264bfa Bumping version to 1.3.3 2017-03-01 16:44:27 -08:00
Sylvain Jeaugey 34d27771c6 1.3.2 release
Broadcast tuning
Better checking of inputs
Copy/reduce code simplification
2016-12-01 15:17:50 -08:00
Peter Jin 5765d608cc Add a static library target "staticlib" to the Makefile.
Rename the static library "libnccl_static.a" to disambiguate from the
dynamic libraries.
2016-11-24 11:31:03 -08:00
Kyle Fernandes, ne Jacobs 5f2b32e45b Add Fortran bindings 2016-11-17 15:33:34 -08:00
Sylvain Jeaugey 534b9a1697 Bump to 1.3.1 2016-10-13 10:33:05 -07:00
Sylvain Jeaugey ca330b110a Add scan tests 2016-09-22 11:58:33 -07:00
Sylvain Jeaugey cabd6848e4 Heavy code refactoring to remove a lot of code in collectives (~1000 lines).
Have all collectives use the same args, the same ring, and the same primitives for synchronization between threads with the same pattern.
2016-09-22 11:57:56 -07:00
Sylvain Jeaugey 9ee6189bf9 Merge pull request #41 from jia-kai/master
Some minor fixes for compile/usage
2016-09-15 09:45:52 -07:00
Sylvain Jeaugey 75bad643bd Updated LICENCE.txt 2016-08-26 15:08:20 -07:00
jiakai ed401cc29b link library with -lrt; otherwise there is undefined reference to shm_open 2016-08-19 18:58:56 +08:00
Sylvain Jeaugey 7a1aa6b563 Improved Deb generation 2016-07-07 16:31:57 +02:00
Sylvain Jeaugey 9ae84f5d6b Fix version number 2016-06-16 17:07:42 -07:00
Sylvain Jeaugey e51e922924 Add a debug level to NCCL and CUDA versions at init 2016-06-16 17:04:41 -07:00
Sylvain Jeaugey 9fcc523485 Increased version to 1.2.3 2016-06-15 19:18:13 -07:00
Sylvain Jeaugey 67d1ab9106 Packaging : Generate shlibs.local 2016-06-15 19:03:08 -07:00
Sylvain Jeaugey da6d2009e0 Move deb to build directory 2016-06-15 18:20:10 -07:00
Sylvain Jeaugey 155132d336 Fix make install to use BUILDDIR 2016-06-15 18:20:02 -07:00
Sylvain Jeaugey 08ddfe03d2 Rework debian packaging 2016-06-15 18:18:44 -07:00
Boris Fomitchev 177505b757 Gencodes changed to NV recommended 2016-06-06 00:06:18 -07:00
Sylvain Jeaugey 9d9d8cd59f Bump to 1.2.2 2016-06-03 17:21:53 -07:00
Sylvain Jeaugey 1657af1567 Better name for GENCODE 2016-06-03 10:25:37 -07:00
Sylvain Jeaugey 889ad3d4e6 Makefile improvements
- Use standard CXX env var
 - Permit redefinition of more env
 - Separate lib from tests
2016-06-02 15:01:03 -07:00
Boris Fomitchev 0629fb62d7 Merge pull request #21 from borisfom/master
Fixed install location, new .deb version
2016-04-21 14:46:41 -07:00
Boris Fomitchev 0177cf3ea4 Fixed install location, new .deb version 2016-04-21 14:10:31 -07:00
Nathan Luehr 658aca1469 Merge pull request #17 from Hopobcn/master
Enable compilation with specific g++
2016-04-21 13:25:18 -07:00
Nathan Luehr 03df4c7759 Moved no-as-needed flag to link rule.
Avoids link errors for tests linked with nvcc.
2016-04-19 14:51:03 -07:00
Adam Paszke c0c959b1be Add --no-as-needed to make sure that cudart library gets liked 2016-04-13 10:04:38 -04:00
Pau Farré e30bf95989 Enable compilation with old g++ when the default g++ is not supported (+5.0) 2016-04-12 12:49:13 +02:00
Boris Fomitchev e6f4a83da6 Removing Tegra 2016-03-17 17:25:27 -07:00
Boris Fomitchev 62b551798f Use arch=5.3 as well 2016-03-16 23:09:36 -07:00
Boris Fomitchev 92d2123d8d Added compute 5.3 2016-03-16 19:24:48 -07:00
Boris Fomitchev ec3de28ae5 Preparing for pbuild 2016-03-16 19:23:49 -07:00
Boris Fomitchev 941d9da08c Updated package version, added manpage 2016-02-29 12:10:34 -08:00
Nathan Luehr 2758353380 Added NCCL error checking to tests.
Also cleaned up makefile so that tests and lib are not built unnecessarily.

Change-Id: Ia0c596cc2213628de2f066be97615c09bb1bb262
Reviewed-on: http://git-master/r/999627
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-29 11:09:05 -08:00
Sylvain Jeaugey c05312f151 Moved tests to separate dir and improved MPI test
test sources moved to test/ directory.
MPI test displays PASS/FAIL and returns code accordingly.

Change-Id: I058ebd1bd5202d8f38cc9787898b2480100c102b
Reviewed-on: http://git-master/r/936086
Reviewed-by: Przemek Tredak <ptredak@nvidia.com>
Tested-by: Przemek Tredak <ptredak@nvidia.com>
2016-01-28 12:56:36 -08:00
Nathan Luehr 90af7c73ef Merge pull request #6 from lukeyeager/deb
Deb packaging
2016-01-07 13:06:28 -08:00
Luke Yeager c9da89254b Update deb packaging scripts 2015-12-18 14:23:34 -08:00
Nathan Luehr eb2d869f71 Merge pull request #5 from lukeyeager/tests-nvml
Don't link tests with NVML
2015-12-18 13:36:20 -08:00
Boris Fomitchev b5400c54df Don't link tests with NVML 2015-12-18 13:27:55 -08:00
Nathan Luehr a4de6016f8 Merge pull request #4 from lukeyeager/build-sm50
Build SM 5.0 code
2015-12-18 13:23:48 -08:00
Luke Yeager dd0884b707 Build SM 5.0 code 2015-12-18 13:19:50 -08:00
Luke Yeager e1634ca6cb Use semantic versioning 2015-12-18 12:02:17 -08:00
Nathan Luehr 0673d5f44f Initial release. 2015-11-17 11:30:40 -08:00