Olli Saarikivi
1306d086b5
Merge pull request #6 from parasailteam/olli/interpreter-template
Extract shared parts of interpreter code into SCKLFunction template class.
2021-04-09 13:01:29 -07:00
Olli Saarikivi
9327d76850
Extract shared parts of interpreter code
Simple, LL and LL128 specify the differing parts with wrappers for the
*Primitives structs.
A SCKLFunction class template contains most of the interpreter logic.
2021-04-09 02:52:41 -07:00
Saeed Maleki
252a9773a1
has_dependence
2021-04-09 00:56:21 +00:00
Saeed Maleki
d83e777ac3
all protocols work
2021-04-08 23:37:17 +00:00
Saeed Maleki
007fc390c7
added sckl_interpreter
2021-04-08 06:20:41 +00:00
Saeed Maleki
1da0956a4a
works
2021-04-08 06:01:36 +00:00
Saeed Maleki
8d8bc3c201
more progress
2021-04-07 21:04:43 +00:00
Saeed Maleki
0d1af8e26c
added allgather support
2021-04-07 18:30:47 +00:00
Saeed Maleki
308b5abb88
not complete
2021-04-06 21:35:02 +00:00
Saeed Maleki
e785648283
Merge pull request #5 from parasailteam/channel_management
Channel management
2021-04-06 10:52:48 -07:00
Saeed Maleki
5026d295e4
works across node as well
2021-04-06 06:04:14 +00:00
Saeed Maleki
2a7caa2bd0
works within a node
2021-04-05 23:50:48 +00:00
Saeed Maleki
dea2ed3c1f
bug fixes
2021-04-05 21:04:31 +00:00
Saeed Maleki
1968e78ce4
partially working
2021-04-05 20:24:09 +00:00
Saeed Maleki
4788b47b4d
debugging
2021-04-05 18:31:39 +00:00
Saeed Maleki
90f3c3cf0d
fixed a bug regarding rid
2021-04-03 03:27:55 +00:00
Saeed Maleki
d01feb2275
compilable
2021-04-03 03:03:37 +00:00
Saeed Maleki
d3dcbe7495
spacing fixes
2021-04-02 17:33:26 +00:00
Saeed Maleki
7e872304a1
Merge pull request #4 from parasailteam/cross_threadblocks
Cross threadblocks
2021-04-01 22:52:12 -07:00
Saeed Maleki
295c627e7f
added a fence before the flag write
2021-04-02 05:48:58 +00:00
Saeed Maleki
f2cf4f2858
fixed Madan's comment regarding 3-tuple. also added a guard to run sckl async only when all are sckl algorithms
2021-04-02 02:51:34 +00:00
Saeed Maleki
c29eca1568
added padding for sckl flags
2021-04-01 19:19:36 +00:00
Saeed Maleki
2cb6511754
added a check for NET_SHARED_BUFFERS
2021-04-01 18:59:09 +00:00
Saeed Maleki
3d60bd12d8
compilable version
2021-04-01 04:07:01 +00:00
Saeed Maleki
b425f1cd91
Merge pull request #3 from parasailteam/across_node_setup
Across node setup
2021-03-31 18:00:08 -07:00
Saeed Maleki
80ba01581c
fixed spacing issues
2021-04-01 00:59:25 +00:00
Saeed Maleki
7af77c04b8
done on 64 gpus
2021-03-31 01:17:19 +00:00
Saeed Maleki
f7837e5614
works
/msrhyper-weka/saemal/mpi/bin/mpirun -np 16 -N 8 -hostfile /job/hostfile --tag-output -x NCCL_NET_SHARED_BUFFERS=0 -x NCCL_MIN_NCHANNELS=8 -x NCCL_MAX_NCHANNELS=8 -x NCCL_DEBUG=INFO -x SCKL_XML_FILE=/msrhyper-weka/saemal/test2.xml -x NCCL_ALGO=SCKL -x NCCL_PROTO=Simple -x LD_LIBRARY_PATH=/msrhyper-weka/saemal/nccl-master/build/lib/ /msrhyper-weka/saemal/nccl-tests/build/alltoall_perf -w 10 -n 10 -b 1024 -e 1048576000 -f 2 -c 1 -g 1 -z 0
2021-03-30 23:19:51 +00:00
Saeed Maleki
944b1f2880
current state
2021-03-30 02:01:48 +00:00
Saeed Maleki
21d2e89564
fixed a bug regarding across node connection setup
2021-03-29 21:11:47 +00:00
Saeed Maleki
9a7af96ee3
a printf for NCCLCHECK
2021-03-27 00:59:54 +00:00
Saeed Maleki
b3dc643313
fixed a bug
2021-03-26 05:27:35 +00:00
Saeed Maleki
dabb6570e8
testing
2021-03-26 03:42:19 +00:00
Saeed Maleki
3694d84747
for all-to-all we need to copy directly from sendbuffer to recvbuffer for this rank's own portion
2021-03-25 20:53:54 +00:00
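The commit message above describes the self-copy step of an all-to-all: the chunk a rank addresses to itself never crosses a link and is copied straight from its send buffer into its receive buffer. A minimal sketch of that idea follows, simulated over in-memory Python lists. The function name `alltoall` and the list-of-lists buffer layout are assumptions made for illustration only; this is not the code from the commit.

```python
def alltoall(send_buffers):
    """Simulate an all-to-all exchange over in-memory lists (illustrative only).

    send_buffers[r][d] is the chunk that rank r addresses to rank d.
    Returns recv_buffers, where recv_buffers[d][r] is the chunk rank d
    received from rank r.
    """
    nranks = len(send_buffers)
    recv_buffers = [[None] * nranks for _ in range(nranks)]

    for rank in range(nranks):
        # Own portion: copy directly from the send buffer to the receive
        # buffer; no communication is involved for this chunk.
        recv_buffers[rank][rank] = send_buffers[rank][rank]

        for peer in range(nranks):
            if peer == rank:
                continue
            # Stand-in for a real send to `peer` (hypothetical; a real
            # implementation would go through a transport here).
            recv_buffers[peer][rank] = send_buffers[rank][peer]

    return recv_buffers


if __name__ == "__main__":
    send = [[f"{r}->{d}" for d in range(3)] for r in range(3)]
    for rank, row in enumerate(alltoall(send)):
        print(rank, row)
```

Without the explicit self-copy line, slot `recv_buffers[rank][rank]` would stay empty, since the peer loop skips the rank's own index.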
Saeed Maleki
7f55e284f3
done! it works
2021-03-25 08:01:51 +00:00
Saeed Maleki
36194ff2e4
working version within a node
2021-03-25 06:55:30 +00:00
Saeed Maleki
b3dcc0d081
works within a node
2021-03-25 06:13:14 +00:00
Saeed Maleki
b63029c21b
deadlocks
2021-03-24 23:12:32 +00:00
Saeed Maleki
0d51c43749
more progress
2021-03-24 18:20:30 +00:00
Saeed Maleki
781bb7b302
ok version
2021-03-24 00:34:32 +00:00
Saeed Maleki
0959ebafc1
ok version
2021-03-24 00:33:15 +00:00
Saeed Maleki
99ad39b01b
more progress
2021-03-23 23:23:36 +00:00
Saeed Maleki
08d3ea9909
not compilable
2021-03-23 06:48:26 +00:00
Saeed Maleki
62169be637
done with intra communication
2021-03-19 00:05:49 +00:00
Saeed Maleki
9581e3d1c3
finished developing
2021-03-18 22:50:56 +00:00
Saeed Maleki
102a204327
Merge pull request #2 from parasailteam/connection_setup
Connection setup
2021-03-17 15:31:21 -07:00
Saeed Maleki
d12fca78a9
added one more comment
2021-03-17 22:30:54 +00:00
Saeed Maleki
e276503b26
adding a new line
2021-03-17 22:24:06 +00:00
Saeed Maleki
b69b341fca
one more naming change
2021-03-17 22:21:34 +00:00
Saeed Maleki
ee8e4c9d12
added some guards for corner cases
2021-03-17 19:37:57 +00:00