Jeff Rasley
|
e5bbc2e559
|
Sparse attn + ops/runtime refactor + v0.3.0 (#343)
Co-authored-by: Arash Ashari <arashari@microsoft.com>
|
2020-09-01 18:06:15 -07:00 |
Jeff Rasley
|
e8dd47df26
|
Update .gitignore
|
2020-08-31 21:15:50 -07:00 |
Jeff Rasley
|
f2ac7eafd5
|
ZeRO-2 (#217)
Updates for ZeRO stage 2 + ZeRO stage 1 w. RS
Co-authored-by: Tunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: yuxionghe <yuxhe@microsoft.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
|
2020-05-19 01:00:53 -07:00 |
Shaden Smith
|
dd166ee6b6
|
README and RTD improvements. (#198)
|
2020-04-21 22:18:47 -07:00 |
Shaden Smith
|
a76572dc7c
|
Adding static loss scaling for ZeRO. (#166)
|
2020-03-25 09:34:27 -07:00 |
Shaden Smith
|
5042dc0085
|
drafting Jekyll webpage (#143)
|
2020-03-17 13:49:48 -07:00 |
Shaden Smith
|
010f6dc0cf
|
Updating .gitignore (#55)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
|
2020-02-10 07:07:37 -08:00 |
Jeff Rasley
|
e63b6b0113
|
add gitignore
|
2020-01-31 16:06:44 -08:00 |