DeepSpeed/csrc
Reza Yazdani fd2f970bdf
Transformer-kernel - supporting any arbitrary sequence-length (#587)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-12-17 10:13:54 -08:00
Name              Last commit                                                           Date
adam              tracking optimizer step in cpu-adam when loading checkpoint (#564)    2020-12-01 15:11:38 -08:00
includes          tracking optimizer step in cpu-adam when loading checkpoint (#564)    2020-12-01 15:11:38 -08:00
lamb              Transformer kernel release (#242)                                     2020-05-29 13:15:36 -07:00
sparse_attention  Sparse attn + ops/runtime refactor + v0.3.0 (#343)                    2020-09-01 18:06:15 -07:00
transformer       Transformer-kernel - supporting any arbitrary sequence-length (#587)  2020-12-17 10:13:54 -08:00
utils             DeepSpeed JIT op + PyPI support (#496)                                2020-11-12 11:51:38 -08:00