зеркало из https://github.com/microsoft/DeepSpeed.git
69af361167
Hi. Please review the following changes I added support for BF16 to cpu adam. BF16, FP16 and float are supported at compilation time. the correct template is called at runtime according to input params dtype. --------- Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> |
||
---|---|---|
.. | ||
cpu_lion.cpp | ||
cpu_lion_impl.cpp | ||
fused_lion_frontend.cpp | ||
multi_tensor_apply.cuh | ||
multi_tensor_lion.cu |