DeepSpeed

Граф коммитов

Автор	SHA1	Сообщение	Дата
Logan Adams	297a6840e1	Update clang-format version from 16 to 18. (#5839 ) We used a slightly old version of clang-format before, this caused issues when folks installed the latest via apt or similar rather than python to try and fix their formatting issues. Plus installing older versions is a pain and the formatting style of the newer version seems better?	2024-08-06 09:14:21 -07:00
baodi	e39229676c	update xpu fusedadam opbuilder for pytorch 2.3 (#5702 ) update the way to get queue for FusedAdam OpBuilder. --------- Signed-off-by: baodii <di.bao@intel.com> Co-authored-by: Logan Adams <loadams@microsoft.com> Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>	2024-07-01 12:34:11 -07:00
Liangliang-Ma	4b8a4a0729	Change source of CPUAdam for xpu accelerator (#5703 ) Noted that cpu adam for cuda/cpu accelerator has removed the dependency of CUDA, we can now use the same source.	2024-06-28 12:50:36 -07:00
Liangliang-Ma	11a62a0635	Add Compressedbackend for Onebit optimizers (#5473 ) In the process of adding onebit optimizers support for XPU devices, we have noticed that for different accelerator, the main difference of implementation of `compressed_allreduce` lies on `packbits` and `unpackbits`. CUDA uses cupy and NPU uses torch_npu. Instead of replace these to xpu only functions, we provided a CompressedBackend to do the `compressed_allreduce` work where users can add their own packbits/unpackbits kernels, which is a general path for all kinds of accelerators. In this PR, we: 1. Add CompressedBackend for onebitAdam, onebitLamb and zerooneAdam 2. Add XPU implement of packbits/unpackbits with SYCL, built in PackbitsBuilder 3. Add tests for onebit with CompressedBackend --------- Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>	2024-06-05 20:28:46 +00:00
Ma, Guokai	f4f31317ed	[XPU] XPU accelerator support for Intel GPU device (#4547 ) This PR includes XPU support for Intel GPU. With this PR, DeepSpeed can support XPU devices without install Intel Extension for DeepSpeed. --------- Co-authored-by: Liangliang-Ma <1906710196@qq.com> Co-authored-by: baodi <di.bao@intel.com> Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com> Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by: Yizhou Wang <yizhou.wang@intel.com> Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>	2024-01-05 12:29:07 -08:00

5 Коммитов