DeepSpeed/csrc
Joe Mayer b692cdea47
AIO File Offsets (#6641)
Adding the option for a file offset to the read/write functions of AIO &
GDS ops.

---------

Co-authored-by: jomayeri <deepspeed@H100-VM2.shlnn55tgwve1eacvp21ie45dg.jx.internal.cloudapp.net>
Co-authored-by: Masahiro Tanaka <81312776+tohtana@users.noreply.github.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
2024-11-12 16:34:17 +00:00
..
adagrad Fix Type Name Inconsistency & Typo in cpu_adam (#6732) 2024-11-11 23:31:45 +00:00
adam Fix Type Name Inconsistency & Typo in cpu_adam (#6732) 2024-11-11 23:31:45 +00:00
aio AIO File Offsets (#6641) 2024-11-12 16:34:17 +00:00
cpu [CPU] Allow deepspeed.comm.inference_all_reduce in torch.compile graph (#5604) 2024-07-15 22:24:11 +00:00
deepspeed4science/evoformer_attn Update clang-format version from 16 to 18. (#5839) 2024-08-06 09:14:21 -07:00
fp_quantizer wrap include cuda_bf16.h with ifdef BF16_AVAILABLE (#6520) 2024-09-10 16:08:50 +00:00
gds AIO File Offsets (#6641) 2024-11-12 16:34:17 +00:00
includes Fix Type Name Inconsistency & Typo in cpu_adam (#6732) 2024-11-11 23:31:45 +00:00
lamb Switch from HIP_PLATFORM_HCC to HIP_PLATFORM_AMD (#4539) 2023-10-19 21:01:48 +00:00
lion Fix Type Name Inconsistency & Typo in cpu_adam (#6732) 2024-11-11 23:31:45 +00:00
quantization Fixed the Windows build. (#5596) 2024-05-31 22:11:10 +00:00
random_ltd Rocm warp size fix (#5402) 2024-05-17 20:35:58 +00:00
sparse_attention
spatial Switch from HIP_PLATFORM_HCC to HIP_PLATFORM_AMD (#4539) 2023-10-19 21:01:48 +00:00
transformer [Bug Fix] Support threads_per_head < 64 for wavefront size of 64 (#6622) 2024-11-04 21:51:27 +00:00
utils
xpu [XPU] [DeepNVMe] use same cpu_op_desc_t with cuda (#6645) 2024-10-22 14:45:05 +00:00