DeepSpeed/op_builder
Abhishek Kulkarni 1baf68840f
Update minor CUDA version compatibility (#5591)
Add CUDA versions 12.4 and 12.5 to the list
2024-05-31 16:47:35 +00:00
..
cpu CPUAdam fp16 and bf16 support (#5409) 2024-05-20 12:50:20 +00:00
hpu CPUAdam fp16 and bf16 support (#5409) 2024-05-20 12:50:20 +00:00
npu [NPU] Add NPU to support hybrid engine (#4831) 2024-02-01 14:02:27 -08:00
xpu Make op builder detection adapt to accelerator change (#5206) 2024-03-12 20:48:29 +00:00
__init__.py add type checker ignore to resolve that pylance can't resolved noqa annotation (#4102) 2023-08-08 13:30:23 +00:00
all_ops.py Make op builder detection adapt to accelerator change (#5206) 2024-03-12 20:48:29 +00:00
async_io.py move torch import (#4468) 2023-10-09 01:33:17 +00:00
builder.py Update minor CUDA version compatibility (#5591) 2024-05-31 16:47:35 +00:00
cpu_adagrad.py CPUAdam fp16 and bf16 support (#5409) 2024-05-20 12:50:20 +00:00
cpu_adam.py CPUAdam fp16 and bf16 support (#5409) 2024-05-20 12:50:20 +00:00
cpu_lion.py CPUAdam fp16 and bf16 support (#5409) 2024-05-20 12:50:20 +00:00
evoformer_attn.py fix multiple definition while building evoformer (#4556) 2023-10-26 21:48:07 +00:00
fp_quantizer.py Rename files in fp_quantize op from quantize.* to fp_quantize.* (#5577) 2024-05-28 20:01:48 +00:00
fused_adam.py Update DeepSpeed copyright license to Apache 2.0 (#3111) 2023-03-30 17:14:38 -07:00
fused_lamb.py Update DeepSpeed copyright license to Apache 2.0 (#3111) 2023-03-30 17:14:38 -07:00
fused_lion.py feat: add Lion optimizer (#4331) 2023-10-05 22:32:14 +00:00
inference_core_ops.py Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333) 2024-04-15 17:23:28 +00:00
inference_cutlass_builder.py Isolate src code and testing for DeepSpeed-FastGen (#4610) 2023-11-07 12:03:44 -06:00
quantizer.py Enable quantizer op on ROCm (#4114) 2024-01-10 23:48:54 +00:00
ragged_ops.py Use ninja to speed up build (#5088) 2024-02-21 02:20:11 +00:00
ragged_utils.py Isolate src code and testing for DeepSpeed-FastGen (#4610) 2023-11-07 12:03:44 -06:00
random_ltd.py ROCm 6.0 prep changes (#4537) 2023-10-20 19:05:54 +00:00
sparse_attn.py Update torch version check in building sparse_attn (#3152) 2023-04-14 16:24:39 +00:00
spatial_inference.py Update DeepSpeed copyright license to Apache 2.0 (#3111) 2023-03-30 17:14:38 -07:00
stochastic_transformer.py Update DeepSpeed copyright license to Apache 2.0 (#3111) 2023-03-30 17:14:38 -07:00
transformer.py ROCm 6.0 prep changes (#4537) 2023-10-20 19:05:54 +00:00
transformer_inference.py Hybrid Engine Refactor and Llama Inference Support (#3425) 2023-05-03 17:20:07 -07:00