..
__init__.py
Add API for updating ZeRO gradients ( #6590 )
2024-10-14 17:35:41 +00:00
bwc.py
Support MoE for pipeline models ( #5338 )
2024-04-08 15:35:53 +00:00
comms_logging.py
fix comm logging for inference ( #4043 )
2023-07-27 17:52:05 +00:00
config.py
Add throughput timer configuration ( #5363 )
2024-05-22 20:28:02 +00:00
debug.py
Remove hooks on gradient accumulation on engine/optimizer destroy ( #4858 )
2023-12-30 02:48:49 +00:00
exceptions.py
Update DeepSpeed copyright license to Apache 2.0 ( #3111 )
2023-03-30 17:14:38 -07:00
groups.py
sequence parallel for uneven heads ( #6392 )
2024-10-25 18:26:47 +00:00
init_on_device.py
Update DeepSpeed copyright license to Apache 2.0 ( #3111 )
2023-03-30 17:14:38 -07:00
logging.py
Add fallback for is_compiling ( #6663 )
2024-10-25 20:47:22 +00:00
mixed_precision_linkage.py
Add API for updating ZeRO gradients ( #6590 )
2024-10-14 17:35:41 +00:00
numa.py
Safe usage of popen ( #6490 )
2024-09-04 21:06:04 +00:00
nvtx.py
Disable nvtx decorator to avoid graph break ( #5697 )
2024-06-26 22:30:56 +00:00
tensor_fragment.py
Add API for updating ZeRO gradients ( #6590 )
2024-10-14 17:35:41 +00:00
timer.py
Improve DS logging control ( #6602 )
2024-10-08 18:38:51 +00:00
torch.py
Cleanup required_torch_version code and references. ( #5370 )
2024-04-10 15:39:24 +00:00
types.py
Hybrid Engine Refactor and Llama Inference Support ( #3425 )
2023-05-03 17:20:07 -07:00
z3_leaf_module.py
Delay reduce-scatter for ZeRO3 leaf modules ( #5008 )
2024-02-02 13:44:08 -08:00
zero_to_fp32.py
Support safetensors export ( #6579 )
2024-10-15 11:22:31 +00:00