autotuning | fix: change ==NONE to is (#3923) | 2023-07-11 16:56:43 +00:00
checkpoint | Universal Checkpoint for Sequence Parallelism (#4752) | 2023-12-15 10:22:39 -08:00
comm | Fix exception handling in get_all_ranks_from_group() function (#4862) | 2023-12-22 11:48:48 -08:00
compression | fix: Remove duplicate word the (#4051) | 2023-07-27 09:33:13 -07:00
elasticity | Re-enable elastic training for torch 2+ (#4010) | 2023-07-21 18:21:35 +00:00
inference | Add support of Microsoft Phi-2 model to DeepSpeed-FastGen (#4812) | 2024-01-03 13:27:06 -08:00
launcher | [NPU] load EXPORT_ENV based on different accelerators to support multi-node training on other devices (#4830) | 2023-12-19 03:09:10 +00:00
model_implementations | Capture short kernel sequences to graph (#4318) | 2023-12-20 20:51:36 +00:00
module_inject | enable starcode((kv_head=1)) autotp (#4896) | 2024-01-05 20:54:15 +00:00
moe | Communication Optimization for Large-Scale Training (#4695) | 2023-11-21 14:30:43 -08:00
monitor | Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) | 2023-10-09 11:59:30 -07:00
nebula | fix typo with deepspeed/ (#3547) | 2023-06-02 00:47:14 +00:00
ops | use all_gather_into_tensor instead of all_gather (#4705) | 2023-11-28 17:41:46 +00:00
pipe | Update DeepSpeed copyright license to Apache 2.0 (#3111) | 2023-03-30 17:14:38 -07:00
profiling | Update flops profiler to recurse (#4374) | 2023-11-21 18:48:58 +00:00
runtime | Support cpu tensors without direct device invocation (#3842) | 2024-01-05 13:36:17 +00:00
sequence | Communication Optimization for Large-Scale Training (#4695) | 2023-11-21 14:30:43 -08:00
utils | zero_to_fp32.py: Handle a case where shape doesn't have numel attr (#4842) | 2024-01-03 19:45:06 +00:00
__init__.py | add type checker ignore to resolve that pylance can't resolved noqa annotation (#4102) | 2023-08-08 13:30:23 +00:00
accelerator | Abstract accelerator (step 2) (#2560) | 2023-01-06 23:40:58 -05:00
constants.py | Allow env var for timeout (#4405) | 2023-10-10 08:56:10 -07:00
env_report.py | [CANN] Support cpu offload optimizer for Ascend NPU (#4568) | 2023-11-14 13:37:16 +00:00
git_version_info.py | add type checker ignore to resolve that pylance can't resolved noqa annotation (#4102) | 2023-08-08 13:30:23 +00:00
pydantic_v1.py | Introduce pydantic_v1 compatibility module for pydantic>=2.0.0 support (#4407) | 2023-10-09 11:59:30 -07:00