DeepSpeed/docs/_pages
Guanhua Wang b1cb0dfc46
Guanhua/partial offload rebase v2 (#590) (#4636)
This PR introduces Twin-Flow feature of ZeRO-Offload++, which improves
e2e training iteration time by up to 6x on DGX-H100s.

 This PR includes:

* Twin-Flow implementation inside ZeRO optimizer
* json config tutorial
* example using deepspeed
* unit tests


cc @jeffra @awan-10 @tjruwase @mrwyattii

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-11-06 14:15:16 -08:00
..
compression.md [docs] website refresh (#2123) 2022-07-21 16:56:17 -07:00
config-json.md Guanhua/partial offload rebase v2 (#590) (#4636) 2023-11-06 14:15:16 -08:00
deepspeed4science.md add DeepSpeed4Science white paper (#4502) 2023-10-11 15:36:06 -07:00
inference.md [docs] website refresh (#2123) 2022-07-21 16:56:17 -07:00
posts-landing.md Website posts and tutorial improvements (#1799) 2022-03-11 15:00:32 -08:00
posts_list_landing.md Website posts and tutorial improvements (#1799) 2022-03-11 15:00:32 -08:00
training.md Typo Correction (#3621) 2023-05-31 11:00:57 -07:00
tutorials-landing.md Website posts and tutorial improvements (#1799) 2022-03-11 15:00:32 -08:00