Samyam Rajbhandari
0ad4fd880b
Update zero.md tutorial ( #495 )
...
* Update zero.md
Update to ZeRO tutorial to specify the use of activation checkpointing
* Update zero-offload.md
Use activation checkpointing with ZeRO-Offload
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-11-11 15:29:02 -08:00
Olatunji Ruwase
be1147c08a
PLD release ( #513 )
...
* Progressive layer dropping docs (#499 )
* test
* Adding tutorial and news page for pld
* updating the tutorial and posts of PLD
* update the finetune tutorial
* Update PLD tutorial (#512 )
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link
Tweak scheduler example
* Fix date link
* Add DeepSpeed_Adam
* Add PLD tutorial to navigation
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
* updating the pld docs
* DeepSpeed implementation of PLD (#508 )
* DeepSpeed implementation of PLD
* Format fixes
* Formatting fixes
* Fix broken url
* Address PR feedback
* Bump DSE
Co-authored-by: Minjia Zhang <33713995+minjiaz@users.noreply.github.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Minjia Zhang <minjiaz@microsoft.com>
2020-11-10 12:53:50 -08:00
Minjia Zhang
e082d4752e
updating pld docs ( #517 )
2020-11-10 06:33:35 -08:00
Olatunji Ruwase
41fb24b3f0
Fix PLD news url ( #515 )
...
* PLD documentation
* Formatting fixes
* Fix url bug
2020-11-09 12:28:14 -08:00
Olatunji Ruwase
e351090c6c
PLD documentation ( #514 )
...
* PLD documentation
* Formatting fixes
2020-11-09 12:20:02 -08:00
Reza Yazdani
f5aa2547d8
Add CPUAdam optimizer for zero-offload in deepspeed engine ( #484 )
...
* add adamW to CPU-ADAM implementation
* supporting cpu-adam optimizer for zero-offload on deepspeed side
* bump DSE to match cpu-adam updates
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-10-30 09:01:04 -07:00
Shaden Smith
d720fdb685
updating website dependencies ( #475 )
2020-10-19 16:11:22 -07:00
Reza Yazdani
e25f2a23e0
fixing typo ( #460 )
2020-10-12 09:36:18 -07:00
Olatunji Ruwase
23fc48f320
Add DeepSpeed_Adam optimizer ( #468 )
...
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link
Tweak scheduler example
* Fix date link
* Add DeepSpeed_Adam
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-10-10 09:48:42 -07:00
niumanar
2efea69446
gan tutorial ( #462 )
...
* gan tutorial
* formatting fix
* adding pointer to repo; adding navigation link
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-10-06 22:17:34 -07:00
Shaden Smith
6d176c45fa
link fix part two :-) ( #441 )
2020-09-25 07:48:10 -07:00
Haibin Lin
0ca82156e1
Update pipeline.md ( #439 )
2020-09-24 21:57:15 -07:00
Conglong Li
192cf7c8e2
Update azure.md ( #437 )
2020-09-24 15:28:06 -07:00
Conglong Li
5d40f006bb
Fix urls in tutorial ( #436 )
...
* url fix
* revert absolute path but keep some actual fix
* add real readme
2020-09-24 14:10:56 -07:00
Gowtham Prudhvi
c66f388156
Fix few typos in the docs ( #418 )
2020-09-17 06:40:05 -07:00
Shaden Smith
5812e84544
readthedocs yaml configuration ( #410 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-16 18:57:43 -07:00
Olatunji Ruwase
7d91be9765
Minor doc fixes ( #417 )
...
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link
Tweak scheduler example
* Fix date link
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-16 13:27:40 -07:00
Shaden Smith
c82756cd15
readthedocs upgrade ( #402 )
2020-09-10 15:44:47 -07:00
Olatunji Ruwase
d15015e969
Update ZeRO-Offload blog post link ( #401 )
...
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 13:25:29 -07:00
Jeff Rasley
5dc4d6c72f
Update news site with press release link
2020-09-10 13:10:34 -07:00
Jeff Rasley
ea92ed29bb
Update _config.yml
2020-09-10 13:07:08 -07:00
Jeff Rasley
4b1df25ae9
bump DSE and doc tweak
2020-09-10 19:32:05 +00:00
Shaden Smith
6bb5c69f48
Website edits ( #398 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 09:18:53 -07:00
Jeff Rasley
a8a8b3d288
Landing page updates ( #395 )
...
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
2020-09-10 02:04:17 -07:00
Arash Ashari
c76769c4ff
Adding sparse attention news index item ( #376 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 01:49:42 -07:00
Minjia Zhang
59ce90d0d4
Minjiaz/zero offload ( #382 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 01:46:06 -07:00
Arash Ashari
be4b94be26
Sparse attention: updating code tag in documentation ( #394 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 00:39:47 -07:00
Olatunji Ruwase
2dea61f285
ZeRO tutorials ( #384 )
...
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 00:22:55 -07:00
Ammar Ahmad Awan
093f09ff27
Update documentation for 1-bit Adam ( #388 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-10 00:05:07 -07:00
Shaden Smith
65c2f974d8
Pipeline parallel training engine. ( #392 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-09 23:14:55 -07:00
Jeff Rasley
41db1c2f03
ZeRO-Offload release ( #391 )
...
* ZeRO-Offload (squash) (#381 )
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Jie <37380896+jren73@users.noreply.github.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com>
Co-authored-by: RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
2020-09-09 17:14:12 -07:00
Arash Ashari
161e8e60c6
fixing a link issue with SA tutorial ( #387 )
...
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2020-09-09 15:04:17 -07:00
Ammar Ahmad Awan
01726ce2b8
Add 1-bit Adam support to DeepSpeed ( #380 )
...
* 1-bit adam (#353 )
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Your Name <you@example.com>
Co-authored-by: tanghl1994 <htang14@ur.rochester.edu>
Co-authored-by: Hank <tanghl1994@gmail.com>
Co-authored-by: root <root@node2x12b.cs.rochester.edu>
Co-authored-by: Ammar Ahmad Awan <awan.ammar@microsoft.com>
2020-09-09 14:37:37 -07:00
Arash Ashari
b73894dedc
adding sparse attention to feature index page ( #377 )
2020-09-08 19:15:44 -07:00
Arash Ashari
9dadf38dd6
Update Sparse Attention Tutorial ( #357 )
...
* adding BingSqaud e2e test
* updating the draft test; bring final step under try section
* finalizinf test for base deepspeed and deepspeed with ZeRO
* applying the comment (thanks Jeff); fixed formatting
* update Sparse Attention Tutorial
* fixed few issues and applied comments for better organization and readability
* updated sparse attention tutorial with making how to use section incremental; applying more comments
Co-authored-by: arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com>
2020-09-06 11:20:48 -07:00
Olatunji Ruwase
9e83ef21ea
Update installation instructions ( #362 )
2020-09-06 08:59:48 -07:00
Shaden Smith
ac12833ea7
Jekyll installation instructions ( #351 )
2020-09-03 23:35:03 -07:00
Arash Ashari
6deac82ca6
Adding link to Sparse Attention in Navigation page ( #355 )
...
* adding link to Sparse Attention in Navigation page
2020-09-03 15:01:23 -07:00
Jeff Rasley
e5bbc2e559
Sparse attn + ops/runtime refactor + v0.3.0 ( #343 )
...
* Sparse attn + ops/runtime refactor + v0.3.0
Co-authored-by: Arash Ashari <arashari@microsoft.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
2020-09-01 18:06:15 -07:00
Shaden Smith
903a41a59d
updates website gems after kramdown alert ( #311 )
2020-08-08 15:51:54 -07:00
Jeff Rasley
29c5fe2611
Add webinar link ( #309 )
...
Add webinar on-demand links and update readme
2020-08-07 14:40:31 -07:00
Emmanuel Kahembwe
97c5427372
Fixing a typo ( #303 )
2020-07-28 14:24:12 -07:00
Shaden Smith
7ae8f8bc9b
DeepSpeed webinar announcement ( #301 )
2020-07-24 22:13:11 -07:00
Jeff Rasley
366d88164d
Update amp docs ( #287 )
...
* add amp docs
2020-07-13 13:04:07 -07:00
Conglong Li
6379292c62
Improving deepspeed.ai website ( #269 )
...
* syntax/typo fix
* add README for documentation
* fix links
* update navigation
* typo fix
* docs readme fix
2020-06-23 17:13:01 -07:00
Shaden Smith
664fa30cec
Ai scale ( #271 )
2020-06-19 18:52:42 -07:00
Shaden Smith
2e6d93e0e5
new transformer pre-ln image ( #268 )
2020-06-17 13:01:57 -07:00
Shaden Smith
9fcdc88595
adds gtag for website analytics ( #266 )
2020-06-16 17:41:24 -07:00
RezaYazdaniAminabadi
6e87251cd3
add the fine-tuning results ( #260 )
...
* add the fine-tuning results
* updating tutorial and blog-post
* updated the tutorials and links
2020-06-16 11:00:31 -07:00
Shaden Smith
2a1c5db1b0
Features links ( #252 )
...
* links and formatting
2020-06-04 12:58:49 -07:00