Commit graph

17 Commits

Author SHA1 Message Date
Michael Wyatt baa95c6256
remove benchmarks (now in DSE) and add links (#3157)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-04-07 08:51:49 -07:00
Michael Wyatt b361c72761
Update DeepSpeed copyright license to Apache 2.0 (#3111)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-30 17:14:38 -07:00
Mayank Mishra a6317eb509
♻️ replace deprecated functions for communication (#2995)
* 💩 drop dead code

* ♻️ replace has_all_gather_base with has_all_gather_into_tensor

* ♻️ remove deprecated _all_gather_base

* ♻️ remove deprecated _reduce_scatter_base

* 🎨 reformat files

* 🔧 fix _six

* Trigger CI

* Trigger CI

* Trigger CI

* 🎨 formatting

* incorporate suggestion

* incorporate suggestion

---------

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-29 22:27:15 -04:00
Quentin Anthony 9726bd4650
Fix comms benchmark import issues and support MPI/slurm launching (#2932)
* Fix benchmark import issues and support MPI launching with pure torch.dist

* Formatting

* Update comms benchmark README

* Formatting

* Added better error handling and support MPI torch.dist backend

* Update formatting versions

* Formatting again

* Trigger CI

---------

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-27 17:15:18 -04:00
Jeff Rasley 91d63e0228
update formatter version and style settings (#3098)
2023-03-27 07:55:19 -04:00
Ma, Guokai 0acf7e9c48
[RFC] add device abstraction to allow other device than CUDA be used (#2221)
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2023-03-07 09:40:17 -08:00
Jeff Rasley da84e60d98
add missing license info to top of all source code (#2889)
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Conglong Li <conglong.li@gmail.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
2023-02-27 11:20:41 -08:00
Ammar Ahmad Awan e4b3b610ba
Refactor DS inference API. No longer need replace_method. (#2831)
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
2023-02-15 23:17:02 +00:00
Jeff Rasley 2076bf2311
get mask token from tokenizer (#2592)
2022-12-09 17:10:03 -08:00
Molly Smith 5eafb8c78d
Make error regex more generic in collect_results.py (#2415)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-10-14 12:47:37 -07:00
Molly Smith eed40324db
Capture error message during sweep tests (#2351)
* Collect error messages in results.csv

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
2022-09-27 15:54:40 -07:00
Ammar Ahmad Awan 993264388d
Inference profiling updates/fixes (#2348) (#2349)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
2022-09-23 14:38:09 -07:00
Michael Wyatt 1592381018
Add more options to inference benchmark (#2325)
2022-09-16 09:58:28 -07:00
Jeff Rasley 9595dff6d7
add inference eval scripts (#2303)
2022-09-12 13:25:00 -07:00
Alex Hedges 316c4a43e0
Add flake8 to pre-commit checks (#2051)
2022-07-25 16:48:08 -07:00
Quentin Anthony 8413b7f83d
DS Benchmarks QoL Improvements (#2120)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-07-22 15:47:56 -07:00
Quentin Anthony 9b70ce56e7
Comms Benchmarks (#2040)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
2022-06-29 10:49:20 -07:00