зеркало из https://github.com/microsoft/DeepSpeed.git
c37fe9cbfb
In the latest Pytorch nightly, the exception thrown from `torch.distributed.distributed_c10d.get_global_rank()` is changed from `RuntimeError` to `ValueError` so we need to update our try-catch in `deepspeed.comm` Tested with torch version 2.3.0.dev20231221+cu121 Fixes: https://github.com/microsoft/DeepSpeed/issues/4853 |
||
---|---|---|
.. | ||
__init__.py | ||
backend.py | ||
ccl.py | ||
comm.py | ||
config.py | ||
constants.py | ||
reduce_op.py | ||
torch.py | ||
utils.py |