зеркало из https://github.com/microsoft/DeepSpeed.git
7260890452
The operation `.to('cpu') `is not necessary for exp_counts, and it will cause device to host synchronization which damage performance. Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com> |
||
---|---|---|
.. | ||
__init__.py | ||
experts.py | ||
layer.py | ||
mappings.py | ||
sharded_moe.py | ||
utils.py |