DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
deep-learning
pytorch
gpu
compression
billion-parameters
data-parallelism
inference
mixture-of-experts
model-parallelism
pipeline-parallelism
trillion-parameters
zero
Обновлено 2024-09-13 00:12:52 +03:00
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
machine-learning
python
deep-learning
pytorch
neural-architecture-search
deep-neural-networks
edge-computing
latency
inference
edge-ai
efficient-model
onnx-models
tensorflow-models
Обновлено 2024-07-31 00:16:53 +03:00