DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
deep-learning
pytorch
gpu
compression
billion-parameters
data-parallelism
inference
mixture-of-experts
model-parallelism
pipeline-parallelism
trillion-parameters
zero
Обновлено 2024-11-20 04:04:47 +03:00
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
machine-learning
python
deep-learning
pytorch
neural-architecture-search
deep-neural-networks
edge-computing
latency
inference
edge-ai
efficient-model
onnx-models
tensorflow-models
Обновлено 2024-07-31 00:16:53 +03:00