Antares: an automatic engine for multi-platform kernel generation and optimization. It supports CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, and Android CPU/GPU backends.
Updated 2024-11-20 16:03:41 +03:00
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
machine-learning
deep-learning
pytorch
gpu
compression
billion-parameters
data-parallelism
inference
mixture-of-experts
model-parallelism
pipeline-parallelism
trillion-parameters
zero
Updated 2024-11-20 04:04:47 +03:00
Updated 2024-11-20 03:23:05 +03:00
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
Updated 2024-11-19 03:35:36 +03:00
Metadata for Azure HPC Extensions
Updated 2024-11-19 02:38:24 +03:00
The high-speed OpenGL, OpenCL, OpenAL, OpenXR, GLFW, SDL, Vulkan, Assimp, WebGPU, and DirectX bindings library your mother warned you about.
csharp
graphics
audio
game-development
glfw
graphics-library
haptics
native
openal
opencl
opengl
scientific-visualization
silk
vulkan
webgpu
wgpu
3d
Updated 2024-11-18 18:39:11 +03:00
Win2D is an easy-to-use Windows Runtime API for immediate mode 2D graphics rendering with GPU acceleration. It is available to C#, C++, and VB developers writing apps for the Universal Windows Platform (UWP). It utilizes the power of Direct2D and integrates seamlessly with XAML and CoreWindow.
Updated 2024-10-11 19:04:27 +03:00
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
machine-learning
deep-learning
tensorflow
neural-networks
embedded
speech-recognition
deepspeech
offline
on-device
speech-to-text
Updated 2024-09-04 00:17:43 +03:00
Resource scheduling and cluster management for AI
machine-learning
tensorflow
ai
jupyter
pytorch
artificial-intelligence
model-training
kubernetes
gpu-scheduler
on-premise
resource-management
scheduling
chainer
cloud
cluster-management
cluster-manager
gpu
gpu-cluster
gpu-computing
Updated 2024-06-06 10:56:06 +03:00
This example shows how to convert VideoPlayer texture to OpenCV Mat using AsyncGPUReadback.
Updated 2024-04-24 14:18:00 +03:00
Radeon Rays is a ray intersection acceleration library for multiple hardware and software platforms using CPU and GPU
Updated 2024-02-12 15:19:19 +03:00
A Dataset of Python Challenges for AI Research
Updated 2023-12-21 00:10:56 +03:00
A GPU / device extension framework for Kubernetes
Updated 2023-06-27 16:03:52 +03:00
Demonstrates using the NVIDIA GPU Cloud Marketplace images in CycleCloud clusters.
Updated 2023-05-31 21:47:40 +03:00
Contains intermediate and compiled WebGPU Shading Language shaders from various Unity projects, to be used for shader compiler testing and profiling by WebGPU implementers.
Updated 2023-04-20 06:34:12 +03:00
Simplify HPC and Batch workloads on Azure
azure
docker
azure-functions
containers
serverless
gpu
azure-batch
hpc
infiniband
windows-containers
mpi
slurm
rdma
batch-processing
glusterfs
nfs
singularity
Updated 2023-03-21 00:31:21 +03:00
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
Updated 2022-11-28 22:09:42 +03:00
A highly extensible software stack to empower everyone to build practical real-world live video analytics applications for object detection and counting with cutting-edge machine learning algorithms.
azure
docker
dotnet-core
tensorflow
object-detection
gpu
counting
edge-computing
video-analytics
yolov3
Updated 2022-07-26 04:18:01 +03:00
A profiler to disclose and quantify hardware features on GPUs.
Updated 2022-05-15 15:14:59 +03:00
Scripts that run on Azure VMs and gather a variety of diagnostic information to debug common issues with VMs, GPUs, and InfiniBand.
Updated 2022-03-29 00:17:54 +03:00
Svirl is a GPU-accelerated solver of complex Ginzburg-Landau equations for superconductivity. It consists of a time-dependent solver to describe vortex dynamics and a free energy minimizer to accurately find static configurations.
Updated 2021-07-23 21:41:12 +03:00
AKS Deployment Tutorial
Updated 2020-02-12 05:11:52 +03:00
Optimized primitives for collective multi-GPU communication
Updated 2020-01-07 02:49:13 +03:00
Distributed Deep Learning using AzureML
Updated 2019-11-19 05:28:26 +03:00
Shows how to perform fast retraining with LightGBM in different business cases
azure
machine-learning
benchmark
gpu
lightgbm
distributed-systems
gbdt
xgboost
gbm
gbrt
kaggle
boosted-trees
Updated 2019-07-18 11:16:44 +03:00
Tutorial on how to deploy deep learning models on a GPU-enabled Kubernetes cluster
Updated 2019-02-01 22:04:55 +03:00
Lumia Imaging SDK is a comprehensive set of GPU/CPU imaging tools and effects that run on both mobile and desktop, with high speed and memory efficiency. Samples and extras code are included.
Updated 2015-09-30 09:32:45 +03:00