AKS Deployment Tutorial

azureml deep deep-learning gpu kubernetes

Перейти к файлу

YanZhangADS 477043606e fix package import		2019-01-24 20:22:50 +00:00
Keras_Tensorflow	fix package import	2019-01-24 20:22:50 +00:00
static	add example image	2019-01-23 17:29:26 +00:00
.gitignore	Initial commit	2018-08-31 11:47:39 -07:00
LICENSE	Initial commit	2018-08-31 11:47:41 -07:00
README.md	enhance doc	2019-01-23 17:55:50 +00:00

README.md

Authors: Yan Zhang, Mathew Salvaris, and Fidan Boylu Uz

Deploy Deep Learning CNN on Kubernetes Cluster with GPUs - AML version

Overview

In this repository there are a number of tutorials in Jupyter notebooks that have step-by-step instructions on how to deploy a pretrained deep learning model on a GPU enabled Kubernetes cluster throught Azure Machine Learning (AML). The tutorials cover how to deploy models from the following deep learning frameworks:

For each framework, we go through the following steps:

Create an AML Workspace
Model development where we load the pretrained model and test it by using it to score images
Develop the API that will call our model
Building the Docker Image with our REST API and model and testing the image
Creating our Kubernetes cluster and deploying our application to it
Testing the deployed model
Testing the throughput of our model
Cleaning up resources

Design

The application we will develop is a simple image classification service, where we will submit an image and get back what class the image belongs to. The application flow for the deep learning model is as follows:

Deep learning model is registered to AML model registry.
AML creates a docker image including the model and scoring script.
AML deploys the scoring image on Azure Kubernetes Service (AKS) as a web service.
The client sends a HTTP POST request with the encoded image data.
The web service created by AML preprocesses the image data and sends it to the model for scoring.
The predicted categories with their scores are then returned to the client.

NOTE: The tutorial goes through step by step how to deploy a deep learning model on Azure; it does not include enterprise best practices such as securing the endpoints and setting up remote logging etc.

Deploying with GPUS: For a detailed comparison of the deployments of various deep learning models, see the blog post here which provides evidence that, at least in the scenarios tested, GPUs provide better throughput and stability at a lower cost.

Setup

Please find out the Prerequisites and Setup procedures for different networks:

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.