Tutorial on how to deploy Deep Learning models on GPU enabled Kubernetes cluster

deep-learning flask gpu kubernetes python tensorflow

Перейти к файлу

MSalvaris 494fe6bcf5 Merge branch 'master' into mat_dev_revision		2018-04-10 11:38:52 +01:00
Keras_Tensorflow	clean up 3rd pass	2018-04-09 15:42:34 +00:00
Tensorflow	Updates notebooks. Adds documentation and fixes links	2018-04-10 10:36:13 +00:00
static	Adds example image	2018-03-31 21:52:20 +01:00
.gitignore	Initial commit	2018-03-20 03:47:11 -07:00
LICENSE	Initial working version of TF model	2018-03-21 11:44:11 +00:00
README.md	updates readme	2018-04-04 18:21:37 +00:00

README.md

Authors: Mathew Salvaris and Fidan Boylu Uz

Deploy Deep Learning CNN on Kubernetes Cluster with GPUs

In this repository there are a number of tutorials in Jupyter notebooks that have step-by-step instructions on how to deploy a pretrained deep learning model on a GPU enabled Kubernetes cluster. The tutorials cover how to deploy models from the following deep learning frameworks:

For each framework we go through 7 steps:

Model development where we load the pretrained model and test it by using it to score images
Developing the interface our Flask app will use to load and call the model
Building the Docker Image with our Flask REST API and model
Testing our Docker image before deployment
Creating our Kubernetes cluster and deploying our application to it
Testing the deployed model
Testing the throughput of our model

The application we will develop is a simple image classification service, where we will submit an image and get back what class the image belongs to.

If you already have a Docker image that you would like to deploy or you simply want to use the image we built you can skip the first four notebooks.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.