62107de30f
Bump tensorflow from 2.0.0-alpha0 to 2.4.2 in /code/seldon |
||
---|---|---|
.azdo | ||
.github | ||
code | ||
docs | ||
notifications/teams | ||
setup | ||
terraform | ||
terraform-bedrock | ||
.env.example | ||
.gitignore | ||
README.md |
README.md
KubeMLOps
Overview
The repository contains a sample created from the Kubeflow End-to-End Pipeline Example on Azure. It builds a Kubeflow pipeline(KFP) with both GitHub actions and Azure DevOps that trains a Tensorflow model recognizing tacos and burritos images. The model is registered in MLFlow. Kubeflow is running on Azure Kubernetes Service(AKS). The structure of this sample should make it easier to “bring your own code” and adopt the template for a real-life machine learning(ML) project.
The architecture of the sample is shown in the following diagram:
- The KFP contains the following steps:
- Data preprocessing
- Training
- Registering
- Deploying
- The model is registered and deployed in AML WS (working on MLFlow model registration also)
- The Continuous Integration(CI) pipeline is implemented with GitHub Actions, it is being triggered on PR, master merge, ChatOps comments (e.g. /build-images, /build-pipeline) and it has the following steps:
- Linting
- Builds Docker images – KFP components/steps
- Publishes the KFP as an artifact
- Uploads the pipeline to Kubeflow
- Runs the KFP
For this sample, DataOps for ML and Observability are currently out of scope.
Prerequisites
Before you get started, make sure you have the following prerequisites:
- An Azure Kubernetes Service cluster
- Ensure the latest version of Azure CLI is installed.
- Clone or fork the repository
Setup
This documentation helps you get started with the sample from infrastructure setup to deployment of the model.
- Kubeflow Installation on AKS
- Securing Kubeflow on AKS
- MLOps with GitHub
- MLOps with Azure DevOps
- Local Dev Experience with Kubeflow
- Experiment Tracking
- Enable GPU for Kubeflow Pipelines on Azure Kubernetes Service (AKS)
- Kubeflow Pipelines dashboards using kubeflow-scraper
Code for the following can be found in the code directory, but currently there is no documentation:
- Running Kubeflow component in parallel
- Running Jupyter Server within Kubeflow
- Running MLFlow Project from Kubeflow
- Model deployment with KF Serving