Граф коммитов

95 Коммитов

Автор SHA1 Сообщение Дата
David Tesar 62107de30f
Merge pull request #2 from microsoft/dependabot/pip/code/seldon/tensorflow-2.4.2
Bump tensorflow from 2.0.0-alpha0 to 2.4.2 in /code/seldon
2021-07-14 15:46:00 -07:00
dependabot[bot] efd9fc6c30
Bump tensorflow from 2.0.0-alpha0 to 2.4.2 in /code/seldon
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.0.0-alpha0 to 2.4.2.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.0.0-alpha0...v2.4.2)

---
updated-dependencies:
- dependency-name: tensorflow
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2021-07-14 22:45:21 +00:00
David Tesar bd74ab1907
Merge #1 from microsoft/dependabot/pip/code/kfserving/kfserving/tensorflow-2.4.2
Bump tensorflow from 2.0.0-alpha0 to 2.4.2 in /code/kfserving/kfserving
2021-07-14 15:44:40 -07:00
dependabot[bot] f4f8dfe5bc
Bump tensorflow from 2.0.0-alpha0 to 2.4.2 in /code/kfserving/kfserving
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.0.0-alpha0 to 2.4.2.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.0.0-alpha0...v2.4.2)

---
updated-dependencies:
- dependency-name: tensorflow
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
2021-07-14 22:42:18 +00:00
dependabot[bot] 926c060f3c
Bump tensorflow from 2.0.0-alpha0 to 2.5.0rc0 in /code/kfserving/custom
Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.0.0-alpha0 to 2.5.0rc0.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v2.0.0-alpha0...v2.5.0-rc0)

Signed-off-by: dependabot[bot] <support@github.com>
2021-05-21 16:31:10 +00:00
Tom Care dc23d76614
Update CI Workflow docker folder paths (#111) 2020-11-06 15:43:46 -08:00
Tom Care f93cecfaa4
AzDO Callback: Raise an exception on an HTTP failure code (#106) 2020-11-06 15:34:16 -08:00
Tom Care 5845ea17be
Fix CI Docker build breaks (#109)
- Move register-artifacts to code/components/
- Update CI Docker steps to point to code/components/
2020-11-06 13:49:51 -08:00
Sabrina Smai c9828ab409
Fix CD for Azure DevOps (#105) 2020-11-05 16:11:24 -08:00
Bryan J Smith a71dd3346b
Deleted unused .py file (#107) 2020-11-05 16:10:28 -08:00
Bryan J Smith b9cda0d9ea
Updated documentation and cleaned up directories (#108) 2020-11-05 16:09:55 -08:00
Tom Care 9ce4cc6e53
Expand kubeflow-scraper docs with PVC instructions (#104) 2020-10-30 15:20:06 -07:00
Sabrina Smai e39e1e47c8
Fixed Github CD Pipeline (#98)
* Added trigger pipeline

* added header auth

* remove duplicate map of pipeline trigger

* adding access token retrieval

* remove whitespace

* adding on push to invoke cd

* test invoke cd on push

* fix push trigger

* comment handler deploy task

* fixed trial 1

* fixed auth pipline path

* replace token names to match auth file

* added adal import

* test comma

* revert comma addition

* Update cd.yml

* Update cd.yml

* Update cd.yml

* main function

* mlflowrunid

* run id

* auth token

* tokens

* uncomment

* changed pipeline trigger for ML register

* Change pipeline trigger for testing

For testing lint fix

* Fixed long line linting issue

* Comment Handle Deploy event for testing

Comment Handle Deploy event for testing

* added # noqa: E501 override lint

added # noqa: E501 override lint

* adding lint override 

adding lint override

* lint issue fix for -tag

* adding #E261 lint

* fix spacing lint issue

* reverting back docker build

* trigger pipeline test

* replacing runid with test

* echoing run_id

* revert comments handle deploy

Co-authored-by: Eugene Fedorenko <eugene.fedor@gmail.com>
2020-10-29 14:40:57 -07:00
Tom Care ea021cbd07
Enable node tolerations and label selectors in the pipeline (#103) 2020-10-27 14:53:28 -07:00
Tom Care 06267986f4
Add Grafana dashboards for cluster and training monitoring of Kubeflow Pipelines runs (#99)
* Add Grafana dashboards for cluster and training monitoring of Kubeflow Pipelines runs

Building on the prometheus exporter for Kubeflow Pipelines, this change introduces a hierarchy of dashboards for monitoring Kubeflow Pipelines runs.

Dashboard features:
- Kubeflow Experiment List
    - Overview of experiments and their job success rates
    - Node allocations and pod status
    - Unscheduled pods
- Kubeflow Experiment Detail
    - Overview of experiment stats incl. success rate, duration
    - List of runs with cumulative CPU usage, peak memory usage, duration, etc
    - Step information across all runs, showing top CPU and memory usage
    - Model metric table showing registered metrics for each run
- Kubeflow Run Detail
    - List of steps with pod information incl. duration, cumulative CPU usage, peak memory usage
    - Step lifetime display by pod
    - CPU and memory usage graph

* Update dashboards to latest version

* Update kubeflow-scraper README with information and screenshots of the dashboards
2020-10-23 16:24:48 -07:00
Bryan J Smith 6d21b88e49
Added new query and metric for experiment metrics (#97) 2020-10-15 11:51:53 -07:00
yilun-msft 4532cf48d2
Azure Manifest tests for service availability and pipeline execution (#90) 2020-10-14 12:57:55 -07:00
jotaylo 43c6a5deca
kustomization for Azure hosted dependencies (#86) 2020-10-14 11:59:46 -07:00
Eugene Fedorenko 4cdb1c7f27
Kubeflow Pipelines Scalability and Performance Testing (#93) 2020-10-14 11:31:38 -07:00
Tom Care 3663941714
Fix kubeflow-scraper to also query runs finished in the last 30min (#95) 2020-10-13 17:57:05 -07:00
Bryan J Smith 92f26664c7
Changed mlflow dependency to version 1.11+ (#94) 2020-10-08 16:33:40 -07:00
Sushant Divate 18be85c80d
fix link (#92) 2020-10-06 14:21:58 -07:00
Sabrina Smai 0d343e1b82
fixed broken link (#91) 2020-10-06 12:11:25 -07:00
Sabrina Smai 5b93734b80
Speeding up Docker image build (#85) 2020-10-06 10:47:04 -07:00
Tom Care bed318888f
Add Kubeflow Pipeline data scraper for Prometheus (#88) 2020-10-02 15:30:26 -07:00
jotaylo afae33ed0d
terraform template (#89) 2020-10-02 15:26:07 -07:00
Eugene Fedorenko 0d427db554
Notifications to Teams (#87) 2020-10-01 12:07:16 -07:00
Eugene Fedorenko 7f7bf24545
canary deployment (#84) 2020-09-25 14:10:23 -07:00
David Tesar f554263f8a
Add nodepool instructions (#75) 2020-08-24 19:45:52 -07:00
jotaylo dd0b82f5d3
Use PAT in env var/path to authenticate to Azure Pipelines (#82) 2020-08-13 15:16:33 -07:00
Sushant Divate 79373a9432
Enable GPU for Kubeflow Pipelines on Azure Kubernetes Service (AKS) (#83)
* gpu-op

* doc correction

* doc correction

* doc formatting

* fix kinting

* fix doc link
2020-08-13 14:16:56 -07:00
Sushant Divate cc8ef0e6d0
Sudivate/local experience (#80)
* local exp

* update gitignore with env

* local dev exp

* ignore compiled pipeline

* comment image building

* multistep pipeline from notebook components

* doc

* pipepline and sample .env

* added doc to index

* updated comment

* add gif

* fix linting

* doc fix

* replace gif

* add new flow

* doc correction

* doc correction

* doc correction

* fixed grammer

* logging notebook output

* punctuation fix
2020-08-07 16:12:41 -07:00
Sushant Divate 617ef9c074
compiling pipeline in PR (#81) 2020-08-06 18:21:03 -07:00
Bryan J Smith ec3256d2da
Added Experiment Tracking doc and corresponding code (#79)
* Initial commit of Kubeflow Pipeline MLFlow Experiment Run Dashboard

* Modified Power BI report and Kube flow installation doc. Added Experiment Tracking doc and corresponding images

* Added link to Experiment Tracking doc

* Added more friendlier naming and converted millisecond timestamp to datetime

* Added links for Kubeflow pipelines and MLflow experiments, changed Experiment tracking doc structure, and fixed error in MLOps Github doc

* Added a few screenshots from Kubeflow and MLflow dashboards
2020-08-05 11:31:32 -07:00
Sushant Divate f53b53c971
AML Register Model Component for upstream (#78) 2020-07-24 11:53:46 -07:00
Sushant Divate 127321dfe1
Improved Security in pipeline (#77) 2020-07-23 14:33:23 -07:00
jotaylo 1561974db5
Add Kubeflow component to queue Azure DevOps pipeline (#76) 2020-07-23 10:46:22 -07:00
Eugene Fedorenko 3a5a5330a1
Pipeline components (#72) 2020-07-21 13:55:25 -07:00
Sushant Divate d81156416e
Removed kubeflow register component and updated endpoint (#74)
* add pipeline metdata

* added pipeline description

* remove kubeflow register and update endpoint
2020-07-20 17:19:38 -07:00
Sushant Divate 6ac55c6d9a
Trace back KF pipelines to Azure DevOps pipeline and other metedata (#73) 2020-07-20 15:21:44 -07:00
Eugene Fedorenko 273e72814c
KFP Upgrade (#69) 2020-07-20 15:00:22 -07:00
jotaylo 0884feaaf9
Use correct tags in CI pipeline (#71) 2020-07-20 11:53:42 -07:00
David Tesar e38102dfb7
add latest AKS features (#70) 2020-07-17 14:25:18 -07:00
Eugene Fedorenko 2b5422a727
Seldon inference pipeline (#68) 2020-07-17 12:07:51 -07:00
jotaylo 840ec0e5da
Add Azure Pipelines CD pipeline (#67) 2020-07-10 16:23:52 -07:00
Bryan J Smith 9bf05ea542
Added recent Bedrock changes (#66) 2020-07-08 16:09:17 -07:00
Bryan J Smith 301b072ed4
Azure DevOps documentation (#65)
* Added Azure DevOps documentation
2020-07-07 15:25:44 -07:00
Bryan J Smith 86435410a4
Fixed Final Issues (#64)
* Fixed Final Issues
2020-06-29 22:33:28 -07:00
Bryan J Smith d4bdf5134a
Update README.md (#63)
Fixed typo in path to architecture diagram
2020-06-29 22:13:51 -07:00
Bryan J Smith 87651db56d
Update README.md (#62)
Fixed typo in MLOps with GitHub paths
2020-06-29 22:08:53 -07:00