refactor for CLI 2.0 preview (#495)
* scaffolding
* fixes, might break
* update contributing
* fix
* try 0.63
* fix readme
* order of cells
* fix env
* uncomment lines
* try a job
* add job test
* try miniconda image
* try mcr image
* -n -> --name for az ml workspace create
* fix workspace create
* same
* isolate rg
* fix filename
* simply env; increment version
* add mlflow
* fix enviornment, increment version
* switch to .sh
* add workspace create
* quick fix
* fix typo
* add asset creation
* debugging
* fix action
* fix
* typo
* fix typo
* workflow edits
* fix?
* edits
* fix data?
* Add more commands
* try something
* remove notebook
* update
* fix?
* try data workaround
* fix training script
* add azureml-mlflow to env
* saving
* try sweep job
* sweep edits
* fix yaml
* remove limits
* testing
* edits
* minor edits to basic.yml
* add exp name
* saving
* save; green?
* move --web to untested.sh
* hmm
* add model assets
* remove dsstore
* saving w/ model create
* add endpoints dir
* try something for docs
* try for docs
* try anon data and assets
* edit
* path edit
* fix asset path
* move stuff around
* try ./
* fix mistake
* save
* add mount
* fix
* change model to use run
* remove model register
* data hack
* test
* test registered file dataset
* saving
* testing
* whoops
* change strategy
* fix name
* switch rg back
* switch creds
* switch lightgbm back
* fix ws
* add file:
* remove deepspeed
* try tensorflow
* remove tensorflow
* edits for docs
* add verify
* unisntall -> remove
* -0
* saving
* add check_extensions
* fix mistake
* saving; might break
* update manage-assets and test it
* whoops
* minor edits
* local hello world
* edits
* simplify workflow
* try compute job
* ?
* try compute type
* fix hello world
* fix?
* major refactor
* remove line
* comments
* save
* save
* order of operations
* saving - run sweep job
* magic
* switch to data - possible typos
* fix typos?
* ...
* try dedenting for sweep
* edits
* names
* sweep hack for now
* broke stuff?
* test distributed
* update query logic
* add exp name, :1 to env
* wait for job to complete
* fix user error
* env to version 11
* readme edit
* try something
* remove redundant stuff
* version
* try downloading cifar
* try job status check
* debug bash
* fixes
* try stuff
* add back log upload
* fix bash?
* copy examples/endpoints from v2-preview
* run formatting
* add batch endpoint (#427)
* hack
* move compute create to setup.sh; --name to -n for workspace create
* NC6 -> NC12
* try randint sweep
* remove randint from sweep; download=False for pytorch-cifar
* comment out logs upload from worklow
* slim iris lightgbm code
* try something for cifar
* fix
* preprocess -> process
* mlflow.pytorch log model for cifar
* no mlflow, fix cifar
* run formatting
* relocate hello world
* whoops
* cifar epochs 20 -> 2
* try new env for pytorch gpu
* switch pytorch to mlflow logging
* cleanup cifar code
* fix
* write to ./model?
* edit batch endpoint asset path
* force add model.pkl
* copy workspace assets
* add some schemas, more needed
* add more schemas
* fix schemas
* Updating the schema link to CDN endpoint (#428)
* Update batch endpoint test (#429)
* update schemas
* fix more schemas
* simplify
* Revert "simplify"
This reverts commit 38ec77e0ca9b54051d36699573dea04c0abb583b.
* Update batch endpoint sample (#432)
* test python/cli refactor (#431)
* saving
* Update environment.yml (#430)
* ?
* testing
* testing
* testing
* cli fix
* fix
* saving
* saving
* whoops
* fix readme
* fix readme
* whoops
* minor readme edit
* last readme edits
* add setup workspace step
* fix cli tests (#433)
* test
* testing
* fixes
* save
* test
* fix how-to-manage-assets
* minor edit
* fix
* fix setup-workspace
* fix readme (#434)
* edits for docs (#437)
* fix readme
* remove sweep hack
* readme fix
* command => sweep in schema
* saving
* add back setup-workspace.sh
* ?
* sweep hack
* Saving
* edit readme asset logic
* -A
* minor edits for docs (#438)
* fix readme
* remove sweep hack
* readme fix
* command => sweep in schema
* saving
* add back setup-workspace.sh
* ?
* sweep hack
* Saving
* edit readme asset logic
* -A
* >- => |
* missed hello world
* remove old env
* blind copy
* yolo
* edits
* edits
* docs edits (#440)
* docs edits
* save
* remove upgrade
* fix
* reorder
* save (#442)
* edit training jobs (#443)
* saving
* minor edit
* fix world language? (#444)
* change tf horovod env (#445)
* change tf horovod env
* -
* move exp name
* standardize cli jobs (#446)
* test
* standardize training jobs
* online endpoints (#435)
* online endpoints
* run readme.py; manual edit workflows
* fix delete command
* fix
* fix endpoint
* rerun readme
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* quick fix (#448)
* run readme (#449)
* docs updates (#450)
* minor edit for docs (#451)
* docs updates
* quick fix
* undo sweep hack (#452)
* edits for docs (#453)
* add hello world back
* updates
* minor edits
* Update python-ml-basic-cpu.yml
* Add REST examples (#439)
* Add scripts without blob cli
* Everything working for endpoint except discrimnator issue with online deployment
* Working simple job flow
* Use default container for data
* Rename scripts, make wait_for_completion a function, fix online deployment
* Add scoring to endpoint
* Change structure of folders
* Remove todo
* Move rest to cli folder
* Switch to lightgbm from xgboost
* Fix wait_for_completion
* Get AZURE_STORAGE_KEY from az cli
* Change MIR region to westus
* Remove exports from variables
* Fix scoring the endpoint
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
* Fix closing tag of sample to match opening tag. (#455)
* updates for docs (#456)
* switch cli version back (#457)
* /
* tests for rest how-to (#454)
* run readme
* save
* test rest with azureml-examples
* fixes
* run readme
* Use old way of getting certain variables
Co-authored-by: tmccrmck <tjmcc01@gmail.com>
* Update how-to-batch-score.sh (#436)
Fix batch asset path
* force add files (#458)
* endpoint script fix (#459)
* fix endpoints
* fix endpoints
* fix endpoint script
* fix endpoints
* edits for docs (#461)
* use
* exp name edits
* bash edits
* save
* save
* edit
* Get storage info from default datastore (#460)
* simplify hello world
* move RG (#463)
* switch credentials (#464)
* debug (#465)
* debug
* fix typo
* fjeiwpaol tefjpao
* Tweak to canary-flow shellscript (#466)
* Fix `test_green` snippet
`test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html
* Results of `python readme.py`
This seems erroneous to me, but...
* Changed endpoint name to my-new-endpoint per rest of script
* fixed typo in var name
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* Add managed identity shell script for online endpoint (#467)
* Online endpoint managed identities tests shell script
* Updated online endpoint managed identity shell scripts
Co-authored-by: pansav <pansav@microsoft.com>
* imperative flow (#469)
* .yaml -> .yml (#470)
* .yaml -> yml
* update python files
* edit
* Add get deployment logs to rest docs (#473)
* Add getLogs API call
* Fix typo
* minor edit to hello world (#474)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
# imports
import os
import json
import glob
import argparse

# define constants
2021-05-26 16:58:03 +03:00
EXCLUDED_JOBS = ["cifar"]
EXCLUDED_ENDPOINTS = ["conda.yml", "environment.yml", "batch", "online"]
EXCLUDED_ASSETS = [
    "conda.yml",
    "environment.yml",
    "conda-envs",
    "mlflow-models",
    "workspace",
]
2021-06-08 17:11:10 +03:00
EXCLUDED_SCRIPTS = ["setup", "cleanup"]
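
The excerpt defines the exclusion lists but does not show how they are applied. The following is a minimal sketch of one plausible use, assuming the lists are matched as substrings against file names found with `glob`. The helper `filter_excluded` and the sample file names are hypothetical and do not come from the original script.

```python
import glob
import os
import tempfile

# Hypothetical copy of one constant from the script, repeated here so the
# sketch is self-contained.
EXCLUDED_JOBS = ["cifar"]


def filter_excluded(paths, excluded):
    """Return sorted paths whose file name contains none of the excluded terms.

    Substring matching on the base name is an assumption; the real script
    may match differently.
    """
    return sorted(
        p for p in paths
        if not any(term in os.path.basename(p) for term in excluded)
    )


# Demonstrate on a throwaway directory so the sketch runs anywhere.
with tempfile.TemporaryDirectory() as root:
    for name in ("iris-job.yml", "cifar-job.yml", "hello-job.yml"):
        open(os.path.join(root, name), "w").close()
    found = glob.glob(os.path.join(root, "*.yml"))
    kept = [os.path.basename(p) for p in filter_excluded(found, EXCLUDED_JOBS)]
    print(kept)
```

Matching on the base name rather than the full path avoids accidentally dropping files whose parent directory happens to contain an excluded term.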
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
# define functions
def main(args):
    # get list of jobs
    jobs = sorted(glob.glob("jobs/**/*job*.yml", recursive=True))
    jobs += sorted(glob.glob("jobs/*.yml", recursive=False))
    jobs = [
        job.replace(".yml", "")
        for job in jobs
        if not any(excluded in job for excluded in EXCLUDED_JOBS)
    ]

    # get list of endpoints
    endpoints = sorted(glob.glob("endpoints/**/*.yml", recursive=True))
    endpoints = [
        endpoint.replace(".yml", "")
        for endpoint in endpoints
        if not any(excluded in endpoint for excluded in EXCLUDED_ENDPOINTS)
    ]

    # get list of assets
    assets = sorted(glob.glob("assets/**/*.yml", recursive=True))
    assets = [
        asset.replace(".yml", "")
        for asset in assets
        if not any(excluded in asset for excluded in EXCLUDED_ASSETS)
    ]

    # get list of scripts
    scripts = sorted(glob.glob("*.sh", recursive=False))
    scripts = [
        script.replace(".sh", "")
        for script in scripts
        if not any(excluded in script for excluded in EXCLUDED_SCRIPTS)
    ]

    # write workflows
    write_workflows(jobs, endpoints, assets, scripts)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
|
|
|
|
    # read existing README.md
    with open("README.md", "r") as f:
        readme_before = f.read()

    # write README.md
    write_readme(jobs, endpoints, assets, scripts)
    # read modified README.md
    with open("README.md", "r") as f:
        readme_after = f.read()

    # check if readme matches
    if args.check_readme:
        if not check_readme(readme_before, readme_after):
            print("README.md file did not match...")
            exit(2)


def write_readme(jobs, endpoints, assets, scripts):
    # read in prefix.md and suffix.md
    with open("prefix.md", "r") as f:
        prefix = f.read()
    with open("suffix.md", "r") as f:
        suffix = f.read()

    # define markdown tables
    jobs_table = "\n**Jobs** ([jobs](jobs))\n\npath|status|description\n-|-|-\n"
    endpoints_table = (
        "\n**Endpoints** ([endpoints](endpoints))\n\npath|status|description\n-|-|-\n"
    )
    assets_table = "\n**Assets** ([assets](assets))\n\npath|status|description\n-|-|-\n"
    scripts_table = "\n**Scripts**\n\npath|status|\n-|-\n"
* -A
* minor edits for docs (#438)
* fix readme
* remove sweep hack
* readme fix
* command => sweep in schema
* saving
* add back setup-workspace.sh
* ?
* sweep hack
* Saving
* edit readme asset logic
* -A
* >- => |
* missed hello world
* remove old env
* blind copy
* yolo
* edits
* edits
* docs edits (#440)
* docs edits
* save
* remove upgrade
* fix
* reorder
* save (#442)
* edit training jobs (#443)
* saving
* minor edit
* fix world language? (#444)
* change tf horovod env (#445)
* change tf horovod env
* -
* move exp name
* standardize cli jobs (#446)
* test
* standardize training jobs
* online endpoints (#435)
* online endpoints
* run readme.py; manual edit workflows
* fix delete command
* fix
* fix endpoint
* rerun readme
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* quick fix (#448)
* run readme (#449)
* docs updates (#450)
* minor edit for docs (#451)
* docs updates
* quick fix
* undo sweep hack (#452)
* edits for docs (#453)
* add hello world back
* updates
* minor edits
* Update python-ml-basic-cpu.yml
* Add REST examples (#439)
* Add scripts without blob cli
* Everything working for endpoint except discrimnator issue with online deployment
* Working simple job flow
* Use default container for data
* Rename scripts, make wait_for_completion a function, fix online deployment
* Add scoring to endpoint
* Change structure of folders
* Remove todo
* Move rest to cli folder
* Switch to lightgbm from xgboost
* Fix wait_for_completion
* Get AZURE_STORAGE_KEY from az cli
* Change MIR region to westus
* Remove exports from variables
* Fix scoring the endpoint
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
* Fix closing tag of sample to match opening tag. (#455)
* updates for docs (#456)
* switch cli version back (#457)
* /
* tests for rest how-to (#454)
* run readme
* save
* test rest with azureml-examples
* fixes
* run readme
* Use old way of getting certain variables
Co-authored-by: tmccrmck <tjmcc01@gmail.com>
* Update how-to-batch-score.sh (#436)
Fix batch asset path
* force add files (#458)
* endpoint script fix (#459)
* fix endpoints
* fix endpoints
* fix endpoint script
* fix endpoints
* edits for docs (#461)
* use
* exp name edits
* bash edits
* save
* save
* edit
* Get storage info from default datastore (#460)
* simplify hello world
* move RG (#463)
* switch credentials (#464)
* debug (#465)
* debug
* fix typo
* fjeiwpaol tefjpao
* Tweak to canary-flow shellscript (#466)
* Fix `test_green` snippet
`test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html
* Results of `python readme.py`
This seems erroneous to me, but...
* Changed endpoint name to my-new-endpoint per rest of script
* fixed typo in var name
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* Add managed identity shell script for online endpoint (#467)
* Online endpoint managed identities tests shell script
* Updated online endpoint managed identity shell scripts
Co-authored-by: pansav <pansav@microsoft.com>
* imperative flow (#469)
* .yaml -> .yml (#470)
* .yaml -> yml
* update python files
* edit
* Add get deployment logs to rest docs (#473)
* Add getLogs API call
* Fix typo
* minor edit to hello world (#474)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
|
|
|
|
# process jobs
for job in jobs:
    # build entries for tutorial table
    status = f"[![{job}](https://github.com/Azure/azureml-examples/workflows/cli-{job.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{job.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{job}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # keep everything after the first ": " so colons in the text survive
                    description = line.split(": ", 1)[-1].strip()
                    break
    except Exception:
        # missing or unreadable YAML: fall back to the default description
        pass

    # add row to tutorial table
    row = f"[{job}.yml]({job}.yml)|{status}|{description}\n"
    jobs_table += row

# process endpoints
for endpoint in endpoints:
    # build entries for tutorial table
    status = f"[![{endpoint}](https://github.com/Azure/azureml-examples/workflows/cli-{endpoint.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{endpoint.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{endpoint}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # keep everything after the first ": " so colons in the text survive
                    description = line.split(": ", 1)[-1].strip()
                    break
    except Exception:
        # missing or unreadable YAML: fall back to the default description
        pass

    # add row to tutorial table
    row = f"[{endpoint}.yml]({endpoint}.yml)|{status}|{description}\n"
    endpoints_table += row

# process assets
for asset in assets:
    # build entries for tutorial table
    status = f"[![{asset}](https://github.com/Azure/azureml-examples/workflows/cli-{asset.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{asset.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{asset}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # keep everything after the first ": " so colons in the text survive
                    description = line.split(": ", 1)[-1].strip()
                    break
    except Exception:
        # missing or unreadable YAML: fall back to the default description
        pass

    # add row to tutorial table
    row = f"[{asset}.yml]({asset}.yml)|{status}|{description}\n"
    assets_table += row

# process scripts
for script in scripts:
    # build entries for tutorial table
    status = f"[![{script}](https://github.com/Azure/azureml-examples/workflows/cli-scripts-{script}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-scripts-{script})"
    link = f"https://scripts.microsoft.com/azure/machine-learning/{script}"
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
|
|
|
|
        # add row to tutorial table
        row = f"[{script}.sh]({script}.sh)|{status}\n"
        scripts_table += row

    # write README.md
    print("writing README.md...")
    with open("README.md", "w") as f:
        f.write(
            prefix
            + scripts_table
            + jobs_table
            + endpoints_table
            + assets_table
            + suffix
        )


def write_workflows(jobs, endpoints, assets, scripts):
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
    print("writing .github/workflows...")

    # process jobs
    for job in jobs:
        # write workflow file
        write_job_workflow(job)

    # process endpoints
    for endpoint in endpoints:
        # write workflow file
        # write_endpoint_workflow(endpoint)
        pass

    # process assets
    for asset in assets:
        # write workflow file
        write_asset_workflow(asset)

    # process scripts
    for script in scripts:
        # write workflow file
        write_script_workflow(script)


def check_readme(before, after):
    return before == after


def parse_path(path):
    # split a "/"-separated path into its filename, its parent directory,
    # and a hyphenated form of the full path; each stays None on failure
    filename = None
    project_dir = None
    hyphenated = None
    try:
        filename = path.split("/")[-1]
    except Exception:
        # leave as None if path is not a string
        pass
    try:
        project_dir = "/".join(path.split("/")[:-1])
    except Exception:
        pass
    try:
        hyphenated = path.replace("/", "-")
    except Exception:
        pass

    return filename, project_dir, hyphenated


def write_job_workflow(job):
    filename, project_dir, hyphenated = parse_path(job)
    creds = "${{secrets.AZ_AE_CREDS}}"
* fix world language? (#444)
* change tf horovod env (#445)
* change tf horovod env
* -
* move exp name
* standardize cli jobs (#446)
* test
* standardize training jobs
* online endpoints (#435)
* online endpoints
* run readme.py; manual edit workflows
* fix delete command
* fix
* fix endpoint
* rerun readme
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* quick fix (#448)
* run readme (#449)
* docs updates (#450)
* minor edit for docs (#451)
* docs updates
* quick fix
* undo sweep hack (#452)
* edits for docs (#453)
* add hello world back
* updates
* minor edits
* Update python-ml-basic-cpu.yml
* Add REST examples (#439)
* Add scripts without blob cli
* Everything working for endpoint except discrimnator issue with online deployment
* Working simple job flow
* Use default container for data
* Rename scripts, make wait_for_completion a function, fix online deployment
* Add scoring to endpoint
* Change structure of folders
* Remove todo
* Move rest to cli folder
* Switch to lightgbm from xgboost
* Fix wait_for_completion
* Get AZURE_STORAGE_KEY from az cli
* Change MIR region to westus
* Remove exports from variables
* Fix scoring the endpoint
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
* Fix closing tag of sample to match opening tag. (#455)
* updates for docs (#456)
* switch cli version back (#457)
* /
* tests for rest how-to (#454)
* run readme
* save
* test rest with azureml-examples
* fixes
* run readme
* Use old way of getting certain variables
Co-authored-by: tmccrmck <tjmcc01@gmail.com>
* Update how-to-batch-score.sh (#436)
Fix batch asset path
* force add files (#458)
* endpoint script fix (#459)
* fix endpoints
* fix endpoints
* fix endpoint script
* fix endpoints
* edits for docs (#461)
* use
* exp name edits
* bash edits
* save
* save
* edit
* Get storage info from default datastore (#460)
* simplify hello world
* move RG (#463)
* switch credentials (#464)
* debug (#465)
* debug
* fix typo
* fjeiwpaol tefjpao
* Tweak to canary-flow shellscript (#466)
* Fix `test_green` snippet
`test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html
* Results of `python readme.py`
This seems erroneous to me, but...
* Changed endpoint name to my-new-endpoint per rest of script
* fixed typo in var name
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* Add managed identity shell script for online endpoint (#467)
* Online endpoint managed identities tests shell script
* Updated online endpoint managed identity shell scripts
Co-authored-by: pansav <pansav@microsoft.com>
* imperative flow (#469)
* .yaml -> .yml (#470)
* .yaml -> yml
* update python files
* edit
* Add get deployment logs to rest docs (#473)
* Add getLogs API call
* Fix typo
* minor edit to hello world (#474)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{project_dir}/**
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup
      run: bash setup.sh
      working-directory: cli
    - name: create job
      run: |
        run_id=$(az ml job create -f {job}.yml --query name -o tsv)
        az ml job stream -n $run_id
        status=$(az ml job show -n $run_id --query status -o tsv)
        echo $status
        if [[ $status == "Completed" ]]
        then
          echo "Job completed"
        elif [[ $status == "Failed" ]]
        then
          echo "Job failed"
          exit 1
        else
          echo "Job status not failed or completed"
          exit 2
        fi
      working-directory: cli\n"""

    # write workflow
    with open(f"../.github/workflows/cli-{job.replace('/', '-')}.yml", "w") as f:
        f.write(workflow_yaml)


def write_endpoint_workflow(endpoint):
    filename, project_dir, hyphenated = parse_path(endpoint)
    creds = "${{secrets.AZ_AE_CREDS}}"
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{project_dir}/**
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup workspace
      run: bash setup.sh
      working-directory: cli
    - name: create endpoint
      run: az ml endpoint create -f {endpoint}.yml
      working-directory: cli\n"""

    # write workflow
    with open(f"../.github/workflows/cli-{hyphenated}.yml", "w") as f:
        f.write(workflow_yaml)


def write_asset_workflow(asset):
    filename, project_dir, hyphenated = parse_path(asset)
    creds = "${{secrets.AZ_AE_CREDS}}"
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{asset}.yml
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
    - name: setup workspace
      run: bash setup.sh
      working-directory: cli
    - name: create asset
      run: az ml {asset.split('/')[1]} create -f {asset}.yml
      working-directory: cli\n"""

    # write workflow
    with open(f"../.github/workflows/cli-{hyphenated}.yml", "w") as f:
        f.write(workflow_yaml)


def write_script_workflow(script):
    filename, project_dir, hyphenated = parse_path(script)
    creds = "${{secrets.AZ_AE_CREDS}}"
    workflow_yaml = f"""name: cli-scripts-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{script}.sh
      - .github/workflows/cli-scripts-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
* docs updates (#450)
* minor edit for docs (#451)
* docs updates
* quick fix
* undo sweep hack (#452)
* edits for docs (#453)
* add hello world back
* updates
* minor edits
* Update python-ml-basic-cpu.yml
* Add REST examples (#439)
* Add scripts without blob cli
* Everything working for endpoint except discrimnator issue with online deployment
* Working simple job flow
* Use default container for data
* Rename scripts, make wait_for_completion a function, fix online deployment
* Add scoring to endpoint
* Change structure of folders
* Remove todo
* Move rest to cli folder
* Switch to lightgbm from xgboost
* Fix wait_for_completion
* Get AZURE_STORAGE_KEY from az cli
* Change MIR region to westus
* Remove exports from variables
* Fix scoring the endpoint
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
* Fix closing tag of sample to match opening tag. (#455)
* updates for docs (#456)
* switch cli version back (#457)
* /
* tests for rest how-to (#454)
* run readme
* save
* test rest with azureml-examples
* fixes
* run readme
* Use old way of getting certain variables
Co-authored-by: tmccrmck <tjmcc01@gmail.com>
* Update how-to-batch-score.sh (#436)
Fix batch asset path
* force add files (#458)
* endpoint script fix (#459)
* fix endpoints
* fix endpoints
* fix endpoint script
* fix endpoints
* edits for docs (#461)
* use
* exp name edits
* bash edits
* save
* save
* edit
* Get storage info from default datastore (#460)
* simplify hello world
* move RG (#463)
* switch credentials (#464)
* debug (#465)
* debug
* fix typo
* fjeiwpaol tefjpao
* Tweak to canary-flow shellscript (#466)
* Fix `test_green` snippet
`test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html
* Results of `python readme.py`
This seems erroneous to me, but...
* Changed endpoint name to my-new-endpoint per rest of script
* fixed typo in var name
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* Add managed identity shell script for online endpoint (#467)
* Online endpoint managed identities tests shell script
* Updated online endpoint managed identity shell scripts
Co-authored-by: pansav <pansav@microsoft.com>
* imperative flow (#469)
* .yaml -> .yml (#470)
* .yaml -> yml
* update python files
* edit
* Add get deployment logs to rest docs (#473)
* Add getLogs API call
* Fix typo
* minor edit to hello world (#474)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
- name: setup workspace
run: bash setup.sh
working-directory: cli
- name: scripts installs
run: sudo apt-get upgrade -y && sudo apt-get install uuid-runtime jq -y
- name: test script script
run: set -e; bash -x {script}.sh
working-directory: cli\n"""
# write workflow
with open(f"../.github/workflows/cli-scripts-{hyphenated}.yml", "w") as f:
* env to version 11
* readme edit
* try something
* remove redundant stuff
* version
* try downloading cifar
* try job status check
* debug bash
* fixes
* try stuff
* add back log upload
* fix bash?
* copy examples/endpoints from v2-preview
* run formatting
* add batch endpoint (#427)
* hack
* move compute create to setup.sh; --name to -n for workspace create
* NC6 -> NC12
* try randint sweep
* remove randint from sweep; download=False for pytorch-cifar
* comment out logs upload from worklow
* slim iris lightgbm code
* try something for cifar
* fix
* preprocess -> process
* mlflow.pytorch log model for cifar
* no mlflow, fix cifar
* run formatting
* relocate hello world
* whoops
* cifar epochs 20 -> 2
* try new env for pytorch gpu
* switch pytorch to mlflow logging
* cleanup cifar code
* fix
* write to ./model?
* edit batch endpoint asset path
* force add model.pkl
* copy workspace assets
* add some schemas, more needed
* add more schemas
* fix schemas
* Updating the schema link to CDN endpoint (#428)
* Update batch endpoint test (#429)
* update schemas
* fix more schemas
* simplify
* Revert "simplify"
This reverts commit 38ec77e0ca9b54051d36699573dea04c0abb583b.
* Update batch endpoint sample (#432)
* test python/cli refactor (#431)
* saving
* Update environment.yml (#430)
* ?
* testing
* testing
* testing
* cli fix
* fix
* saving
* saving
* whoops
* fix readme
* fix readme
* whoops
* minor readme edit
* last readme edits
* add setup workspace step
* fix cli tests (#433)
* test
* testing
* fixes
* save
* test
* fix how-to-manage-assets
* minor edit
* fix
* fix setup-workspace
* fix readme (#434)
* edits for docs (#437)
* fix readme
* remove sweep hack
* readme fix
* command => sweep in schema
* saving
* add back setup-workspace.sh
* ?
* sweep hack
* Saving
* edit readme asset logic
* -A
* minor edits for docs (#438)
* fix readme
* remove sweep hack
* readme fix
* command => sweep in schema
* saving
* add back setup-workspace.sh
* ?
* sweep hack
* Saving
* edit readme asset logic
* -A
* >- => |
* missed hello world
* remove old env
* blind copy
* yolo
* edits
* edits
* docs edits (#440)
* docs edits
* save
* remove upgrade
* fix
* reorder
* save (#442)
* edit training jobs (#443)
* saving
* minor edit
* fix world language? (#444)
* change tf horovod env (#445)
* change tf horovod env
* -
* move exp name
* standardize cli jobs (#446)
* test
* standardize training jobs
* online endpoints (#435)
* online endpoints
* run readme.py; manual edit workflows
* fix delete command
* fix
* fix endpoint
* rerun readme
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* quick fix (#448)
* run readme (#449)
* docs updates (#450)
* minor edit for docs (#451)
* docs updates
* quick fix
* undo sweep hack (#452)
* edits for docs (#453)
* add hello world back
* updates
* minor edits
* Update python-ml-basic-cpu.yml
* Add REST examples (#439)
* Add scripts without blob cli
* Everything working for endpoint except discrimnator issue with online deployment
* Working simple job flow
* Use default container for data
* Rename scripts, make wait_for_completion a function, fix online deployment
* Add scoring to endpoint
* Change structure of folders
* Remove todo
* Move rest to cli folder
* Switch to lightgbm from xgboost
* Fix wait_for_completion
* Get AZURE_STORAGE_KEY from az cli
* Change MIR region to westus
* Remove exports from variables
* Fix scoring the endpoint
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
* Fix closing tag of sample to match opening tag. (#455)
* updates for docs (#456)
* switch cli version back (#457)
* /
* tests for rest how-to (#454)
* run readme
* save
* test rest with azureml-examples
* fixes
* run readme
* Use old way of getting certain variables
Co-authored-by: tmccrmck <tjmcc01@gmail.com>
* Update how-to-batch-score.sh (#436)
Fix batch asset path
* force add files (#458)
* endpoint script fix (#459)
* fix endpoints
* fix endpoints
* fix endpoint script
* fix endpoints
* edits for docs (#461)
* use
* exp name edits
* bash edits
* save
* save
* edit
* Get storage info from default datastore (#460)
* simplify hello world
* move RG (#463)
* switch credentials (#464)
* debug (#465)
* debug
* fix typo
* fjeiwpaol tefjpao
* Tweak to canary-flow shellscript (#466)
* Fix `test_green` snippet
`test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html
* Results of `python readme.py`
This seems erroneous to me, but...
* Changed endpoint name to my-new-endpoint per rest of script
* fixed typo in var name
Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com>
* Add managed identity shell script for online endpoint (#467)
* Online endpoint managed identities tests shell script
* Updated online endpoint managed identity shell scripts
Co-authored-by: pansav <pansav@microsoft.com>
* imperative flow (#469)
* .yaml -> .yml (#470)
* .yaml -> yml
* update python files
* edit
* Add get deployment logs to rest docs (#473)
* Add getLogs API call
* Fix typo
* minor edit to hello world (#474)
* add -example to exp name
* edits for docs (#476)
* hacks
* order of operations
* edits
* increment model
* edits to batch
* fix readme
* reduce training times to 1 epoch
* Update setup.sh
* >- -> > (#477)
* update CLI version (#480)
* update CLI version
* try 0.71
* use 1.0.0a2
* Gopalv/tfserving sample (#479)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* Fixed deployment yaml for online endpoint managed identity example (#482)
Co-authored-by: pansav <pansav@microsoft.com>
* updates to cli-preview (#484)
* testing
* update dockerfiles
* swithc back to conda yml
* test
* ?
* ?
* dead snakes
* fix job
* remove non-python jobs
* test
* job descriptions
* asset descritptions
* docs
* save
* saveing
* Fixed online endpoint automation failures issues (#489)
* Fixed automation failures issues
* Removed try catch block from run method
* Fixed declarative yaml
* Updated docker image uri to Ubuntu 18.04
* Update system identity flow to use endpoint show to retrieve object id
Co-authored-by: pansav <pansav@microsoft.com>
* Add how-to-train-autorest with Python SDK (#481)
* Add autorest example
* Fixes for REST scripts
* Removed linux specific code in shell script (#492)
* Removed linux specific code in shell script
* Added delay after system identity is created and also changed secrets id
* Changed CLI version for managed identity tests
* Fixed declarative yaml file and upgraded to latest CLI version for managed identity
* Changed user identity place holder name
* Changed online endpoint schema url in yaml files
* Fixed endpoint delete command in declarative flow
* Fixed file path in declarative sample
Co-authored-by: pansav <pansav@microsoft.com>
* Fix how-to-deploy-script (#491)
* Fix how-to-deploy-script
* Use random versions for environmetns
* Don't use headers for async operations
* Use jq -r everywhere
* Print async operation error
* Try changing endpoint
* Add mlflow model file (#486)
* update batch sample (#493)
* Gopalv/triton sample (#490)
* WIP
* adding logic to extract ACR
* ran readme.py
* local testing works
* sed works
* parametrized, added local and remote test
* adding workflow file
* acr login failing
* adding sleep after docker run
* workflows
* updating yaml and script to exit properly
* adding deletion
* fixing typos
* making base endpoint more generic
* try after deletion
* renaming tfserving
* moved file to correct place
* works until MFE error
* not tested locally
* new CLI drop seems to work
* adding triton endpoint
* using anonymous environments
* adding back placeholder
* tfserving fails for some reason
* triton works
* get different acr
* install pyindex
* deleting model
* fixing model delete
* factoring cleanup utility
* adding exception handling for acr fail
* fixing syntax error
* making image url an argument
* using public image for tfserving
* add back tutorial test, rename
* manual modify of recently added workflow
* repin black; minor edit
* run black-nb
* fix typo
* test horovod env (#496)
* test horovod env
* revert to old environment
* try v2
* revert
* try oss horovod
* try tf image
* try horoovd image
* switch horovod env
* 1.0.0a2 -> 1.0.0a4
* update batch sample (#497)
* Removed min/max instances and changed auth_mode to key (#498)
Co-authored-by: pansav <pansav@microsoft.com>
* saving (#499)
* minor reqs edits
Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com>
Co-authored-by: Harneet Virk <harnvir@microsoft.com>
Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com>
Co-authored-by: trent <tjmcc01@gmail.com>
Co-authored-by: Trent McCormick <trmccorm@microsoft.com>
Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com>
Co-authored-by: Larry O'Brien <lobrien@knowing.net>
Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com>
Co-authored-by: pansav <pansav@microsoft.com>
Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
|
|
|
f.write(workflow_yaml)
|
|
|
|
|
|
|
|
|
|
|
|
# run functions
if __name__ == "__main__":
    # issue #146
    if "posix" not in os.name:
        print(
            "windows is not supported, see issue #146 (https://github.com/Azure/azureml-examples/issues/146)"
        )
        exit(1)

    # setup argparse
    parser = argparse.ArgumentParser()
    # argparse's type=bool treats any non-empty string (including "False") as True,
    # so parse the value explicitly
    parser.add_argument(
        "--check-readme",
        type=lambda s: s.strip().lower() in ("true", "1", "yes"),
        default=False,
    )
    args = parser.parse_args()

    # call main
    main(args)