azureml-examples/cli/readme.py

405 lines
11 KiB
Python

refactor for CLI 2.0 preview (#495) * scaffolding * fixes, might break * update contributing * fix * try 0.63 * fix readme * order of cells * fix env * uncomment lines * try a job * add job test * try miniconda image * try mcr image * -n -> --name for az ml workspace create * fix workspace create * same * isolate rg * fix filename * simply env; increment version * add mlflow * fix enviornment, increment version * switch to .sh * add workspace create * quick fix * fix typo * add asset creation * debugging * fix action * fix * typo * fix typo * workflow edits * fix? * edits * fix data? * Add more commands * try something * remove notebook * update * fix? * try data workaround * fix training script * add azureml-mlflow to env * saving * try sweep job * sweep edits * fix yaml * remove limits * testing * edits * minor edits to basic.yml * add exp name * saving * save; green? * move --web to untested.sh * hmm * add model assets * remove dsstore * saving w/ model create * add endpoints dir * try something for docs * try for docs * try anon data and assets * edit * path edit * fix asset path * move stuff around * try ./ * fix mistake * save * add mount * fix * change model to use run * remove model register * data hack * test * test registered file dataset * saving * testing * whoops * change strategy * fix name * switch rg back * switch creds * switch lightgbm back * fix ws * add file: * remove deepspeed * try tensorflow * remove tensorflow * edits for docs * add verify * unisntall -> remove * -0 * saving * add check_extensions * fix mistake * saving; might break * update manage-assets and test it * whoops * minor edits * local hello world * edits * simplify workflow * try compute job * ? * try compute type * fix hello world * fix? * major refactor * remove line * comments * save * save * order of operations * saving - run sweep job * magic * switch to data - possible typos * fix typos? * ... * try dedenting for sweep * edits * names * sweep hack for now * broke stuff? 
* test distributed * update query logic * add exp name, :1 to env * wait for job to complete * fix user error * env to version 11 * readme edit * try something * remove redundant stuff * version * try downloading cifar * try job status check * debug bash * fixes * try stuff * add back log upload * fix bash? * copy examples/endpoints from v2-preview * run formatting * add batch endpoint (#427) * hack * move compute create to setup.sh; --name to -n for workspace create * NC6 -> NC12 * try randint sweep * remove randint from sweep; download=False for pytorch-cifar * comment out logs upload from worklow * slim iris lightgbm code * try something for cifar * fix * preprocess -> process * mlflow.pytorch log model for cifar * no mlflow, fix cifar * run formatting * relocate hello world * whoops * cifar epochs 20 -> 2 * try new env for pytorch gpu * switch pytorch to mlflow logging * cleanup cifar code * fix * write to ./model? * edit batch endpoint asset path * force add model.pkl * copy workspace assets * add some schemas, more needed * add more schemas * fix schemas * Updating the schema link to CDN endpoint (#428) * Update batch endpoint test (#429) * update schemas * fix more schemas * simplify * Revert "simplify" This reverts commit 38ec77e0ca9b54051d36699573dea04c0abb583b. * Update batch endpoint sample (#432) * test python/cli refactor (#431) * saving * Update environment.yml (#430) * ? * testing * testing * testing * cli fix * fix * saving * saving * whoops * fix readme * fix readme * whoops * minor readme edit * last readme edits * add setup workspace step * fix cli tests (#433) * test * testing * fixes * save * test * fix how-to-manage-assets * minor edit * fix * fix setup-workspace * fix readme (#434) * edits for docs (#437) * fix readme * remove sweep hack * readme fix * command => sweep in schema * saving * add back setup-workspace.sh * ? 
* sweep hack * Saving * edit readme asset logic * -A * minor edits for docs (#438) * fix readme * remove sweep hack * readme fix * command => sweep in schema * saving * add back setup-workspace.sh * ? * sweep hack * Saving * edit readme asset logic * -A * >- => | * missed hello world * remove old env * blind copy * yolo * edits * edits * docs edits (#440) * docs edits * save * remove upgrade * fix * reorder * save (#442) * edit training jobs (#443) * saving * minor edit * fix world language? (#444) * change tf horovod env (#445) * change tf horovod env * - * move exp name * standardize cli jobs (#446) * test * standardize training jobs * online endpoints (#435) * online endpoints * run readme.py; manual edit workflows * fix delete command * fix * fix endpoint * rerun readme Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com> * quick fix (#448) * run readme (#449) * docs updates (#450) * minor edit for docs (#451) * docs updates * quick fix * undo sweep hack (#452) * edits for docs (#453) * add hello world back * updates * minor edits * Update python-ml-basic-cpu.yml * Add REST examples (#439) * Add scripts without blob cli * Everything working for endpoint except discrimnator issue with online deployment * Working simple job flow * Use default container for data * Rename scripts, make wait_for_completion a function, fix online deployment * Add scoring to endpoint * Change structure of folders * Remove todo * Move rest to cli folder * Switch to lightgbm from xgboost * Fix wait_for_completion * Get AZURE_STORAGE_KEY from az cli * Change MIR region to westus * Remove exports from variables * Fix scoring the endpoint Co-authored-by: Trent McCormick <trmccorm@microsoft.com> * Fix closing tag of sample to match opening tag. 
(#455) * updates for docs (#456) * switch cli version back (#457) * / * tests for rest how-to (#454) * run readme * save * test rest with azureml-examples * fixes * run readme * Use old way of getting certain variables Co-authored-by: tmccrmck <tjmcc01@gmail.com> * Update how-to-batch-score.sh (#436) Fix batch asset path * force add files (#458) * endpoint script fix (#459) * fix endpoints * fix endpoints * fix endpoint script * fix endpoints * edits for docs (#461) * use * exp name edits * bash edits * save * save * edit * Get storage info from default datastore (#460) * simplify hello world * move RG (#463) * switch credentials (#464) * debug (#465) * debug * fix typo * fjeiwpaol tefjpao * Tweak to canary-flow shellscript (#466) * Fix `test_green` snippet `test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html * Results of `python readme.py` This seems erroneous to me, but... 
* Changed endpoint name to my-new-endpoint per rest of script * fixed typo in var name Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com> * Add managed identity shell script for online endpoint (#467) * Online endpoint managed identities tests shell script * Updated online endpoint managed identity shell scripts Co-authored-by: pansav <pansav@microsoft.com> * imperative flow (#469) * .yaml -> .yml (#470) * .yaml -> yml * update python files * edit * Add get deployment logs to rest docs (#473) * Add getLogs API call * Fix typo * minor edit to hello world (#474) * add -example to exp name * edits for docs (#476) * hacks * order of operations * edits * increment model * edits to batch * fix readme * reduce training times to 1 epoch * Update setup.sh * >- -> > (#477) * update CLI version (#480) * update CLI version * try 0.71 * use 1.0.0a2 * Gopalv/tfserving sample (#479) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * Fixed deployment yaml for online endpoint managed identity example (#482) Co-authored-by: pansav <pansav@microsoft.com> * updates to cli-preview (#484) * testing * update dockerfiles * swithc back to conda yml * test * ? * ? 
* dead snakes * fix job * remove non-python jobs * test * job descriptions * asset descritptions * docs * save * saveing * Fixed online endpoint automation failures issues (#489) * Fixed automation failures issues * Removed try catch block from run method * Fixed declarative yaml * Updated docker image uri to Ubuntu 18.04 * Update system identity flow to use endpoint show to retrieve object id Co-authored-by: pansav <pansav@microsoft.com> * Add how-to-train-autorest with Python SDK (#481) * Add autorest example * Fixes for REST scripts * Removed linux specific code in shell script (#492) * Removed linux specific code in shell script * Added delay after system identity is created and also changed secrets id * Changed CLI version for managed identity tests * Fixed declarative yaml file and upgraded to latest CLI version for managed identity * Changed user identity place holder name * Changed online endpoint schema url in yaml files * Fixed endpoint delete command in declarative flow * Fixed file path in declarative sample Co-authored-by: pansav <pansav@microsoft.com> * Fix how-to-deploy-script (#491) * Fix how-to-deploy-script * Use random versions for environmetns * Don't use headers for async operations * Use jq -r everywhere * Print async operation error * Try changing endpoint * Add mlflow model file (#486) * update batch sample (#493) * Gopalv/triton sample (#490) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * updating yaml and script to exit properly * adding deletion * fixing typos * making base endpoint more generic * try after deletion * renaming tfserving * moved file to correct place * works until MFE error * not tested locally * new CLI drop seems to work * adding triton endpoint * using anonymous environments * adding back placeholder * tfserving fails for some reason * triton works * 
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
# imports
import os
import json
import glob
import argparse
# define constants
EXCLUDED_JOBS = ["cifar"]
EXCLUDED_ENDPOINTS = ["conda.yml", "environment.yml", "batch", "online"]
EXCLUDED_ASSETS = [
    "conda.yml",
    "environment.yml",
    "conda-envs",
    "mlflow-models",
    "workspace",
]
EXCLUDED_SCRIPTS = ["setup", "cleanup"]
# define functions
def main(args):
    # get list of jobs
    jobs = sorted(glob.glob("jobs/**/*job*.yml", recursive=True))
    jobs += sorted(glob.glob("jobs/*.yml", recursive=False))
    jobs = [
        job.replace(".yml", "")
        for job in jobs
        if not any(excluded in job for excluded in EXCLUDED_JOBS)
    ]
    # get list of endpoints
    endpoints = sorted(glob.glob("endpoints/**/*.yml", recursive=True))
    endpoints = [
        endpoint.replace(".yml", "")
        for endpoint in endpoints
        if not any(excluded in endpoint for excluded in EXCLUDED_ENDPOINTS)
    ]
    # get list of assets
    assets = sorted(glob.glob("assets/**/*.yml", recursive=True))
    assets = [
        asset.replace(".yml", "")
        for asset in assets
        if not any(excluded in asset for excluded in EXCLUDED_ASSETS)
    ]
    # get list of scripts
    scripts = sorted(glob.glob("*.sh", recursive=False))
    scripts = [
        script.replace(".sh", "")
        for script in scripts
        if not any(excluded in script for excluded in EXCLUDED_SCRIPTS)
    ]
    # write workflows
    write_workflows(jobs, endpoints, assets, scripts)
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
    # read existing README.md
    with open("README.md", "r") as f:
        readme_before = f.read()

    # write README.md
    write_readme(jobs, endpoints, assets, scripts)
    # read modified README.md
    with open("README.md", "r") as f:
        readme_after = f.read()

    # check if readme matches
    if args.check_readme:
        if not check_readme(readme_before, readme_after):
            print("README.md file did not match...")
            exit(2)

def write_readme(jobs, endpoints, assets, scripts):
    # read in prefix.md and suffix.md
    with open("prefix.md", "r") as f:
        prefix = f.read()
    with open("suffix.md", "r") as f:
        suffix = f.read()

    # define markdown tables
    jobs_table = "\n**Jobs** ([jobs](jobs))\n\npath|status|description\n-|-|-\n"
    endpoints_table = (
        "\n**Endpoints** ([endpoints](endpoints))\n\npath|status|description\n-|-|-\n"
    )
    assets_table = "\n**Assets** ([assets](assets))\n\npath|status|description\n-|-|-\n"
    scripts_table = "\n**Scripts**\n\npath|status|\n-|-\n"
* dead snakes * fix job * remove non-python jobs * test * job descriptions * asset descritptions * docs * save * saveing * Fixed online endpoint automation failures issues (#489) * Fixed automation failures issues * Removed try catch block from run method * Fixed declarative yaml * Updated docker image uri to Ubuntu 18.04 * Update system identity flow to use endpoint show to retrieve object id Co-authored-by: pansav <pansav@microsoft.com> * Add how-to-train-autorest with Python SDK (#481) * Add autorest example * Fixes for REST scripts * Removed linux specific code in shell script (#492) * Removed linux specific code in shell script * Added delay after system identity is created and also changed secrets id * Changed CLI version for managed identity tests * Fixed declarative yaml file and upgraded to latest CLI version for managed identity * Changed user identity place holder name * Changed online endpoint schema url in yaml files * Fixed endpoint delete command in declarative flow * Fixed file path in declarative sample Co-authored-by: pansav <pansav@microsoft.com> * Fix how-to-deploy-script (#491) * Fix how-to-deploy-script * Use random versions for environmetns * Don't use headers for async operations * Use jq -r everywhere * Print async operation error * Try changing endpoint * Add mlflow model file (#486) * update batch sample (#493) * Gopalv/triton sample (#490) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * updating yaml and script to exit properly * adding deletion * fixing typos * making base endpoint more generic * try after deletion * renaming tfserving * moved file to correct place * works until MFE error * not tested locally * new CLI drop seems to work * adding triton endpoint * using anonymous environments * adding back placeholder * tfserving fails for some reason * triton works * 
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
# process jobs
for job in jobs:
    # build entries for tutorial table
    status = f"[![{job}](https://github.com/Azure/azureml-examples/workflows/cli-{job.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{job.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{job}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # split once so a description containing ": " is kept whole
                    description = line.split(": ", 1)[1].strip()
                    break
    except Exception:
        # keep the placeholder if the YAML file is missing or unreadable
        pass
    # add row to tutorial table
    row = f"[{job}.yml]({job}.yml)|{status}|{description}\n"
    jobs_table += row
# process endpoints
for endpoint in endpoints:
    # build entries for tutorial table
    status = f"[![{endpoint}](https://github.com/Azure/azureml-examples/workflows/cli-{endpoint.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{endpoint.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{endpoint}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # split once so a description containing ": " is kept whole
                    description = line.split(": ", 1)[1].strip()
                    break
    except Exception:
        # keep the placeholder if the YAML file is missing or unreadable
        pass
    # add row to tutorial table
    row = f"[{endpoint}.yml]({endpoint}.yml)|{status}|{description}\n"
    endpoints_table += row
# process assets
for asset in assets:
    # build entries for tutorial table
    status = f"[![{asset}](https://github.com/Azure/azureml-examples/workflows/cli-{asset.replace('/', '-')}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-{asset.replace('/', '-')})"
    description = "*no description*"
    try:
        with open(f"{asset}.yml", "r") as f:
            for line in f:
                if "description: " in line:
                    # split once so a description containing ": " is kept whole
                    description = line.split(": ", 1)[1].strip()
                    break
    except Exception:
        # keep the placeholder if the YAML file is missing or unreadable
        pass
    # add row to tutorial table
    row = f"[{asset}.yml]({asset}.yml)|{status}|{description}\n"
    assets_table += row
# process scripts
for script in scripts:
    # build entries for tutorial table
    status = f"[![{script}](https://github.com/Azure/azureml-examples/workflows/cli-scripts-{script}/badge.svg)](https://github.com/Azure/azureml-examples/actions?query=workflow%3Acli-scripts-{script})"
    link = f"https://scripts.microsoft.com/azure/machine-learning/{script}"
    # add row to tutorial table
    row = f"[{script}.sh]({script}.sh)|{status}\n"
    scripts_table += row
    # write README.md
    print("writing README.md...")
    with open("README.md", "w") as f:
        f.write(
            prefix
            + scripts_table
            + jobs_table
            + endpoints_table
            + assets_table
            + suffix
        )


def write_workflows(jobs, endpoints, assets, scripts):
    print("writing .github/workflows...")

    # process jobs
    for job in jobs:
        # write workflow file
        write_job_workflow(job)

    # process endpoints
    for endpoint in endpoints:
        # write workflow file
        # write_endpoint_workflow(endpoint)
        pass

    # process assets
    for asset in assets:
        # write workflow file
        write_asset_workflow(asset)

    # process scripts
    for script in scripts:
        # write workflow file
        write_script_workflow(script)


def check_readme(before, after):
    return before == after


def parse_path(path):
    filename = None
    project_dir = None
    hyphenated = None
    try:
        filename = path.split("/")[-1]
    except:
        pass
    try:
        project_dir = "/".join(path.split("/")[:-1])
    except:
        pass
    try:
        hyphenated = path.replace("/", "-")
    except:
        pass

    return filename, project_dir, hyphenated
def write_job_workflow(job):
filename, project_dir, hyphenated = parse_path(job)
creds = "${{secrets.AZ_AE_CREDS}}"
refactor for CLI 2.0 preview (#495) * scaffolding * fixes, might break * update contributing * fix * try 0.63 * fix readme * order of cells * fix env * uncomment lines * try a job * add job test * try miniconda image * try mcr image * -n -> --name for az ml workspace create * fix workspace create * same * isolate rg * fix filename * simply env; increment version * add mlflow * fix enviornment, increment version * switch to .sh * add workspace create * quick fix * fix typo * add asset creation * debugging * fix action * fix * typo * fix typo * workflow edits * fix? * edits * fix data? * Add more commands * try something * remove notebook * update * fix? * try data workaround * fix training script * add azureml-mlflow to env * saving * try sweep job * sweep edits * fix yaml * remove limits * testing * edits * minor edits to basic.yml * add exp name * saving * save; green? * move --web to untested.sh * hmm * add model assets * remove dsstore * saving w/ model create * add endpoints dir * try something for docs * try for docs * try anon data and assets * edit * path edit * fix asset path * move stuff around * try ./ * fix mistake * save * add mount * fix * change model to use run * remove model register * data hack * test * test registered file dataset * saving * testing * whoops * change strategy * fix name * switch rg back * switch creds * switch lightgbm back * fix ws * add file: * remove deepspeed * try tensorflow * remove tensorflow * edits for docs * add verify * unisntall -> remove * -0 * saving * add check_extensions * fix mistake * saving; might break * update manage-assets and test it * whoops * minor edits * local hello world * edits * simplify workflow * try compute job * ? * try compute type * fix hello world * fix? * major refactor * remove line * comments * save * save * order of operations * saving - run sweep job * magic * switch to data - possible typos * fix typos? * ... * try dedenting for sweep * edits * names * sweep hack for now * broke stuff? 
* test distributed * update query logic * add exp name, :1 to env * wait for job to complete * fix user error * env to version 11 * readme edit * try something * remove redundant stuff * version * try downloading cifar * try job status check * debug bash * fixes * try stuff * add back log upload * fix bash? * copy examples/endpoints from v2-preview * run formatting * add batch endpoint (#427) * hack * move compute create to setup.sh; --name to -n for workspace create * NC6 -> NC12 * try randint sweep * remove randint from sweep; download=False for pytorch-cifar * comment out logs upload from worklow * slim iris lightgbm code * try something for cifar * fix * preprocess -> process * mlflow.pytorch log model for cifar * no mlflow, fix cifar * run formatting * relocate hello world * whoops * cifar epochs 20 -> 2 * try new env for pytorch gpu * switch pytorch to mlflow logging * cleanup cifar code * fix * write to ./model? * edit batch endpoint asset path * force add model.pkl * copy workspace assets * add some schemas, more needed * add more schemas * fix schemas * Updating the schema link to CDN endpoint (#428) * Update batch endpoint test (#429) * update schemas * fix more schemas * simplify * Revert "simplify" This reverts commit 38ec77e0ca9b54051d36699573dea04c0abb583b. * Update batch endpoint sample (#432) * test python/cli refactor (#431) * saving * Update environment.yml (#430) * ? * testing * testing * testing * cli fix * fix * saving * saving * whoops * fix readme * fix readme * whoops * minor readme edit * last readme edits * add setup workspace step * fix cli tests (#433) * test * testing * fixes * save * test * fix how-to-manage-assets * minor edit * fix * fix setup-workspace * fix readme (#434) * edits for docs (#437) * fix readme * remove sweep hack * readme fix * command => sweep in schema * saving * add back setup-workspace.sh * ? 
* sweep hack * Saving * edit readme asset logic * -A * minor edits for docs (#438) * fix readme * remove sweep hack * readme fix * command => sweep in schema * saving * add back setup-workspace.sh * ? * sweep hack * Saving * edit readme asset logic * -A * >- => | * missed hello world * remove old env * blind copy * yolo * edits * edits * docs edits (#440) * docs edits * save * remove upgrade * fix * reorder * save (#442) * edit training jobs (#443) * saving * minor edit * fix world language? (#444) * change tf horovod env (#445) * change tf horovod env * - * move exp name * standardize cli jobs (#446) * test * standardize training jobs * online endpoints (#435) * online endpoints * run readme.py; manual edit workflows * fix delete command * fix * fix endpoint * rerun readme Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com> * quick fix (#448) * run readme (#449) * docs updates (#450) * minor edit for docs (#451) * docs updates * quick fix * undo sweep hack (#452) * edits for docs (#453) * add hello world back * updates * minor edits * Update python-ml-basic-cpu.yml * Add REST examples (#439) * Add scripts without blob cli * Everything working for endpoint except discrimnator issue with online deployment * Working simple job flow * Use default container for data * Rename scripts, make wait_for_completion a function, fix online deployment * Add scoring to endpoint * Change structure of folders * Remove todo * Move rest to cli folder * Switch to lightgbm from xgboost * Fix wait_for_completion * Get AZURE_STORAGE_KEY from az cli * Change MIR region to westus * Remove exports from variables * Fix scoring the endpoint Co-authored-by: Trent McCormick <trmccorm@microsoft.com> * Fix closing tag of sample to match opening tag. 
(#455) * updates for docs (#456) * switch cli version back (#457) * / * tests for rest how-to (#454) * run readme * save * test rest with azureml-examples * fixes * run readme * Use old way of getting certain variables Co-authored-by: tmccrmck <tjmcc01@gmail.com> * Update how-to-batch-score.sh (#436) Fix batch asset path * force add files (#458) * endpoint script fix (#459) * fix endpoints * fix endpoints * fix endpoint script * fix endpoints * edits for docs (#461) * use * exp name edits * bash edits * save * save * edit * Get storage info from default datastore (#460) * simplify hello world * move RG (#463) * switch credentials (#464) * debug (#465) * debug * fix typo * fjeiwpaol tefjpao * Tweak to canary-flow shellscript (#466) * Fix `test_green` snippet `test_green` snippet was dupe of `green_10pct_traffic`. Switched to syntax described in https://azure.github.io/azureml-v2-preview/_build/html/overview/concepts/endpoints/online-endpoint/scenarios/declarative-flow.html * Results of `python readme.py` This seems erroneous to me, but... 
* Changed endpoint name to my-new-endpoint per rest of script * fixed typo in var name Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com> * Add managed identity shell script for online endpoint (#467) * Online endpoint managed identities tests shell script * Updated online endpoint managed identity shell scripts Co-authored-by: pansav <pansav@microsoft.com> * imperative flow (#469) * .yaml -> .yml (#470) * .yaml -> yml * update python files * edit * Add get deployment logs to rest docs (#473) * Add getLogs API call * Fix typo * minor edit to hello world (#474) * add -example to exp name * edits for docs (#476) * hacks * order of operations * edits * increment model * edits to batch * fix readme * reduce training times to 1 epoch * Update setup.sh * >- -> > (#477) * update CLI version (#480) * update CLI version * try 0.71 * use 1.0.0a2 * Gopalv/tfserving sample (#479) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * Fixed deployment yaml for online endpoint managed identity example (#482) Co-authored-by: pansav <pansav@microsoft.com> * updates to cli-preview (#484) * testing * update dockerfiles * swithc back to conda yml * test * ? * ? 
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{project_dir}/**
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup
      run: bash setup.sh
      working-directory: cli
      continue-on-error: true
    - name: create job
      run: |
        run_id=$(az ml job create -f {job}.yml --query name -o tsv)
        az ml job stream -n $run_id
        status=$(az ml job show -n $run_id --query status -o tsv)
        echo $status
        if [[ $status == "Completed" ]]
        then
          echo "Job completed"
        elif [[ $status == "Failed" ]]
        then
          echo "Job failed"
          exit 1
        else
          echo "Job status not failed or completed"
          exit 2
        fi
      working-directory: cli\n"""
    # write workflow
    with open(f"../.github/workflows/cli-{job.replace('/', '-')}.yml", "w") as f:
        f.write(workflow_yaml)


def write_endpoint_workflow(endpoint):
    filename, project_dir, hyphenated = parse_path(endpoint)
    creds = "${{secrets.AZ_AE_CREDS}}"
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{project_dir}/**
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup
      run: bash setup.sh
      working-directory: cli
      continue-on-error: true
    - name: create endpoint
      run: az ml endpoint create -f {endpoint}.yml
      working-directory: cli\n"""

    # write workflow
    with open(f"../.github/workflows/cli-{hyphenated}.yml", "w") as f:
        f.write(workflow_yaml)


def write_asset_workflow(asset):
    filename, project_dir, hyphenated = parse_path(asset)
    creds = "${{secrets.AZ_AE_CREDS}}"
* dead snakes * fix job * remove non-python jobs * test * job descriptions * asset descritptions * docs * save * saveing * Fixed online endpoint automation failures issues (#489) * Fixed automation failures issues * Removed try catch block from run method * Fixed declarative yaml * Updated docker image uri to Ubuntu 18.04 * Update system identity flow to use endpoint show to retrieve object id Co-authored-by: pansav <pansav@microsoft.com> * Add how-to-train-autorest with Python SDK (#481) * Add autorest example * Fixes for REST scripts * Removed linux specific code in shell script (#492) * Removed linux specific code in shell script * Added delay after system identity is created and also changed secrets id * Changed CLI version for managed identity tests * Fixed declarative yaml file and upgraded to latest CLI version for managed identity * Changed user identity place holder name * Changed online endpoint schema url in yaml files * Fixed endpoint delete command in declarative flow * Fixed file path in declarative sample Co-authored-by: pansav <pansav@microsoft.com> * Fix how-to-deploy-script (#491) * Fix how-to-deploy-script * Use random versions for environmetns * Don't use headers for async operations * Use jq -r everywhere * Print async operation error * Try changing endpoint * Add mlflow model file (#486) * update batch sample (#493) * Gopalv/triton sample (#490) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * updating yaml and script to exit properly * adding deletion * fixing typos * making base endpoint more generic * try after deletion * renaming tfserving * moved file to correct place * works until MFE error * not tested locally * new CLI drop seems to work * adding triton endpoint * using anonymous environments * adding back placeholder * tfserving fails for some reason * triton works * 
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
    workflow_yaml = f"""name: cli-{hyphenated}
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{asset}.yml
      - .github/workflows/cli-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup
      run: bash setup.sh
      working-directory: cli
      continue-on-error: true
    - name: create asset
      run: az ml {asset.split('/')[1]} create -f {asset}.yml
      working-directory: cli\n"""

    # write workflow
    with open(f"../.github/workflows/cli-{hyphenated}.yml", "w") as f:
        f.write(workflow_yaml)


def write_script_workflow(script):
    filename, project_dir, hyphenated = parse_path(script)
    creds = "${{secrets.AZ_AE_CREDS}}"
    workflow_yaml = f"""name: cli-scripts-{hyphenated}
* Changed endpoint name to my-new-endpoint per rest of script * fixed typo in var name Co-authored-by: Cody <54814569+lostmygithubaccount@users.noreply.github.com> * Add managed identity shell script for online endpoint (#467) * Online endpoint managed identities tests shell script * Updated online endpoint managed identity shell scripts Co-authored-by: pansav <pansav@microsoft.com> * imperative flow (#469) * .yaml -> .yml (#470) * .yaml -> yml * update python files * edit * Add get deployment logs to rest docs (#473) * Add getLogs API call * Fix typo * minor edit to hello world (#474) * add -example to exp name * edits for docs (#476) * hacks * order of operations * edits * increment model * edits to batch * fix readme * reduce training times to 1 epoch * Update setup.sh * >- -> > (#477) * update CLI version (#480) * update CLI version * try 0.71 * use 1.0.0a2 * Gopalv/tfserving sample (#479) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * Fixed deployment yaml for online endpoint managed identity example (#482) Co-authored-by: pansav <pansav@microsoft.com> * updates to cli-preview (#484) * testing * update dockerfiles * swithc back to conda yml * test * ? * ? 
* dead snakes * fix job * remove non-python jobs * test * job descriptions * asset descritptions * docs * save * saveing * Fixed online endpoint automation failures issues (#489) * Fixed automation failures issues * Removed try catch block from run method * Fixed declarative yaml * Updated docker image uri to Ubuntu 18.04 * Update system identity flow to use endpoint show to retrieve object id Co-authored-by: pansav <pansav@microsoft.com> * Add how-to-train-autorest with Python SDK (#481) * Add autorest example * Fixes for REST scripts * Removed linux specific code in shell script (#492) * Removed linux specific code in shell script * Added delay after system identity is created and also changed secrets id * Changed CLI version for managed identity tests * Fixed declarative yaml file and upgraded to latest CLI version for managed identity * Changed user identity place holder name * Changed online endpoint schema url in yaml files * Fixed endpoint delete command in declarative flow * Fixed file path in declarative sample Co-authored-by: pansav <pansav@microsoft.com> * Fix how-to-deploy-script (#491) * Fix how-to-deploy-script * Use random versions for environmetns * Don't use headers for async operations * Use jq -r everywhere * Print async operation error * Try changing endpoint * Add mlflow model file (#486) * update batch sample (#493) * Gopalv/triton sample (#490) * WIP * adding logic to extract ACR * ran readme.py * local testing works * sed works * parametrized, added local and remote test * adding workflow file * acr login failing * adding sleep after docker run * workflows * updating yaml and script to exit properly * adding deletion * fixing typos * making base endpoint more generic * try after deletion * renaming tfserving * moved file to correct place * works until MFE error * not tested locally * new CLI drop seems to work * adding triton endpoint * using anonymous environments * adding back placeholder * tfserving fails for some reason * triton works * 
get different acr * install pyindex * deleting model * fixing model delete * factoring cleanup utility * adding exception handling for acr fail * fixing syntax error * making image url an argument * using public image for tfserving * add back tutorial test, rename * manual modify of recently added workflow * repin black; minor edit * run black-nb * fix typo * test horovod env (#496) * test horovod env * revert to old environment * try v2 * revert * try oss horovod * try tf image * try horoovd image * switch horovod env * 1.0.0a2 -> 1.0.0a4 * update batch sample (#497) * Removed min/max instances and changed auth_mode to key (#498) Co-authored-by: pansav <pansav@microsoft.com> * saving (#499) * minor reqs edits Co-authored-by: Tracy Chen <47156502+tracychms@users.noreply.github.com> Co-authored-by: Harneet Virk <harnvir@microsoft.com> Co-authored-by: Sethu Raman <rsethur@users.noreply.github.com> Co-authored-by: trent <tjmcc01@gmail.com> Co-authored-by: Trent McCormick <trmccorm@microsoft.com> Co-authored-by: vmagelo <51368918+vmagelo@users.noreply.github.com> Co-authored-by: Larry O'Brien <lobrien@knowing.net> Co-authored-by: psavdekar <45885005+psavdekar@users.noreply.github.com> Co-authored-by: pansav <pansav@microsoft.com> Co-authored-by: Gopal Vashishtha <gopalv@microsoft.com>
2021-05-12 04:53:32 +03:00
on:
  schedule:
    - cron: "0 0/4 * * *"
  pull_request:
    branches:
      - main
      - cli-preview
    paths:
      - cli/{script}.sh
      - .github/workflows/cli-scripts-{hyphenated}.yml
jobs:
  build:
    runs-on: ubuntu-latest
    steps:
    - name: check out repo
      uses: actions/checkout@v2
    - name: azure login
      uses: azure/login@v1
      with:
        creds: {creds}
    - name: install ml cli
      run: az extension add -n ml -y
    - name: setup
      run: bash setup.sh
      working-directory: cli
      continue-on-error: true
    - name: scripts installs
      run: sudo apt-get upgrade -y && sudo apt-get install uuid-runtime jq -y
    - name: test script
      run: set -e; bash -x {script}.sh
      working-directory: cli\n"""
    # write workflow
    with open(f"../.github/workflows/cli-scripts-{hyphenated}.yml", "w") as f:
        f.write(workflow_yaml)
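# run functions
if __name__ == "__main__":
    # issue #146: only POSIX systems are supported
    if os.name != "posix":
        print(
            "Windows is not supported, see issue #146 (https://github.com/Azure/azureml-examples/issues/146)"
        )
        exit(1)

    # setup argparse
    parser = argparse.ArgumentParser()
    # type=bool would treat any non-empty string (even "False") as True,
    # so convert the text of the value explicitly
    parser.add_argument(
        "--check-readme",
        type=lambda value: value.lower() in ("true", "1", "yes"),
        default=False,
    )
    args = parser.parse_args()

    # call main
    main(args)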
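
A boolean option such as `--check-readme` can be parsed in more than one way, and the choice changes what `--check-readme False` means. The sketch below is illustrative only and is not part of `readme.py`. The helper names `str_to_bool` and `parse_check_readme` are hypothetical. It compares `type=bool`, which calls `bool()` on the raw string, with an explicit converter.

```python
import argparse


def str_to_bool(value: str) -> bool:
    """Hypothetical helper: map common true/false spellings to a bool."""
    lowered = value.strip().lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_check_readme(argv, converter):
    """Parse --check-readme from argv using the given type converter."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--check-readme", type=converter, default=False)
    return parser.parse_args(argv).check_readme


# bool("False") is True because the string is non-empty
naive = parse_check_readme(["--check-readme", "False"], bool)
# the explicit converter reads the text of the value instead
strict = parse_check_readme(["--check-readme", "False"], str_to_bool)
print(naive, strict)  # prints: True False
```

Another option is `action="store_true"`, which turns the option into a bare flag. That changes the command-line interface, though, since callers could no longer pass a value after the flag.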