Commit graph

25 commits

Author SHA1 Message Date
Zeliang Tian efe9b8cf33
List resource before validation to avoid not-found error (#3365)
* list resource before checking

* list resource before checking

---------

Co-authored-by: Ubuntu <zetia@DevBox-zetia.1jltvvkrfgyuhl3llhmbyldkog.bx.internal.cloudapp.net>
2024-09-12 11:23:55 +08:00
Zeliang Tian 6a36786a3f
remove unhealthy Arc cluster (#3337)
Co-authored-by: Ubuntu <zetia@DevBox-zetia.1jltvvkrfgyuhl3llhmbyldkog.bx.internal.cloudapp.net>
2024-08-19 18:04:01 -07:00
Zeliang Tian 92101a9728
remove unhealthy compute and aks (#3329)
* remove unhealthy compute and aks

* remove unhealthy compute and aks

---------

Co-authored-by: Ubuntu <zetia@DevBox-zetia.1jltvvkrfgyuhl3llhmbyldkog.bx.internal.cloudapp.net>
2024-08-05 10:06:12 +08:00
Zeliang Tian b2f58696de
Zetia/ensure k8s compute (#3265)
* add ensure_amlarc_compute function

* ensure k8s compute

* ensure k8s compute

---------

Co-authored-by: Ubuntu <zetia@DevBox-zetia.1jltvvkrfgyuhl3llhmbyldkog.bx.internal.cloudapp.net>
2024-07-06 16:51:02 +08:00
Zeliang Tian 735144d3fa
detach compute before attaching, as k8s compute doesn't support update (#3030)
* detach compute before attaching

* fix typo

* refine log

---------

Co-authored-by: Ubuntu <zetia@DevBox-zetia.1jltvvkrfgyuhl3llhmbyldkog.bx.internal.cloudapp.net>
2024-03-26 11:05:53 +08:00
Diondra d43ed2adc3
Fix readme validation failures (#3003)
* Add readme template to contributing.md

* Update exclusion list

* Add more paths to exclusion file

* Add debugging statements

* Wrap in try/except

* Wrap in try/except

* Run black to format

* Add sample to exclusions list
2024-02-14 20:17:50 -08:00
kdestin 50495339ce
ci: Fix 'file does not exist' when running check-readme (#3002)
* ci: Fix computation of repo_root in check-readme.py

    __file__ is a relative path for Python 3.5-3.8, but absolute
    in Python>=3.9

    This distinction causes us to calculate the incorrect path for the
    root of the repo:

      Path.parent will return '.' once you hit the "root" of a relative
      path (i.e. Path('./README.md').parent.parent == Path('.'))

* refactor: Rename working_directory to repo_root
2024-02-13 16:39:32 -05:00
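
The `Path.parent` behaviour described in the commit message above can be reproduced with a small sketch. The helper names, the `/repo/infra/scripts` layout, and the number of levels to climb are illustrative assumptions, not taken from `check-readme.py`; the sketch only shows why climbing parents of a relative path stalls at `.` while an absolute path keeps going up.

```python
from pathlib import PurePosixPath


def naive_root(script_path: str, levels_up: int) -> PurePosixPath:
    """Climb `levels_up` parents from the path exactly as given (hypothetical helper)."""
    path = PurePosixPath(script_path)
    for _ in range(levels_up):
        # For a relative path, .parent stops at '.' and never goes higher.
        path = path.parent
    return path


def anchored_root(script_path: str, cwd: str, levels_up: int) -> PurePosixPath:
    """Join the path onto an absolute working directory before climbing (hypothetical helper)."""
    path = PurePosixPath(cwd) / script_path
    for _ in range(levels_up):
        path = path.parent
    return path


# Relative path, as __file__ can be on older Python versions: stuck at '.'.
print(naive_root("check-readme.py", 3))
# Same script anchored to an assumed absolute working directory: reaches the repo root.
print(anchored_root("check-readme.py", "/repo/infra/scripts", 3))
```

In a real script the usual way to get the anchored form is `Path(__file__).resolve()`, which makes the path absolute before any `.parent` calls.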
XI JIN aed87ffbfc
Readme exclusion (#2988)
Co-authored-by: Xi Jin <jinxi@microsoft.com>
2024-02-06 12:49:06 -08:00
Rehaan Bhimani e9e8589b1b
add readme exclusions (#2984)
2024-02-05 19:01:43 -07:00
Rehaan Bhimani 7b222c7a4c
add readme exclusions (#2979)
2024-02-02 15:37:08 -08:00
Brynn Yin 124d806311
[Pipeline] Update deprecated image & add missing notebook to check readme ignore files (#2978)
* Update deprecated image

Signed-off-by: Brynn Yin <biyi@microsoft.com>

* Fix more images

Signed-off-by: Brynn Yin <biyi@microsoft.com>

* Use latest version of sklearn

Signed-off-by: Brynn Yin <biyi@microsoft.com>

* Trigger component register

Signed-off-by: Brynn Yin <biyi@microsoft.com>

* Update image

Signed-off-by: Brynn Yin <biyi@microsoft.com>

* Use fixed sklearn

Signed-off-by: Brynn Yin <biyi@microsoft.com>

---------

Signed-off-by: Brynn Yin <biyi@microsoft.com>
2024-02-02 20:41:29 +08:00
Diondra cec9cbb5ee
Add readme check (#2955)
* Add script to check README.md and README template

* Refactor check-readme.sh and add messages

* Add step to cli-assets-component-pipeline workflow to test

* Update workflow

* Update workflow

* Fix syntax error

* Update action

* Fix typo

* Add exclusion logic

* Try adding readme validation to readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Fix github workspace variable

* Fix github workspace variables for exclusion check

* Fix exclusion logic

* Update readme exclusions

* Revert accidental change

* Update readme_exclusions.txt

* Revert accidental change to working directory

* Update check-readme.sh

* Update exclusion logic and file name

* Add debug message

* Update exclusion logic and file name

* Fix working directory file names

* Update exclusion logic and file name

* Add debugging statements

* Remove debugging statements

* Update readme.py and regenerate cli workflows

* Update working-directory

* Update readme to add validate readme to sdk workflows

* Move templates folder inside infra

* Add validate readme check to tutorials and sdk/python files

* Fix readme.py and revert unintended changes

* Revert unnecessary changes and update readme_validation_exclusions.txt

* Update exclusions list

* Update template

* Update readme_validation_exclusions.txt

* Update exclusions

* Replace check-readme.sh with check-readme.py

* Update readme template

* Update check-readme.py with Kevin's suggestions

* Remove 2nd CLI argument

* Add debugging messages

* Add debugging messages

* Add debugging messages

* Update debugging messages

* Update debugging messages

* Strip whitespace

* Convert sample path to string

* Fix exclusions

* Remove debugging messages

* re-format with black

* Manually update one sample with logging logic to test

* Regenerate resolved notebooks

* Remove appinsights logging step

* Exclude /home/runner/work/azureml-examples/azureml-examples/cli/jobs/automl-standalone-jobs/cli-automl-forecasting-task-github-dau

* Revert changes to notebooks

* Add whitespace back
2024-01-30 11:45:35 -08:00
Diondra 1e212cda6a
Revert "Add readme validation step" (#2951)
2024-01-12 15:13:46 -08:00
Diondra a8431e1b9c
Add readme validation step (#2748)
* Add script to check README.md and README template

* Refactor check-readme.sh and add messages

* Add step to cli-assets-component-pipeline workflow to test

* Update workflow

* Update workflow

* Fix syntax error

* Update action

* Fix typo

* Add exclusion logic

* Try adding readme validation to readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Update workflows with readme.py

* Fix github workspace variable

* Fix github workspace variables for exclusion check

* Fix exclusion logic

* Update readme exclusions

* Revert accidental change

* Update readme_exclusions.txt

* Revert accidental change to working directory

* Update check-readme.sh

* Update exclusion logic and file name

* Add debug message

* Update exclusion logic and file name

* Fix working directory file names

* Update exclusion logic and file name

* Add debugging statements

* Remove debugging statements

* Update readme.py and regenerate cli workflows

* Update working-directory

* Update readme to add validate readme to sdk workflows

* Move templates folder inside infra

* Add validate readme check to tutorials and sdk/python files

* Fix readme.py and revert unintended changes

* Revert unnecessary changes and update readme_validation_exclusions.txt

* Update exclusions list

* Update template

* Update readme_validation_exclusions.txt

* Update exclusions

* Replace check-readme.sh with check-readme.py

* Update readme template

* Update check-readme.py with Kevin's suggestions

* Remove 2nd CLI argument

* Add debugging messages

* Add debugging messages

* Add debugging messages

* Update debugging messages

* Update debugging messages

* Strip whitespace

* Convert sample path to string

* Fix exclusions

* Remove debugging messages

* re-format with black

* Manually update one sample with logging logic to test

* Regenerate resolved notebooks

* Remove appinsights logging step

* Exclude /home/runner/work/azureml-examples/azureml-examples/cli/jobs/automl-standalone-jobs/cli-automl-forecasting-task-github-dau
2024-01-12 15:36:15 -05:00
jeff-shepherd 6947079b84
Added 2 hour delay before cleanup (#2937)
* Added 2 hour delay before cleanup to prevent resource from being deleted when they are still in use

* Removed duplicate cleanup script
2024-01-04 17:50:29 -08:00
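
One way to picture a delay like the one in the commit above is an age check that skips anything created within the last two hours. This is a minimal sketch under stated assumptions: the commit does not say how the delay is implemented, and the function name, the `created_at` timestamp, and the comparison logic are all hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumed grace period, matching the "2 hour delay" in the commit title.
CLEANUP_DELAY = timedelta(hours=2)


def is_eligible_for_cleanup(
    created_at: datetime,
    now: datetime,
    delay: timedelta = CLEANUP_DELAY,
) -> bool:
    """Return True only if the resource is at least `delay` old (hypothetical helper)."""
    return now - created_at >= delay


now = datetime(2024, 1, 4, 12, 0, tzinfo=timezone.utc)
recent = now - timedelta(minutes=30)   # possibly still in use, so it is kept
stale = now - timedelta(hours=3)       # older than the grace period

print(is_eligible_for_cleanup(recent, now))
print(is_eligible_for_cleanup(stale, now))
```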
kdestin 6eb55ad0ce
fix: Temporarily allow bootstrapping to proceed past acr permission granting error (#2912)
2023-12-12 20:17:05 -05:00
Aswin Nagarajan e0032242d5
Enable create new ws for OAI v2 and changed chat dataset format (#2854)
* enable create new ws for oai

* set env var

* export env var

* setting location

* setting location

* removed acr access

* removed all unnecessary verifications

* changed dataset format
2023-11-27 01:42:40 -08:00
Aswin Nagarajan d1ed139b9b
Changes to workflow for oai-v2 examples (#2726)
2023-10-12 22:47:30 -07:00
Aswin Nagarajan a796d3a70a
OAI v2 examples (#2710)
* oaiv2 sdk example

* modified workflows and notebooks

* removed all cell outputs

* removed old files

* changed cron param

* reverted cron param

* black format

* Added cli examples for new finetune pipeline component (#2711)

* Added cli examples for new finetune pipeline component

* Added screenshots for cli examples

* added and renamed cli workflows

* directory fixes

* Vvatsalya/fix cli oai v2 workflow (#2717)

* setting location as ncus

* set in setup-cli step

* new init and setup script for oai v2

* correcting syntax for init sh

* fix init oai v2 script

* fix

* fix 1

* fix 2

* change training dataset name in cli oai v2 example

* sdk workflow region change

* use oai as suffix in ws name

* added oai v1 and v2 in readme for cli and sdk

* adding install reqs for sdk

---------

Co-authored-by: Vishal Vatsalya <98515131+vvatsalya@users.noreply.github.com>
Co-authored-by: Vishal Vatsalya <vvatsalya@microsoft.com>
Co-authored-by: Ayush Mishra <61145377+novaturient95@users.noreply.github.com>
2023-10-10 11:28:03 -07:00
MaggieHust 516cef4430
update test (#2658)
Co-authored-by: Maggie Ma <fama@microsoft.com>
2023-09-21 22:02:23 -07:00
kdestin 35041db7f7
chore: Update service principal name in init_environment.sh (#2621)
2023-09-05 20:00:24 -04:00
jeff-shepherd df7837c1e0
Switched to new GPU SKU because NC6 is deprecated (#2462)
* Switched to new GPU SKU because NC6 is deprecated

* Updated credentials for remaining V1 notebooks

* Updated gpu-cluster in bootstrap.sh
2023-07-18 16:51:47 -07:00
jeff-shepherd cfd459666c
Add upload to datastore for sampledata (#2384)
* Add upload to datastore for sampledata

* Updated SKIP_AUTO_DELETE_TILL format to 4 digit year
2023-06-20 08:21:13 -07:00
kdestin 57ff370d0f
ci: refactor bootstrapping to avoid needing to invoke `apt get` (#2336)
* refactor: Stop invoking jq in infra scripts

* refactor: Remove some commented out code from infra

* refactor: Don't install jq

    Ubuntu runners come with it pre-installed:
        4fe7f6bc86/images/linux/Ubuntu2204-Readme.md

* refactor: Replace `az command | jq '.QUERY'` with `az command --query QUERY`

* refactor: Collapse `jq | jq | jq` into a single jq invocation

* refactor: Do not install xmlstarlet

    Seem to be entirely unused

* refactor: Don't install uuid-runtime and remove install_packages function
2023-05-30 16:13:54 -04:00
kdestin f87d6591d5
refactor: Reorganize infra folder (#2328)
* refactor: Move infra scripts to subdir

* refactor: Rewrite paths in sdk_helpers.sh

* refactor: Update ROOT_DIR

* refactor: Update paths in workflow generators

* refactor: Update path in doc comment

* refactor: Update workflows

* fix: Fix incorrect script path
2023-05-24 15:56:58 -04:00