Граф коммитов

8 Коммитов

Автор SHA1 Сообщение Дата
Shruthi42 aa09b9db31
Register all models after training, not only Segmentation models. (#455)
This PR changes the codepath so all models trained on AzureML are registered. The codepath previously allowed only segmentation models (subclasses of `SegmentationModelBase`) to be registered. Models are registered after a training run or if the `only_register_model` flag is set. Models may be legacy InnerEye config-based models or may be defined using the LightningContainer class.

The PR also removes the AzureRunner conda environment. The full InnerEye conda environment is needed to submit a training job to AzureML.

It splits the `TrainHelloWorldAndHelloContainer` job in the PR build into two jobs, `TrainHelloWorld` and `TrainHelloContainer`. It adds a pytest marker `after_training_hello_container` for tests that can be run after training is finished in the `TrainHelloContainer` job.

This will solve the issue of model registration in #377 and #398.
2021-05-12 15:03:35 +01:00
Anton Schwaighofer c298155753
Fixing bugs when running container models on multiple GPUs (#445)
- The use_gpu flag for container models was not picked up correctly, always running without GPU
- When running inference for container models with the test_step method, PL would fail when running on >1 GPU
- Adds an extra test to run the HelloContainer model in AzureML
2021-04-23 17:15:37 +01:00
Anton Schwaighofer 3fa74c2a2c
Remove pre-processing of source version message (#356)
Pre-processing of source code message causes problems when those contains shell special characters. Remove and rely on git package to pick those up
2021-01-11 15:50:26 +00:00
Anton Schwaighofer 1e86bfd008
Ensure that PR builds fail on any job errors, fix component governance (#317)
- The "TrainViaSubmodule" step presently only fails if the last python call fails. Fix that.
- Component Governance was accidentally disabled in #290
2020-11-16 16:27:10 +00:00
Anton Schwaighofer 7a98d4d62d
Add user alias for notifications, add max run duration (#271)
- Supply user alias and/or email address, so that notifications can be sent
- Add argument for maximum run duration, limit PR build to 1h
2020-10-06 13:50:16 +01:00
Anton Schwaighofer 3e8b92d0f1
Shorten the most frequent commandline options, rename settings file (#232)
Rename commandline options: --submit_to_azureml -> --azureml, --is_train -> --train, --gpu_cluster_name -> --cluster
Rename train_variables.yml -> settings.yml
2020-09-21 17:40:05 +01:00
David Carter a372f49e13
Sanitize source version message (#182)
* Sanitize source version message

* Sanitize source version message

* Syntax

* Allow longer build source message

Co-authored-by: Shruthi42 <13177030+Shruthi42@users.noreply.github.com>
2020-08-28 16:08:14 +01:00
Shruthi42 d6a3d73ccf Add source code 2020-07-29 00:30:35 +05:30