Граф коммитов

558 Коммитов

Автор SHA1 Сообщение Дата
Fernando Pérez-García 8a523e34ab
MNT: Replace print with logging.info (#623) 2022-10-06 16:05:43 +01:00
Fernando Pérez-García fbc35add32
MNT: Fix YAML syntax for Amulet jobs (#624) 2022-10-06 12:51:42 +01:00
Kenza Bouzid e94ffcd7af
BUG: Fix double root folder for heatmaps (#622)
Add kwargs to avoid confusion and remove double heatmap_folder
2022-10-06 10:55:48 +01:00
Kenza Bouzid fce049b4a2
ENH: Add Loss Outliers Analysis callback (#618)
Add LossAnalysisCallback to plot loss heatmaps + scatter plots and save
loss ranks and their statistics.

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2022-10-05 13:00:39 +01:00
Kenza Bouzid 84a16f48c2
ENH: Set additional tags to Amulet Jobs (#620)
Set additional tags to Amulet jobs beside amlt-expr-name.
2022-10-04 09:25:00 +01:00
Anton Schwaighofer 23aec049ac
MNT: Update test helpers to work with latest AML package (#621)
Use newer API for blob deletion
2022-10-04 08:48:53 +01:00
Anton Schwaighofer f5202e4476
MNT: Relax package constraints for AzureML libraries (#619)
Relax version constraints to from ==1.43.0 to >=1.42.0. Also relax
constraints for protobuf to <4.0
This breaks cleanup code of 4 tests, which will be fixed in a follow-up PR
2022-10-03 17:22:24 +01:00
Anton Schwaighofer 108ffae42a
ENH: Improve Amulet support and documentation (#615)
Merge #612 from main:
- Move all Amulet-related code to a separate module
- Add bug fix for `RANK` environment to `hi-ml` runner
- Improve documentation and add standalone example script
2022-09-27 14:21:53 +01:00
Anton Schwaighofer 681e4281d0
ENH: Improve Amulet support and documentation (#612)
- Move all Amulet-related code to a separate module
- Add bug fix for `RANK` environment to `hi-ml` runner
- Improve documentation and add standalone example script
2022-09-26 15:29:18 +00:00
Kenza Bouzid d385f331a9
ENH: Refactor extra validation epoch logic (#608)
Avoid accessing sensitive attributes from the runner
Add on_extra_val_epoch_start hook to avoid confusion
2022-09-23 09:26:34 +01:00
Fernando Pérez-García 698903924f
MNT: Bump hi-ml-multimodal version: 0.1.0 → 0.1.1 (#610)
Bump hi-ml-multimodal version: 0.1.0 → 0.1.1
2022-09-16 16:22:03 +01:00
Fernando Pérez-García 46a8d39a23
ENH: Return figure when plotting similarity map (#600)
Return figure when plotting similarity map

This is helpful when using e.g. Streamlit to capture and display the plotted figure.
2022-09-16 12:54:04 +01:00
Ozan Oktay 9ccb6474ae
ENH: Support global similarity between text and images (#606)
* add global similarity comparison between text and image inputs

* rename the file to address pytest issue

* update method naming -- PR comment

* add support for multi prompt similarities

* update tests

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* update function naming

* Improve method docstring

* fix the issue with normalising embeddings twice

* add initial version of zero-shot classification

* finalise the zero-shot classification test

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Fernando Pérez-García <fperezgarcia@microsoft.com>
2022-09-16 12:48:05 +01:00
Kenza Bouzid ad59214ed1
ENH: Make crossval html report generic to all hyperdrive runs (#605)
* Add commandline arguments to parametrise child runs arg identifier and primary metric to get best_epoch.
* Rename all crossval functions to hyperdrive ones to make it generic
2022-09-16 09:09:42 +01:00
Kenza Bouzid 133c994fbd
ENH: Enable adding tags to aml runs from LightningContainer (#603)
Add get_additional_aml_run_tag to LightningContainer to be able to add extra tags to aml runs to facilitate runs comparison and filtering.
2022-09-14 11:39:32 +01:00
Kenza Bouzid 71066ed7df
BUG: Fix top bottom tiles selection mismatch. (#599)
For small validation sets and poor performance models, we sometimes end up with some slides falsely belonging to TP/FN cases.
We need to make sure thay top_heaps contain only true cases and bottom ones, only false cases.
Check predicted vs true label before pushing into slides_heaps to make sure we only push true predictions in top_slides_heaps and false ones in bottom_slides_heaps
2022-09-13 21:24:28 +01:00
Anton Schwaighofer 822faf0cce
ENH: Add essential changes from main (#602)
* ENH: Enable logging to AzureML when running training outside AzureML (#580)
* FIX: Update env var settings for multi node (#588)
* ENH: Add a way of quickly starting runs with different seeds (#597)
* ENH: Submit jobs to Singularity via Amulet (#596)
2022-09-13 12:55:56 +01:00
Anton Schwaighofer fd12d9decb
ENH: Use a locked environment hi-ml folder (#598) 2022-09-09 17:01:50 +01:00
Melissa Bristow 040b96375b
ENH: Submit jobs to Singularity via Amulet (#596)
Enable submitting jobs to Singularity via Amulet
2022-09-09 16:35:20 +01:00
Anton Schwaighofer ca6c76e4c3
ENH: Add a way of quickly starting runs with different seeds (#597) 2022-09-09 14:19:16 +01:00
vale-salvatelli 378fbb3001
ENH: Adding gradient accumulation flag to trainer (#592)
Adding grad accumulation to trainer container params
2022-09-05 17:02:13 +01:00
Anton Schwaighofer fbe2ffec8f
ENH: Upgrade hi-ml-cpath to Python 3.9 (#589)
Switching to Python 3.9 required changes to SSL because the old code was updating an Enum, which is invalid in 3.9.
2022-09-05 16:59:39 +01:00
Harshita Sharma f7c21a622a
ENH: Dropout in transformer layers in DeepMIL (#590)
In this PR:
- transformer_dropout parameter is added to TransformerPooling and TransformerPoolingBenchmark pooling layers.
- Tests are updated with the transformer_dropout parameter.
2022-09-05 11:20:24 +01:00
Melissa Bristow 60e59eda77
FIX: Update env var settings for multi node (#588)
Update environment variable settings for multi node jobs
2022-09-02 15:35:03 +01:00
Anton Schwaighofer 1efdced5af
ENH: Enable logging to AzureML when running training outside AzureML (#580) 2022-09-02 13:35:43 +01:00
Harshita Sharma 763dc7f865
ENH: Enable stain normalization for multiple images (#587)
Enable stain normalization to work with multiple images (e.g., when a bag of tiles is given as input).
2022-09-01 13:52:59 +01:00
Fernando Pérez-García 0d66cecc40
MNT: Hotfix for incompatible linting packages (#586)
* Upgrade linting packages

* Update mccabe version

* Update pycodestyle version

* Update astroid version
2022-08-30 20:39:09 +01:00
pre-commit-ci[bot] c39f7d4b5c
ENH: Autoupdate hooks [pre-commit.ci] (#548)
* ENH: Autoupdate hooks [pre-commit.ci]

updates:
- [github.com/PyCQA/flake8: 4.0.1 → 5.0.2](https://github.com/PyCQA/flake8/compare/4.0.1...5.0.2)

* Try to trigger pre-commit autofix

* Fix linting errors

* Upgrade flake8 version

* Revert flake8 version for pathology

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Fernando Pérez-García <fperezgarcia@microsoft.com>
Co-authored-by: vale-salvatelli <vale-salvatelli@users.noreply.github.com>
2022-08-30 14:22:08 +01:00
vale-salvatelli 98c924a6d7
ENH: Adding PR curve plot for val and test for single fold (#585) 2022-08-30 09:54:29 +01:00
Kenza Bouzid a3d4fc6148
ENH: Log epoch of best checkpoint (#584)
Add a log to print the epoch of best checkpoint.
2022-08-25 12:27:42 +01:00
Melissa Bristow 802be95232 fix best checkpoint name 2022-08-24 17:48:09 +01:00
Kenza Bouzid e9a257d1ec
BUG: Cast binary labels into integers. (#583)
Predicted labels should be integers.
2022-08-23 11:49:24 +01:00
Kenza Bouzid b8720bfdd3
BUG: Fix probability scores displayed on histo plots (#581)
We used to display probabilities of ground truth labels instead of predicted ones.
2022-08-22 13:17:10 +01:00
Harshita Sharma c9651adec4
ENH: Add/reorganise metrics and correctly normalise confusion matrix (#578)
In this PR:

Average precision metric is added to multi-class (n_classes > 1) case.
AUROC is modified such that num_classes=None for binary case, as prescribed in Pytorch documentation https://torchmetrics.readthedocs.io/en/stable/classification/auroc.html.
Parameter num_classes is not explicitly given in Specificity, similar to the other binary metrics.
Hardcoded threshold=0.5 is removed from binary metrics (since this is default value).
Confusion matrix is normalized on true values.
Metrics are re-organized for readability.
2022-08-19 15:45:18 +01:00
Kenza Bouzid 952e93766b
BUG: Fix SettingWithCopyWarning and parametrize metadata validation (#577)
Use df.assign to fix SettingWithCopyWarning  warning and make metadata validation optional to be able to read only subsets of dataset csv
2022-08-19 09:33:54 +01:00
Anton Schwaighofer 20b0a793ed
DOC: Use a consistent team name across all packages (#575) 2022-08-17 17:07:48 +01:00
Kenza Bouzid a533c4f322
ENH: Add specific ImageNet encoders for transferability study (#574)
Add specific ImageNet encoders for transferability study: Resnet18/50 with and without preproc
2022-08-16 17:29:50 +01:00
Kenza Bouzid c67c89cd57
ENH: Parameterize tiles intensity scaling (#572)
Add parameter to control tiles intensity scaling to be able to run augmentations on uint8 data to save some memory and speed up computations.
2022-08-16 14:12:29 +01:00
Fernando Pérez-García 22444a3f5a
MNT: Install PyPI package in demo notebook (#570)
* MNT: Install PyPI package in demo notebook

* Remove unused character for f-string

* Fix variable name

* Fix typo

Co-authored-by: Shruthi42 <13177030+Shruthi42@users.noreply.github.com>

* Remove version from PIP install

Co-authored-by: Shruthi42 <13177030+Shruthi42@users.noreply.github.com>
2022-08-16 13:12:53 +01:00
Kenza Bouzid 2cc2d46364
ENH: Add debug_ddp flag (#573)
Add debug_ddp to assign different options to `TORCH_DISTRIBUTED_DEBUG` env variable.
2022-08-16 12:48:16 +01:00
Kenza Bouzid 346a9936c5
ENH: Add smoke test for flexible finetuning (#563)
Add a smoke test where all finetuning flags are switched on to make sure checkpoint loading and submodules initialization are completed properly.
2022-08-15 16:22:27 +01:00
Kenza Bouzid 7df6b058b8
ENH: Add pl_sync_batchnorm flag (#571)
Add pl_sync_batchnorm flag to turn on and off batch norm synchronization via commandline/config files
2022-08-15 14:30:53 +01:00
Kenza Bouzid ce1224077b
BUG: Set dataframe index column only when necessary (#569)
Index datasets dataframe by tile_id or slide_id only when necessary (e.g. the current index is not already to tile_id or slide_id columns)
* update check
2022-08-11 17:18:34 +01:00
Melissa Bristow 7aa7345cd6
FIX: Fix best checkpoint name (#568)
Fix best checkpoint filename
2022-08-11 09:55:20 +01:00
Melissa Bristow 0c23eb6d58
ENH: Softmax requires dimension choice (#567)
Softmax requires dimension choice
2022-08-10 10:31:41 +01:00
Melissa Bristow f82c67408f fix another warning 2022-08-09 18:22:45 +01:00
Melissa Bristow bc0b0ee677 ENH: Clean up logs (#565)
Clean up logs
2022-08-09 17:00:59 +01:00
Melissa Bristow aae6f1abd8
ENH: Clean up logs (#565)
Clean up logs
2022-08-09 15:03:06 +01:00
Kenza Bouzid 84b6e53464 assign dataframe_args earlier in init 2022-08-09 12:50:23 +00:00
Kenza Bouzid 66eb74965d
ENH: Add dataframe kwargs for flexible csv read (#564)
Add `dataframe_kwargs` to BaseDatasets and HistoDatamodule to be able to read only subset of csv and custom datatype
2022-08-09 13:16:33 +01:00