Граф коммитов

923 Коммитов

Автор SHA1 Сообщение Дата
Fred Park e252f4be0d
Update jobs doc regarding envlist
- Environment variables are already sourced. Further expansion of
environment variables should be performed by shell invocation or typical
conventions by the invoking program (#314)
2019-09-17 16:03:39 +00:00
Fred Park 0244614034
Fix native output_data to observe full remote path
- Resolves #313
2019-09-13 19:56:57 +00:00
Fred Park bc9b872bba
Tag for 3.8.2 release 2019-09-12 16:38:07 +00:00
Fred Park a02e35d329
Fix task output_data on native pools with includes
- Resolves #313
2019-09-12 16:15:53 +00:00
Fred Park 43fda94278
Update drivers and dependencies
- Docker CE 19.03.2
- blobxfer to 1.9.2
- NC/ND driver to 418.87.00
2019-09-11 17:51:13 +00:00
Fred Park 89dff6b201
Prevent job submission on older pools
- Resolves #312
2019-09-11 17:44:56 +00:00
Fred Park 74d339c413
Add doc note regarding Windows env vars
- Update DotNet recipe to use container default working dir option
- Partially addresses #311
2019-09-09 15:26:21 +00:00
Fred Park 1706902959
Fix non-native data transfer sequence coupling
- Non-native input_data or output_data of azure_storage type with
sequences greater than 1 would have each individual action depend upon
the success of the prior action
- Resolves #310
2019-09-05 19:29:33 +00:00
Fred Park cbf137422e
Fix task termination for infinite retry tasks
- Resolves #308
2019-09-03 15:12:10 +00:00
Fred Park 03046aa692
Fix possible null from node error value collection
- Resolves #309
2019-08-30 21:13:22 +00:00
Fred Park 0d5850c8c9
Fix task termination in non-native mode
- SSH side-channel docker kill signal was not being sent as Docker tasks
were not being detected properly
- Also fix issue with pool images update not executing if block on
images is false
- Resolves #308
2019-08-30 20:59:51 +00:00
Fred Park 29c368ffd9
Fix non-string env var in recurring jobs
- Instead of attempting to coalesce all environment variables as strings
(which may not round trip properly), add ruamel.yaml as a dependency in
the recurrent job manager
- Resolves #306
2019-08-30 17:14:13 +00:00
Fred Park 0e773c5158
Update docs regarding AAD and subscription id
- Provide better error message in this case of missing subscription id
- Resolves #305
2019-08-30 15:53:19 +00:00
Fred Park 3a91511e50
Fix possible null node agent info on list nodes
- Resolves #307
2019-08-30 15:30:49 +00:00
Fred Park 88c3cdf8be
Fix prefix filter on task factory remote_path
- Resolves #303
2019-08-29 16:53:47 +00:00
Fred Park d77d8a8cce
Fix download cascade outputs on start task failure 2019-08-23 15:42:35 +00:00
Fred Park 0fc3522c50
Tag for 3.8.1 release 2019-08-19 16:24:33 +00:00
Fred Park 8b7b17f465
Fix Task Runner regressions
- Input/output data phases not correctly triggered for multi-instance
and MPI jobs
- Output data was not triggered at all
- Pre-exec triggering on native
- Resolves #301
2019-08-16 17:07:44 +00:00
Fred Park 07e86a3928
Fix Network Direct RDMA VM provisioning
- Resolves #299
2019-08-14 21:56:45 +00:00
Fred Park 4ab382761a
Minor post-release fixes
- Update 3rd party notices
- Move docker directory to build
- Fix old info in a few recipes
2019-08-14 15:13:28 +00:00
Fred Park ff49d187a4
Tag for 3.8.0 release 2019-08-14 03:23:09 +00:00
Fred Park 826c46afe2
Bring your own Public IP support 2019-08-14 03:23:09 +00:00
Fred Park e9130f83f4
MCR migration
- Migrate images to Microsoft Container Registry
- Fix Shellcheck issues
- Related to #278
2019-08-14 03:23:03 +00:00
Fred Park 290209381e
Update Dependencies
- Update NVIDIA compute driver to 418.67
- Update NVIDIA grid driver to 430.30
- Update Batch Insights to 1.3.0
- Update blobxfer to 1.9.0
- Update Python dependencies
- Drop Python 3.4 support
2019-08-12 20:42:32 +00:00
Fred Park 3052e98c8b
Add MVAPICH support
- More changes for #287
- Automatically source environment modules if it exists
- Fix some typos
2019-08-12 01:58:39 +00:00
Fred Park be52a9c3b0
Various updates
- Fail VM provisioning if expected IB card is not present
- Update platform image native support
2019-08-12 01:58:28 +00:00
Fred Park b6044b3489
Update GPU support
- Update to Docker CE 19.03.1
- Use "native" Docker/containerd GPU support
- Breaking change in jobs configuration to allow arbitrary configuration
- Update docs
- Resolves #293
2019-08-08 20:36:41 +00:00
Fred Park 00f1c95b1d
Update Dockerfiles to Alpine 3.10 2019-08-08 20:11:33 +00:00
Fred Park e6709409a2
Update to Singularity 3.3.0
- Check for expected ephemeral mount point
2019-08-07 21:13:30 +00:00
Fred Park caec6b566f
Allow Premium File Shares via AAD
- Documentation clarification around the main storage account
- Resolves #294
2019-08-05 18:29:27 +00:00
Fred Park 7ae3cb9e50
Merge branch 'master' into singularity3 2019-08-05 18:28:19 +00:00
Vincent Labonté b64c3cb324 Add Infiniband support with Open MPI and MPICH (#297)
* Add Infinibnad support with Open MPI

* Add mpiBench-Infiniband-OpenMPI recipe

* Add setup script for OpenFOAM-Infiniband-OpenMPI recipe

* Update setup script for OpenFOAM-Infiniband-OpenMPI recipe

* Add OpenFOAM-Infiniband-OpenMPI recipe

* Add documentation for recipes

* Add Infiniband support with MPICH

* Add mpiBench-Infiniband-MPICH recipe
2019-08-05 10:39:08 -04:00
Fred Park d3fccd613d
Tag for 3.7.1 release 2019-07-24 02:55:52 +00:00
Fred Park 3c376224a3
Fix GPU node provisioning
- Start task failures due to docker-ce-cli info changing output
- Pin docker-ce-cli
- Make docker root dir parsing more robust
- Fix LIS and CentOS 7.6 GPU provisioning
- Resolves #291
2019-07-24 02:55:35 +00:00
Fred Park 4d69c96d79
Merge branch 'sriov-merge' into singularity3 2019-07-23 21:02:52 +00:00
Vincent Labonté cc42916cba Fixes and update of recipes (#290)
* Fix multi-instance tasks that are not a MPI task

* Add setup task script for CNTK-CPU-Infiniband-IntelMPI

* Update CNTK-CPU-Infiniband-IntelMPI recipe

* Add MPI executable path option

* Update CNTK-CPU-OpenMPI recipe

* Change the default MPI executable_path to mpirun

* Modify CNTK-CPU-Infiniband-IntelMPI recipe

* Add setup task script for CNTK-GPU-Infiniband-IntelMPI

* Update CNTK-GPU-Infiniband-IntelMPI recipe

* Add setup task script for CNTK-GPU-OpenMPI

* Add setup task script for NAMD-Infiniband-IntelMPI

* Update NAMD-Infiniband-IntelMPI recipe

* Add setup task script for OpenFOAM-Infiniband-IntelMPI

* Update OpenFOAM-Infiniband-IntelMPI recipe

* Update TensorFlow-GPU Singularity recipe

* Add setup task script for OpenFOAM-TCP-OpenMPI

* Update OpenFOAM-TCP-OpenMPI recipe

* Add support for arbitrary commands with the MPI processes_per_node option

* Fix MPI with native images

* Modify CNTK-CPU-Infiniband-IntelMPI recipe

* Modify CNTK-GPU-Infiniband-IntelMPI recipe

* Modify NAMD-Infiniband-IntelMPI recipe

* Update processes_per_node documentation

* Fix `pool images list` with Singularity images

* Modify OpenFOAM-Infiniband-IntelMPI set up script

* Add check for mpi setting with Windows

* Add auto scratch support with OpenFOAM-Infiniband-IntelMPI recipe

* Modify OpenFOAM-TCP-OpenMPI set up script

* Add auto scratch support with OpenFOAM-TCP-OpenMPI recipe

* Add mpiBench-IntelMPI recipe

* Add mpiBench-MPICH recipe

* Add mpiBench-OpenMPI recipe

* Resolve PR comments

* Resolve PR comments
2019-07-17 18:57:06 -07:00
Fred Park ce0caaa24d
Add promo VM size (NC/NV/H) support 2019-07-16 16:07:03 +00:00
Fred Park e361008550
Update Alpine and Python
- Alpine to 3.10
- Python to 3.7.4
2019-07-15 03:32:24 +00:00
Fred Park ec3c6f49be
Update Intel Benchmarks using Hc
- Docker and Singularity3 updates
2019-07-15 03:32:24 +00:00
Fred Park 25fec92273
Support Hc/Hb
- Support RDMA bifurcation
- Update platform docs for CentOS-HPC 7.6
2019-07-15 03:32:04 +00:00
Fred Park 9736cea593
Update NAMD-TCP recipe
- Fix connection issues
2019-07-09 21:46:34 +00:00
Fred Park 559463cd12
Merge branch 'develop' into sriov-merge 2019-07-09 21:45:31 +00:00
Vincent Labonté 442a22bd28 Improve MPI Interface for Singularity and Docker (#289)
* Add MPI config support for MPICH

* Add MPI config support for Docker containers

* Resolve PR comments

* Make use of the script runner with MPI and Docker

* Minor fixes

* Resolve PR comments
2019-07-09 13:46:12 -07:00
Vincent Labonté e6e60048a7 Improve MPI Interface for Intel MPI and Open MPI with Singularity images (#288)
* Add MPI config support for IntelMPI

* Separate prologue command into user and system

* Add MpiSettings

* Add MPI config support for Open MPI

* Fix MPI config support for IntelMPI

* Workaround for Open MPI btl tcp

* Correct documentation

* Fix non mpi multi instance execution

* Resolve PR comments

* Resolve PR comments

* Partially address #287
2019-07-03 12:40:54 -07:00
Fred Park 878cc18487
Update docs for Shared Image Gallery support 2019-06-27 20:49:37 +00:00
Fred Park 4b9a004f1a
Update to Batch 7.0.0 SDK
- Breaking change: pool listskus -> account images
- Support setting working directory for native mode
- Resolves #286
2019-06-27 20:08:49 +00:00
Fred Park b93f60213d
Support conditional output data
- Resolves #230
2019-06-24 18:03:43 +00:00
Fred Park 7b138e785a
Support user-specified job prep/release tasks
- Host mode only
- Resolves #202
2019-06-24 16:02:30 +00:00
Fred Park eb3c70bbf5
Fix autoscratch setup issue 2019-06-21 19:55:27 +00:00
Vincent Labonté 9f58ad0042 Fixes for Singularity 3 support (#285)
* Fix credentials when running task with Singularity docker:// images

* Fix Singularity cache directory's ownership

* Fix images update command

* Fix running cascade with use_shipyard_docker_image

* Remove envfile dump in task runner
2019-06-21 10:36:28 -07:00