Fred Park
e252f4be0d
Update jobs doc regarding envlist
...
- Environment variables are already sourced. Further expansion of
environment variables should be performed by shell invocation or typical
conventions by the invoking program (#314 )
2019-09-17 16:03:39 +00:00
Fred Park
0244614034
Fix native output_data to observe full remote path
...
- Resolves #313
2019-09-13 19:56:57 +00:00
Fred Park
bc9b872bba
Tag for 3.8.2 release
2019-09-12 16:38:07 +00:00
Fred Park
a02e35d329
Fix task output_data on native pools with includes
...
- Resolves #313
2019-09-12 16:15:53 +00:00
Fred Park
43fda94278
Update drivers and dependencies
...
- Docker CE 19.03.2
- blobxfer to 1.9.2
- NC/ND driver to 418.87.00
2019-09-11 17:51:13 +00:00
Fred Park
89dff6b201
Prevent job submission on older pools
...
- Resolves #312
2019-09-11 17:44:56 +00:00
Fred Park
74d339c413
Add doc note regarding Windows env vars
...
- Update DotNet recipe to use container default working dir option
- Partially addresses #311
2019-09-09 15:26:21 +00:00
Fred Park
1706902959
Fix non-native data transfer sequence coupling
...
- Non-native input_data or output_data of azure_storage type with
sequences greater than 1 would have each individual action depend upon
the success of the prior action
- Resolves #310
2019-09-05 19:29:33 +00:00
Fred Park
cbf137422e
Fix task termination for infinite retry tasks
...
- Resolves #308
2019-09-03 15:12:10 +00:00
Fred Park
03046aa692
Fix possible null from node error value collection
...
- Resolves #309
2019-08-30 21:13:22 +00:00
Fred Park
0d5850c8c9
Fix task termination in non-native mode
...
- SSH side-channel docker kill signal was not being sent as Docker tasks
were not being detected properly
- Also fix issue with pool images update not executing if block on
images is false
- Resolves #308
2019-08-30 20:59:51 +00:00
Fred Park
29c368ffd9
Fix non-string env var in recurring jobs
...
- Instead of attempting to coalesce all environment variables as strings
(which may not round trip properly), add ruamel.yaml as a dependency in
the recurrent job manager
- Resolves #306
2019-08-30 17:14:13 +00:00
Fred Park
0e773c5158
Update docs regarding AAD and subscription id
...
- Provide better error message in this case of missing subscription id
- Resolves #305
2019-08-30 15:53:19 +00:00
Fred Park
3a91511e50
Fix possible null node agent info on list nodes
...
- Resolves #307
2019-08-30 15:30:49 +00:00
Fred Park
88c3cdf8be
Fix prefix filter on task factory remote_path
...
- Resolves #303
2019-08-29 16:53:47 +00:00
Fred Park
d77d8a8cce
Fix download cascade outputs on start task failure
2019-08-23 15:42:35 +00:00
Fred Park
0fc3522c50
Tag for 3.8.1 release
2019-08-19 16:24:33 +00:00
Fred Park
8b7b17f465
Fix Task Runner regressions
...
- Input/output data phases not correctly triggered for multi-instance
and MPI jobs
- Output data was not triggered at all
- Pre-exec triggering on native
- Resolves #301
2019-08-16 17:07:44 +00:00
Fred Park
07e86a3928
Fix Network Direct RDMA VM provisioning
...
- Resolves #299
2019-08-14 21:56:45 +00:00
Fred Park
4ab382761a
Minor post-release fixes
...
- Update 3rd party notices
- Move docker directory to build
- Fix old info in a few recipes
2019-08-14 15:13:28 +00:00
Fred Park
ff49d187a4
Tag for 3.8.0 release
2019-08-14 03:23:09 +00:00
Fred Park
826c46afe2
Bring your own Public IP support
2019-08-14 03:23:09 +00:00
Fred Park
e9130f83f4
MCR migration
...
- Migrate images to Microsoft Container Registry
- Fix Shellcheck issues
- Related to #278
2019-08-14 03:23:03 +00:00
Fred Park
290209381e
Update Dependencies
...
- Update NVIDIA compute driver to 418.67
- Update NVIDIA grid driver to 430.30
- Update Batch Insights to 1.3.0
- Update blobxfer to 1.9.0
- Update Python dependencies
- Drop Python 3.4 support
2019-08-12 20:42:32 +00:00
Fred Park
3052e98c8b
Add MVAPICH support
...
- More changes for #287
- Automatically source environment modules if it exists
- Fix some typos
2019-08-12 01:58:39 +00:00
Fred Park
be52a9c3b0
Various updates
...
- Fail VM provisioning if expected IB card is not present
- Update platform image native support
2019-08-12 01:58:28 +00:00
Fred Park
b6044b3489
Update GPU support
...
- Update to Docker CE 19.03.1
- Use "native" Docker/containerd GPU support
- Breaking change in jobs configuration to allow arbitrary configuration
- Update docs
- Resolves #293
2019-08-08 20:36:41 +00:00
Fred Park
00f1c95b1d
Update Dockerfiles to Alpine 3.10
2019-08-08 20:11:33 +00:00
Fred Park
e6709409a2
Update to Singularity 3.3.0
...
- Check for expected ephemeral mount point
2019-08-07 21:13:30 +00:00
Fred Park
caec6b566f
Allow Premium File Shares via AAD
...
- Documentation clarification around the main storage account
- Resolves #294
2019-08-05 18:29:27 +00:00
Fred Park
7ae3cb9e50
Merge branch 'master' into singularity3
2019-08-05 18:28:19 +00:00
Vincent Labonté
b64c3cb324
Add Infiniband support with Open MPI and MPICH ( #297 )
...
* Add Infinibnad support with Open MPI
* Add mpiBench-Infiniband-OpenMPI recipe
* Add setup script for OpenFOAM-Infiniband-OpenMPI recipe
* Update setup script for OpenFOAM-Infiniband-OpenMPI recipe
* Add OpenFOAM-Infiniband-OpenMPI recipe
* Add documentation for recipes
* Add Infiniband support with MPICH
* Add mpiBench-Infiniband-MPICH recipe
2019-08-05 10:39:08 -04:00
Fred Park
d3fccd613d
Tag for 3.7.1 release
2019-07-24 02:55:52 +00:00
Fred Park
3c376224a3
Fix GPU node provisioning
...
- Start task failures due to docker-ce-cli info changing output
- Pin docker-ce-cli
- Make docker root dir parsing more robust
- Fix LIS and CentOS 7.6 GPU provisioning
- Resolves #291
2019-07-24 02:55:35 +00:00
Fred Park
4d69c96d79
Merge branch 'sriov-merge' into singularity3
2019-07-23 21:02:52 +00:00
Vincent Labonté
cc42916cba
Fixes and update of recipes ( #290 )
...
* Fix multi-instance tasks that are not a MPI task
* Add setup task script for CNTK-CPU-Infiniband-IntelMPI
* Update CNTK-CPU-Infiniband-IntelMPI recipe
* Add MPI executable path option
* Update CNTK-CPU-OpenMPI recipe
* Change the default MPI executable_path to mpirun
* Modify CNTK-CPU-Infiniband-IntelMPI recipe
* Add setup task script for CNTK-GPU-Infiniband-IntelMPI
* Update CNTK-GPU-Infiniband-IntelMPI recipe
* Add setup task script for CNTK-GPU-OpenMPI
* Add setup task script for NAMD-Infiniband-IntelMPI
* Update NAMD-Infiniband-IntelMPI recipe
* Add setup task script for OpenFOAM-Infiniband-IntelMPI
* Update OpenFOAM-Infiniband-IntelMPI recipe
* Update TensorFlow-GPU Singularity recipe
* Add setup task script for OpenFOAM-TCP-OpenMPI
* Update OpenFOAM-TCP-OpenMPI recipe
* Add support for arbitrary commands with the MPI processes_per_node option
* Fix MPI with native images
* Modify CNTK-CPU-Infiniband-IntelMPI recipe
* Modify CNTK-GPU-Infiniband-IntelMPI recipe
* Modify NAMD-Infiniband-IntelMPI recipe
* Update processes_per_node documentation
* Fix `pool images list` with Singularity images
* Modify OpenFOAM-Infiniband-IntelMPI set up script
* Add check for mpi setting with Windows
* Add auto scratch support with OpenFOAM-Infiniband-IntelMPI recipe
* Modify OpenFOAM-TCP-OpenMPI set up script
* Add auto scratch support with OpenFOAM-TCP-OpenMPI recipe
* Add mpiBench-IntelMPI recipe
* Add mpiBench-MPICH recipe
* Add mpiBench-OpenMPI recipe
* Resolve PR comments
* Resolve PR comments
2019-07-17 18:57:06 -07:00
Fred Park
ce0caaa24d
Add promo VM size (NC/NV/H) support
2019-07-16 16:07:03 +00:00
Fred Park
e361008550
Update Alpine and Python
...
- Alpine to 3.10
- Python to 3.7.4
2019-07-15 03:32:24 +00:00
Fred Park
ec3c6f49be
Update Intel Benchmarks using Hc
...
- Docker and Singularity3 updates
2019-07-15 03:32:24 +00:00
Fred Park
25fec92273
Support Hc/Hb
...
- Support RDMA bifurcation
- Update platform docs for CentOS-HPC 7.6
2019-07-15 03:32:04 +00:00
Fred Park
9736cea593
Update NAMD-TCP recipe
...
- Fix connection issues
2019-07-09 21:46:34 +00:00
Fred Park
559463cd12
Merge branch 'develop' into sriov-merge
2019-07-09 21:45:31 +00:00
Vincent Labonté
442a22bd28
Improve MPI Interface for Singularity and Docker ( #289 )
...
* Add MPI config support for MPICH
* Add MPI config support for Docker containers
* Resolve PR comments
* Make use of the script runner with MPI and Docker
* Minor fixes
* Resolve PR comments
2019-07-09 13:46:12 -07:00
Vincent Labonté
e6e60048a7
Improve MPI Interface for Intel MPI and Open MPI with Singularity images ( #288 )
...
* Add MPI config support for IntelMPI
* Separate prologue command into user and system
* Add MpiSettings
* Add MPI config support for Open MPI
* Fix MPI config support for IntelMPI
* Workaround for Open MPI btl tcp
* Correct documentation
* Fix non mpi multi instance execution
* Resolve PR comments
* Resolve PR comments
* Partially address #287
2019-07-03 12:40:54 -07:00
Fred Park
878cc18487
Update docs for Shared Image Gallery support
2019-06-27 20:49:37 +00:00
Fred Park
4b9a004f1a
Update to Batch 7.0.0 SDK
...
- Breaking change: pool listskus -> account images
- Support setting working directory for native mode
- Resolves #286
2019-06-27 20:08:49 +00:00
Fred Park
b93f60213d
Support conditional output data
...
- Resolves #230
2019-06-24 18:03:43 +00:00
Fred Park
7b138e785a
Support user-specified job prep/release tasks
...
- Host mode only
- Resolves #202
2019-06-24 16:02:30 +00:00
Fred Park
eb3c70bbf5
Fix autoscratch setup issue
2019-06-21 19:55:27 +00:00
Vincent Labonté
9f58ad0042
Fixes for Singularity 3 support ( #285 )
...
* Fix credentials when running task with Singularity docker:// images
* Fix Singularity cache directory's ownership
* Fix images update command
* Fix running cascade with use_shipyard_docker_image
* Remove envfile dump in task runner
2019-06-21 10:36:28 -07:00