Граф коммитов

885 Коммитов

Автор SHA1 Сообщение Дата
Fred Park 2d27034cc7
Add FI_PROVIDER to Intel MPI ofi fabrics 2019-10-25 17:55:33 +00:00
Fred Park b5b380b41f
Improve error message on account_service_url 2019-10-23 15:45:24 +00:00
Fred Park da407832d9
Fix AppVeyor issue 2019-10-23 15:10:02 +00:00
Fred Park 8d1fc8e46e
Warn/error when mixing auto_scratch with autoscale
- Resolves #319
2019-10-22 16:58:38 +00:00
Fred Park 9fd1f9b08c
Get CentOS 7.6 kernel sources from vault
- Unify GPU query path and add diagnostics query
2019-10-18 16:27:04 +00:00
Fred Park acb7c6f40c
Fix streaming race between in task termination 2019-10-17 16:32:07 +00:00
Fred Park 59a155576b
Add SR-IOV setting for ubuntu 2019-10-16 15:49:41 +00:00
Fred Park 4fb1ef7238
Unify Docker root dir check 2019-10-16 02:01:28 +00:00
Fred Park 0cf8b48f3e
Fix pre-exec in non-native
- Command should not be wrapped in a shell
- Fix user add error message if ssh key is not present
2019-10-10 23:42:49 +00:00
Fred Park b460f4510c
Fix RemoteFS provisioning issues with 1 disk 2019-10-10 18:03:59 +00:00
Fred Park 3521739088
Fix remotefs bootstrap
- Samba options were being invoked for non-samba enabled clusters
2019-10-09 15:48:40 +00:00
Fred Park 81bad510dd
Update Function App guide
- Resolves #315
2019-10-01 15:55:41 +00:00
Fred Park e252f4be0d
Update jobs doc regarding envlist
- Environment variables are already sourced. Further expansion of
environment variables should be performed by shell invocation or typical
conventions by the invoking program (#314)
2019-09-17 16:03:39 +00:00
Fred Park 0244614034
Fix native output_data to observe full remote path
- Resolves #313
2019-09-13 19:56:57 +00:00
Fred Park bc9b872bba
Tag for 3.8.2 release 2019-09-12 16:38:07 +00:00
Fred Park a02e35d329
Fix task output_data on native pools with includes
- Resolves #313
2019-09-12 16:15:53 +00:00
Fred Park 43fda94278
Update drivers and dependencies
- Docker CE 19.03.2
- blobxfer to 1.9.2
- NC/ND driver to 418.87.00
2019-09-11 17:51:13 +00:00
Fred Park 89dff6b201
Prevent job submission on older pools
- Resolves #312
2019-09-11 17:44:56 +00:00
Fred Park 74d339c413
Add doc note regarding Windows env vars
- Update DotNet recipe to use container default working dir option
- Partially addresses #311
2019-09-09 15:26:21 +00:00
Fred Park 1706902959
Fix non-native data transfer sequence coupling
- Non-native input_data or output_data of azure_storage type with
sequences greater than 1 would have each individual action depend upon
the success of the prior action
- Resolves #310
2019-09-05 19:29:33 +00:00
Fred Park cbf137422e
Fix task termination for infinite retry tasks
- Resolves #308
2019-09-03 15:12:10 +00:00
Fred Park 03046aa692
Fix possible null from node error value collection
- Resolves #309
2019-08-30 21:13:22 +00:00
Fred Park 0d5850c8c9
Fix task termination in non-native mode
- SSH side-channel docker kill signal was not being sent as Docker tasks
were not being detected properly
- Also fix issue with pool images update not executing if block on
images is false
- Resolves #308
2019-08-30 20:59:51 +00:00
Fred Park 29c368ffd9
Fix non-string env var in recurring jobs
- Instead of attempting to coalesce all environment variables as strings
(which may not round trip properly), add ruamel.yaml as a dependency in
the recurrent job manager
- Resolves #306
2019-08-30 17:14:13 +00:00
Fred Park 0e773c5158
Update docs regarding AAD and subscription id
- Provide better error message in this case of missing subscription id
- Resolves #305
2019-08-30 15:53:19 +00:00
Fred Park 3a91511e50
Fix possible null node agent info on list nodes
- Resolves #307
2019-08-30 15:30:49 +00:00
Fred Park 88c3cdf8be
Fix prefix filter on task factory remote_path
- Resolves #303
2019-08-29 16:53:47 +00:00
Fred Park d77d8a8cce
Fix download cascade outputs on start task failure 2019-08-23 15:42:35 +00:00
Fred Park 0fc3522c50
Tag for 3.8.1 release 2019-08-19 16:24:33 +00:00
Fred Park 8b7b17f465
Fix Task Runner regressions
- Input/output data phases not correctly triggered for multi-instance
and MPI jobs
- Output data was not triggered at all
- Pre-exec triggering on native
- Resolves #301
2019-08-16 17:07:44 +00:00
Fred Park 07e86a3928
Fix Network Direct RDMA VM provisioning
- Resolves #299
2019-08-14 21:56:45 +00:00
Fred Park 4ab382761a
Minor post-release fixes
- Update 3rd party notices
- Move docker directory to build
- Fix old info in a few recipes
2019-08-14 15:13:28 +00:00
Fred Park ff49d187a4
Tag for 3.8.0 release 2019-08-14 03:23:09 +00:00
Fred Park 826c46afe2
Bring your own Public IP support 2019-08-14 03:23:09 +00:00
Fred Park e9130f83f4
MCR migration
- Migrate images to Microsoft Container Registry
- Fix Shellcheck issues
- Related to #278
2019-08-14 03:23:03 +00:00
Fred Park 290209381e
Update Dependencies
- Update NVIDIA compute driver to 418.67
- Update NVIDIA grid driver to 430.30
- Update Batch Insights to 1.3.0
- Update blobxfer to 1.9.0
- Update Python dependencies
- Drop Python 3.4 support
2019-08-12 20:42:32 +00:00
Fred Park 3052e98c8b
Add MVAPICH support
- More changes for #287
- Automatically source environment modules if it exists
- Fix some typos
2019-08-12 01:58:39 +00:00
Fred Park be52a9c3b0
Various updates
- Fail VM provisioning if expected IB card is not present
- Update platform image native support
2019-08-12 01:58:28 +00:00
Fred Park b6044b3489
Update GPU support
- Update to Docker CE 19.03.1
- Use "native" Docker/containerd GPU support
- Breaking change in jobs configuration to allow arbitrary configuration
- Update docs
- Resolves #293
2019-08-08 20:36:41 +00:00
Fred Park 00f1c95b1d
Update Dockerfiles to Alpine 3.10 2019-08-08 20:11:33 +00:00
Fred Park e6709409a2
Update to Singularity 3.3.0
- Check for expected ephemeral mount point
2019-08-07 21:13:30 +00:00
Fred Park caec6b566f
Allow Premium File Shares via AAD
- Documentation clarification around the main storage account
- Resolves #294
2019-08-05 18:29:27 +00:00
Fred Park 7ae3cb9e50
Merge branch 'master' into singularity3 2019-08-05 18:28:19 +00:00
Vincent Labonté b64c3cb324 Add Infiniband support with Open MPI and MPICH (#297)
* Add Infinibnad support with Open MPI

* Add mpiBench-Infiniband-OpenMPI recipe

* Add setup script for OpenFOAM-Infiniband-OpenMPI recipe

* Update setup script for OpenFOAM-Infiniband-OpenMPI recipe

* Add OpenFOAM-Infiniband-OpenMPI recipe

* Add documentation for recipes

* Add Infiniband support with MPICH

* Add mpiBench-Infiniband-MPICH recipe
2019-08-05 10:39:08 -04:00
Fred Park d3fccd613d
Tag for 3.7.1 release 2019-07-24 02:55:52 +00:00
Fred Park 3c376224a3
Fix GPU node provisioning
- Start task failures due to docker-ce-cli info changing output
- Pin docker-ce-cli
- Make docker root dir parsing more robust
- Fix LIS and CentOS 7.6 GPU provisioning
- Resolves #291
2019-07-24 02:55:35 +00:00
Fred Park 4d69c96d79
Merge branch 'sriov-merge' into singularity3 2019-07-23 21:02:52 +00:00
Vincent Labonté cc42916cba Fixes and update of recipes (#290)
* Fix multi-instance tasks that are not a MPI task

* Add setup task script for CNTK-CPU-Infiniband-IntelMPI

* Update CNTK-CPU-Infiniband-IntelMPI recipe

* Add MPI executable path option

* Update CNTK-CPU-OpenMPI recipe

* Change the default MPI executable_path to mpirun

* Modify CNTK-CPU-Infiniband-IntelMPI recipe

* Add setup task script for CNTK-GPU-Infiniband-IntelMPI

* Update CNTK-GPU-Infiniband-IntelMPI recipe

* Add setup task script for CNTK-GPU-OpenMPI

* Add setup task script for NAMD-Infiniband-IntelMPI

* Update NAMD-Infiniband-IntelMPI recipe

* Add setup task script for OpenFOAM-Infiniband-IntelMPI

* Update OpenFOAM-Infiniband-IntelMPI recipe

* Update TensorFlow-GPU Singularity recipe

* Add setup task script for OpenFOAM-TCP-OpenMPI

* Update OpenFOAM-TCP-OpenMPI recipe

* Add support for arbitrary commands with the MPI processes_per_node option

* Fix MPI with native images

* Modify CNTK-CPU-Infiniband-IntelMPI recipe

* Modify CNTK-GPU-Infiniband-IntelMPI recipe

* Modify NAMD-Infiniband-IntelMPI recipe

* Update processes_per_node documentation

* Fix `pool images list` with Singularity images

* Modify OpenFOAM-Infiniband-IntelMPI set up script

* Add check for mpi setting with Windows

* Add auto scratch support with OpenFOAM-Infiniband-IntelMPI recipe

* Modify OpenFOAM-TCP-OpenMPI set up script

* Add auto scratch support with OpenFOAM-TCP-OpenMPI recipe

* Add mpiBench-IntelMPI recipe

* Add mpiBench-MPICH recipe

* Add mpiBench-OpenMPI recipe

* Resolve PR comments

* Resolve PR comments
2019-07-17 18:57:06 -07:00
Fred Park ce0caaa24d
Add promo VM size (NC/NV/H) support 2019-07-16 16:07:03 +00:00
Fred Park e361008550
Update Alpine and Python
- Alpine to 3.10
- Python to 3.7.4
2019-07-15 03:32:24 +00:00