Граф коммитов

923 Коммитов

Автор SHA1 Сообщение Дата
Fred Park 151d546bed
Add PyTorch recipes 2018-11-29 08:57:56 -08:00
Fred Park 5aae4832b8
Add Windows Server 2019 support 2018-11-19 12:48:33 -08:00
Fred Park 2519f3cedd
Update dependencies 2018-11-19 11:20:08 -08:00
Fred Park 890e79cd24
Update Docker CE to 18.09.0 2018-11-19 09:55:23 -08:00
Fred Park 5c5feaf244
Various fixes
- fs cluster status typo
- Add delay in MSI binding for resources
- Singularity envfile naming
2018-11-19 09:54:57 -08:00
Fred Park 2ad67da15d
Tag for 3.6.0 release 2018-11-06 14:21:50 -08:00
Fred Park 342b7fc2e2
Fix various issues
- Monitoring SSH login
- Grafana update regression with Batch Shipyard Dashboard
- Federation job submission
2018-11-06 14:21:03 -08:00
Fred Park 95dec309fe
Add support for standard and ultra SSDs
- Breaking change on premium property in managed disks
- Add availability zone support
2018-11-06 10:00:25 -08:00
Fred Park 7ad6a1df05
Remove Debian 8 support 2018-11-05 11:29:02 -08:00
Fred Park 54c78b7c49
Auto scratch support 2018-11-05 11:24:22 -08:00
Fred Park 02c6e110d7
Kata containers support
- Make Singularity runtime install optional
- Add `restrict_default_bind_mounts` option to jobs spec
- Provide a default container runtime option
2018-11-05 11:24:17 -08:00
Fred Park a53bb2a044
Fix Singularity issues in latest update 2018-11-05 11:24:17 -08:00
Fred Park 49b7e48857
Fix non-public cloud SP AAD auth 2018-11-05 11:24:16 -08:00
Fred Park 62e8ebcac1
Update dependencies
- Update blobxfer to 1.5.4
- Resolves #243
2018-11-05 11:24:08 -08:00
Fred Park ab9cc70828
Update build to Python 3.7.1
- Update Windows Docker images to Python 3.7.1
- Fix flake8 errors
- Fix shellcheck errors
- Various build updates and fixes
2018-10-30 14:24:31 -07:00
Fred Park 2624ba3d87
Migrate Azure DevOps pipelines to pool syntax 2018-10-11 14:35:52 -07:00
Fred Park db009a3dc9
Various minor updates 2018-10-04 13:51:27 -07:00
Eduardo Arango 70e150858f Update Singularity git repo (#241)
Singularity is now hosted under sylabs/singularity, update from
singularityware/singularity

Signed-off-by: ArangoGutierrez <eduardo@sylabs.io>
2018-10-04 10:02:45 -07:00
Fred Park 6f35a50bf7
Rename MADL recipe to HPMLA 2018-10-04 09:59:19 -07:00
Fred Park 124bb429a0
Update download location for NV driver 2018-10-01 11:58:18 -07:00
Fred Park 784393a6ce
Tag for 3.6.0b1 release 2018-09-20 12:49:12 -07:00
Fred Park 290e460b50
Migrate to VSTS for builds and deployments
- Sign Windows builds (both exe and nupkg)
- Add MacOS build
- Remove AppVeyor/Travis builds and deployments but keep testing matrix
2018-09-20 12:30:33 -07:00
Fred Park 0d24857c18
Update Docker to 18.06.1 2018-09-20 10:37:10 -07:00
Fred Park 15ea3b16af
Support for Ubuntu 18.04 2018-09-20 09:24:29 -07:00
Fred Park 32561ae264
Support CentOS 7.5 native and native conversion 2018-09-20 09:24:29 -07:00
Fred Park 584dada9f8
Update Singularity and Alpine
- Update to 3.8, rebuild 3.7 due to CVE
- Update Singularity to 2.6.0
2018-09-20 08:48:15 -07:00
Fred Park 06ab86c655
Update various components
- Update Nvidia Tesla driver to 396.44 for NC
- Update LIS to 4.2.6
- Update prometheus and grafana
2018-09-18 13:56:27 -07:00
Fred Park b06bb20d4f
Fix autoscale scaling beyond low pri limit
- Refactor formulas
- Resolves #239
2018-09-18 13:56:27 -07:00
Fred Park 96c220df34
Update to Azure Batch 5.1.0 SDK
- Accommodate breaking changes
- Add compute node agent info
2018-09-18 13:56:26 -07:00
Fred Park f73f1e25a7
Fix shellcheck errors 2018-09-18 13:56:26 -07:00
Fred Park 1d666ae6aa
Update dependencies
- Update blobxfer to 1.5.0
2018-09-18 13:56:26 -07:00
Fred Park 1a4ad686ef
Fix federation task id generator
- Fix list issue with empty addition timestamps or uids
- Expedite generating task ids for federation bound tasks with autogenerated
  task ids
2018-08-23 13:37:03 -07:00
Fred Park c1bbd5131d
Add count commands
- jobs tasks count and pool nodes count commands with --raw support
- Update usage doc
- Resolves #228
2018-08-09 13:07:06 -07:00
Fred Park bc7d87c397
Enhance blocked action tracking
- Track blocked actions in jobs table
- Enhance fed jobs list to list both blocked and queued actions
- Update docs
2018-08-09 09:27:38 -07:00
Fred Park 4abfaf1675
Update blobxfer to 1.4.0 2018-08-08 15:48:58 -07:00
Fred Park 6e1409c16f
Fix jobs tasks term command without pool ssh info 2018-08-08 15:48:36 -07:00
Fred Park f8a700616d
Doc updates 2018-08-06 14:20:11 -07:00
Fred Park acdea94722
Tag for 3.6.0a1 release 2018-08-06 10:35:31 -07:00
Fred Park 52628d27cf
Federation support
- Federation proxy lifecycle management
- Federation lifecycle management
- Federation job submission and management
- Mount Azure File share for auto-rotated log persistence
- FIFO within job support
- Constraint matching
- Federations can be created in "unique job id" mode requiring all
  submitted jobs via fed jobs add be unique across the entire federation
- Supports nearly 15K actions per job (in non-unique job id mode)
- Task dependency rewrite engine for federated jobs
  - Verify dependencies only within task group
  - Uniquely identify task dependencies
- Allow tuning of scheduling behavior options
- Package federation logic on proxy into Docker container
- Full guide/walkthrough for federation feature
- Refactor common code between monitor/fed proxy into resource
- Other doc updates
2018-08-06 09:30:36 -07:00
Fred Park 977c2e920b
Tag for 3.5.3 release 2018-07-31 11:04:25 -07:00
Fred Park 7a92ed39b4
Improve Docker login robustness
- Omit registry info for native container pools when
  delay_docker_image_preload is enabled
- Allow cascade image management to scale to large node counts
2018-07-31 10:18:49 -07:00
Fred Park e069e72564
Support Docker image preload delay
- This option is only available for Linux non-native pools. All other
  pool types ignore this option.
2018-07-28 18:44:57 -07:00
Fred Park 3c76f9e1d9
Fix various environment variable issues
- Remove environment variable file create/upload and use task environment
  variable and dump instead for non-native pools
- Remove unnecessary windows env var dumps for non-native mode which is
  an impossible combination
- Clean-up job schedule job manager task env vars
- Simplify multi-instance env var handling
- Resolves #234
2018-07-28 16:15:28 -07:00
Fred Park f2aa2a4eba
Fix Docker Hub private registry login 2018-07-27 08:13:28 -07:00
Fred Park bfcfcc1f37
Tag for 3.5.2 release 2018-07-20 09:17:38 -07:00
Fred Park 85db792918
Fix nvidia-docker2 installation
- Properly pin dependencies
- Resolves #231
- Update bug report template
2018-07-20 09:14:10 -07:00
Fred Park aa3b7f1cef
Tag for 3.5.1 release 2018-07-17 13:30:42 -07:00
Fred Park 7060366213
Update dependencies 2018-07-17 11:18:20 -07:00
Fred Park ea1341c1bd
Update drivers and monitoring components
- NC/ND update to 396.37
- NV update to 390.75
- LIS update to 4.2.5-2
- Prometheus to 2.3.2
- Grafana to 5.2.1
2018-07-17 11:02:39 -07:00
Fred Park 15f4e94be0
Update Gluster to 4.1 on CentOS 2018-07-09 10:23:48 -07:00