Fred Park
826c46afe2
Bring your own Public IP support
2019-08-14 03:23:09 +00:00
Fred Park
290209381e
Update Dependencies
...
- Update NVIDIA compute driver to 418.67
- Update NVIDIA grid driver to 430.30
- Update Batch Insights to 1.3.0
- Update blobxfer to 1.9.0
- Update Python dependencies
- Drop Python 3.4 support
2019-08-12 20:42:32 +00:00
Fred Park
4b9a004f1a
Update to Batch 7.0.0 SDK
...
- Breaking change: pool listskus -> account images
- Support setting working directory for native mode
- Resolves #286
2019-06-27 20:08:49 +00:00
Fred Park
ec7af5b7c1
Various updates
...
- Update docs
- Update azure-batch dependency
- Set Slurm scheduling option defer mode
2019-03-22 14:50:26 -07:00
Fred Park
97dac7d5aa
Update blobxfer to 1.7.1
...
- Update some docs
2019-03-05 08:05:02 -08:00
Fred Park
6e8d2a119f
Component updates
...
- Update blobxfer to 1.7.0
- Update Batch Insights to 1.2.0
- Update LIS
- Update NV driver to 410.92
- Update NC/ND driver to 410.104
2019-02-28 12:11:19 -08:00
Fred Park
a30cb674ca
Migrate to Azure Batch Python SDK 6.0.0
...
- Fix breaking changes
- Update dependencies
- Gate some debug messages behind the verbose flag
2019-01-16 13:03:30 -08:00
Fred Park
1253ee0062
Component updates
...
- Update blobxfer to 1.6.0
- Update Singularity to 2.6.1
- Update Docker CE to 18.09.1
- Move monitor setup to after GPU driver installation
2019-01-16 13:03:30 -08:00
Fred Park
eea4286724
Update dependencies
...
- NC/ND driver to 410.79
- NV Grid driver to 410.71 with CUDA10 support
- LIS
2018-12-03 09:03:49 -08:00
Fred Park
2519f3cedd
Update dependencies
2018-11-19 11:20:08 -08:00
Fred Park
62e8ebcac1
Update dependencies
...
- Update blobxfer to 1.5.4
- Resolves #243
2018-11-05 11:24:08 -08:00
Fred Park
96c220df34
Update to Azure Batch 5.1.0 SDK
...
- Accommodate breaking changes
- Add compute node agent info
2018-09-18 13:56:26 -07:00
Fred Park
1d666ae6aa
Update dependencies
...
- Update blobxfer to 1.5.0
2018-09-18 13:56:26 -07:00
Fred Park
4abfaf1675
Update blobxfer to 1.4.0
2018-08-08 15:48:58 -07:00
Fred Park
acdea94722
Tag for 3.6.0a1 release
2018-08-06 10:35:31 -07:00
Fred Park
52628d27cf
Federation support
...
- Federation proxy lifecycle management
- Federation lifecycle management
- Federation job submission and management
- Mount Azure File share for auto-rotated log persistence
- FIFO within job support
- Constraint matching
- Federations can be created in "unique job id" mode requiring all
submitted jobs via fed jobs add be unique across the entire federation
- Supports nearly 15K actions per job (in non-unique job id mode)
- Task dependency rewrite engine for federated jobs
- Verify dependencies only within task group
- Uniquely identify task dependencies
- Allow tuning of scheduling behavior options
- Package federation logic on proxy into Docker container
- Full guide/walkthrough for federation feature
- Refactor common code between monitor/fed proxy into resource
- Other doc updates
2018-08-06 09:30:36 -07:00
Fred Park
7060366213
Update dependencies
2018-07-17 11:18:20 -07:00
Fred Park
0c3d492f1a
Various fixes and updates
...
- Dump node listing on unusable
- Update Gluster to 4.1 on Ubuntu/Debian/RemoteFS
- Update Python to 3.6.6 in Windows Docker images
- Update dependencies
- Minor doc updates
- Fix appveyor sha256 artifact upload
2018-06-29 09:32:10 -07:00
Fred Park
66e77ac397
Update dependencies
...
- Fixes in scripts for cascade and monitor cert renewal
2018-06-27 15:32:22 -07:00
Fred Park
2336a63990
Add GitHub issue templates
...
- Update AppVeyor build
- Update requirements
- Minor doc updates
2018-06-18 10:12:23 -07:00
Fred Park
a00ecab340
Tag for 3.5.0b2 release
...
- Update Nuget package metadata
2018-06-12 14:00:25 -07:00
Fred Park
9f61db12c3
Autoprovision Grafana Dashboard
...
- Add default dashboard
- Allow arbitrary provisioning of additional dashboards
- Add monitor list command
- Add RemoteFS monitoring support
- Compact cadvisor
2018-06-07 10:50:37 -07:00
Fred Park
b77a147766
Continue Prometheus integration support
...
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park
d4c6aa99ae
Support CentOS 7.4 GPU
...
- Add LIS installation support
- Update NC driver to 396.26
- Update dependencies
- Resolves #199
2018-06-04 09:03:03 -07:00
Fred Park
ae86b92be2
Start Prometheus monitoring integration
...
- Refactor package uploader for pool
- Auto install node exporter and cadvisor for prom enabled pools
- Add configuration
- Create monitoring resource
- Start work on picket monitor
2018-06-04 08:56:44 -07:00
Fred Park
414d1ed9fd
Update dependencies
...
- Update 3rd party notices
- Unpin pip for travis
2018-05-01 13:45:11 -07:00
Fred Park
1756e57e92
Call concurrent actions asynchronously
...
- Resolves #188
- Add -no-generate-tunnel-script option to pool nodes grls
2018-04-20 10:32:47 -07:00
Fred Park
54aea32a20
Update blobxfer to 1.2.0
2018-04-19 08:44:04 -07:00
Fred Park
350f1185d9
Allow AAD on storage credentials
...
- Resolves #179
2018-04-18 08:09:13 -07:00
Fred Park
c700b257e1
Integrate Shellcheck into travis build
...
- Resolves #178
2018-04-04 13:34:24 -07:00
Fred Park
74d20675bf
Tag for 3.4.0 release
2018-03-26 13:19:51 -07:00
Fred Park
330c193422
Improve prep scripts
...
- Add timestamps for logging
- Add more Docker and nvidia details
- Save prior startup logs
- Update dependencies
2018-03-22 10:45:28 -07:00
Fred Park
1a4c04bd62
Update dependencies
2018-03-19 10:02:14 -07:00
Fred Park
a195d7e242
Update dependencies
2018-02-28 15:08:33 -08:00
Fred Park
e2d63541b4
Fix appveyor script to use correct pip version
2018-02-21 08:18:23 -08:00
Fred Park
b4e6e4320d
Various updates
...
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park
663548d91e
Minor updates
...
- Update dependencies
- Check for out of date deps
2018-02-07 14:32:32 -08:00
Fred Park
f68e19edb1
Update build
...
- Update to blobxfer 1.1.1
2018-01-30 12:52:16 -08:00
Fred Park
5d08581e51
Update dependencies and third party notices
2018-01-25 14:32:44 -08:00
Fred Park
c24ab46ba0
Add configuration validation
...
- Resolves #145
2018-01-22 10:54:26 -08:00
Fred Park
1a5011144d
Update remotefs to use latest dependency
...
- Redirect using old API
- Detect VM allocation failures in RemoteFS
2017-11-10 12:47:26 -08:00
Fred Park
90283298e6
Update dependencies
2017-11-10 09:23:15 -08:00
Fred Park
38b10b80b7
Update to blobxfer 1.0.0
2017-11-06 12:55:17 -08:00
Fred Park
4dc228aeaf
Fix job submission on custom image pools
2017-11-06 08:19:48 -08:00
Fred Park
6f62740292
Tag for 3.0.0a2 release
2017-10-27 11:35:48 -07:00
Fred Park
bb03797360
Fix no nodes listed on resizing state
...
- Update dependencies
2017-10-24 13:08:26 -07:00
Fred Park
607bfd252e
Migrate to storage split library
...
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park
796a5e33b4
Combine rjm/tfm to cargo ( #125 )
2017-10-03 18:24:50 -07:00
Fred Park
6315be3a6b
Transition to blobxfer 1.x command structure
...
- Data ingress/egress changes
- Task factory file changes
- Resolves #47
2017-10-03 18:24:49 -07:00
Fred Park
238982db77
Add ARM VNet support in Batch service mode ( #126 )
...
- Support "global" aad property in credentials
- Add Virtual Network guide
2017-10-03 10:05:17 -07:00