Граф коммитов

82 Коммитов

Автор SHA1 Сообщение Дата
Fred Park 0fc3522c50
Tag for 3.8.1 release 2019-08-19 16:24:33 +00:00
Fred Park 826c46afe2
Bring your own Public IP support 2019-08-14 03:23:09 +00:00
Fred Park 290209381e
Update Dependencies
- Update NVIDIA compute driver to 418.67
- Update NVIDIA grid driver to 430.30
- Update Batch Insights to 1.3.0
- Update blobxfer to 1.9.0
- Update Python dependencies
- Drop Python 3.4 support
2019-08-12 20:42:32 +00:00
Fred Park 4b9a004f1a
Update to Batch 7.0.0 SDK
- Breaking change: pool listskus -> account images
- Support setting working directory for native mode
- Resolves #286
2019-06-27 20:08:49 +00:00
Fred Park ec7af5b7c1
Various updates
- Update docs
- Update azure-batch dependency
- Set Slurm scheduling option defer mode
2019-03-22 14:50:26 -07:00
Fred Park 97dac7d5aa
Update blobxfer to 1.7.1
- Update some docs
2019-03-05 08:05:02 -08:00
Fred Park 6e8d2a119f
Component updates
- Update blobxfer to 1.7.0
- Update Batch Insights to 1.2.0
- Update LIS
- Update NV driver to 410.92
- Update NC/ND driver to 410.104
2019-02-28 12:11:19 -08:00
Fred Park a30cb674ca
Migrate to Azure Batch Python SDK 6.0.0
- Fix breaking changes
- Update dependencies
- Gate some debug messages behind the verbose flag
2019-01-16 13:03:30 -08:00
Fred Park 1253ee0062
Component updates
- Update blobxfer to 1.6.0
- Update Singularity to 2.6.1
- Update Docker CE to 18.09.1
- Move monitor setup to after GPU driver installation
2019-01-16 13:03:30 -08:00
Fred Park eea4286724
Update dependencies
- NC/ND driver to 410.79
- NV Grid driver to 410.71 with CUDA10 support
- LIS
2018-12-03 09:03:49 -08:00
Fred Park 2519f3cedd
Update dependencies 2018-11-19 11:20:08 -08:00
Fred Park 62e8ebcac1
Update dependencies
- Update blobxfer to 1.5.4
- Resolves #243
2018-11-05 11:24:08 -08:00
Fred Park 96c220df34
Update to Azure Batch 5.1.0 SDK
- Accommodate breaking changes
- Add compute node agent info
2018-09-18 13:56:26 -07:00
Fred Park 1d666ae6aa
Update dependencies
- Update blobxfer to 1.5.0
2018-09-18 13:56:26 -07:00
Fred Park 4abfaf1675
Update blobxfer to 1.4.0 2018-08-08 15:48:58 -07:00
Fred Park acdea94722
Tag for 3.6.0a1 release 2018-08-06 10:35:31 -07:00
Fred Park 52628d27cf
Federation support
- Federation proxy lifecycle management
- Federation lifecycle management
- Federation job submission and management
- Mount Azure File share for auto-rotated log persistence
- FIFO within job support
- Constraint matching
- Federations can be created in "unique job id" mode requiring all
  submitted jobs via fed jobs add be unique across the entire federation
- Supports nearly 15K actions per job (in non-unique job id mode)
- Task dependency rewrite engine for federated jobs
  - Verify dependencies only within task group
  - Uniquely identify task dependencies
- Allow tuning of scheduling behavior options
- Package federation logic on proxy into Docker container
- Full guide/walkthrough for federation feature
- Refactor common code between monitor/fed proxy into resource
- Other doc updates
2018-08-06 09:30:36 -07:00
Fred Park 7060366213
Update dependencies 2018-07-17 11:18:20 -07:00
Fred Park 0c3d492f1a
Various fixes and updates
- Dump node listing on unusable
- Update Gluster to 4.1 on Ubuntu/Debian/RemoteFS
- Update Python to 3.6.6 in Windows Docker images
- Update dependencies
- Minor doc updates
- Fix appveyor sha256 artifact upload
2018-06-29 09:32:10 -07:00
Fred Park 66e77ac397
Update dependencies
- Fixes in scripts for cascade and monitor cert renewal
2018-06-27 15:32:22 -07:00
Fred Park 2336a63990
Add GitHub issue templates
- Update AppVeyor build
- Update requirements
- Minor doc updates
2018-06-18 10:12:23 -07:00
Fred Park a00ecab340
Tag for 3.5.0b2 release
- Update Nuget package metadata
2018-06-12 14:00:25 -07:00
Fred Park 9f61db12c3
Autoprovision Grafana Dashboard
- Add default dashboard
- Allow arbitrary provisioning of additional dashboards
- Add monitor list command
- Add RemoteFS monitoring support
- Compact cadvisor
2018-06-07 10:50:37 -07:00
Fred Park b77a147766
Continue Prometheus integration support
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park d4c6aa99ae
Support CentOS 7.4 GPU
- Add LIS installation support
- Update NC driver to 396.26
- Update dependencies
- Resolves #199
2018-06-04 09:03:03 -07:00
Fred Park ae86b92be2
Start Prometheus monitoring integration
- Refactor package uploader for pool
- Auto install node exporter and cadvisor for prom enabled pools
- Add configuration
- Create monitoring resource
- Start work on picket monitor
2018-06-04 08:56:44 -07:00
Fred Park 414d1ed9fd
Update dependencies
- Update 3rd party notices
- Unpin pip for travis
2018-05-01 13:45:11 -07:00
Fred Park 1756e57e92
Call concurrent actions asynchronously
- Resolves #188
- Add -no-generate-tunnel-script option to pool nodes grls
2018-04-20 10:32:47 -07:00
Fred Park 54aea32a20
Update blobxfer to 1.2.0 2018-04-19 08:44:04 -07:00
Fred Park 350f1185d9
Allow AAD on storage credentials
- Resolves #179
2018-04-18 08:09:13 -07:00
Fred Park c700b257e1
Integrate Shellcheck into travis build
- Resolves #178
2018-04-04 13:34:24 -07:00
Fred Park 74d20675bf
Tag for 3.4.0 release 2018-03-26 13:19:51 -07:00
Fred Park 330c193422
Improve prep scripts
- Add timestamps for logging
- Add more Docker and nvidia details
- Save prior startup logs
- Update dependencies
2018-03-22 10:45:28 -07:00
Fred Park 1a4c04bd62
Update dependencies 2018-03-19 10:02:14 -07:00
Fred Park a195d7e242
Update dependencies 2018-02-28 15:08:33 -08:00
Fred Park e2d63541b4
Fix appveyor script to use correct pip version 2018-02-21 08:18:23 -08:00
Fred Park b4e6e4320d
Various updates
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park 663548d91e
Minor updates
- Update dependencies
- Check for out of date deps
2018-02-07 14:32:32 -08:00
Fred Park f68e19edb1
Update build
- Update to blobxfer 1.1.1
2018-01-30 12:52:16 -08:00
Fred Park 5d08581e51
Update dependencies and third party notices 2018-01-25 14:32:44 -08:00
Fred Park c24ab46ba0
Add configuration validation
- Resolves #145
2018-01-22 10:54:26 -08:00
Fred Park 1a5011144d Update remotefs to use latest dependency
- Redirect using old API
- Detect VM allocation failures in RemoteFS
2017-11-10 12:47:26 -08:00
Fred Park 90283298e6 Update dependencies 2017-11-10 09:23:15 -08:00
Fred Park 38b10b80b7 Update to blobxfer 1.0.0 2017-11-06 12:55:17 -08:00
Fred Park 4dc228aeaf Fix job submission on custom image pools 2017-11-06 08:19:48 -08:00
Fred Park 6f62740292 Tag for 3.0.0a2 release 2017-10-27 11:35:48 -07:00
Fred Park bb03797360 Fix no nodes listed on resizing state
- Update dependencies
2017-10-24 13:08:26 -07:00
Fred Park 607bfd252e Migrate to storage split library
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park 796a5e33b4 Combine rjm/tfm to cargo (#125) 2017-10-03 18:24:50 -07:00
Fred Park 6315be3a6b Transition to blobxfer 1.x command structure
- Data ingress/egress changes
- Task factory file changes
- Resolves #47
2017-10-03 18:24:49 -07:00