Граф коммитов

894 Коммитов

Автор SHA1 Сообщение Дата
Fred Park c2cd8f5e67
Update TensorFlow Docker image ref 2018-06-11 13:37:24 -07:00
Fred Park 4573a5c95e
Support native mode platform images
- Auto convert if possible with native enablement
- Update docs
- Resolves #204
2018-06-11 13:32:33 -07:00
Fred Park ca1e9504a7
Cache container/file share creation calls
- Resolves #211
2018-06-11 07:43:15 -07:00
Fred Park 612e2a50e5
Move Prometheus/Grafana config to separate file
- Move grafana admin login info to credentials
- Update documentation for Prometheus/Grafana integration
- Resolves #205
2018-06-08 16:42:52 -07:00
Fred Park 449e621a66
Add suspend/restart support for monitoring 2018-06-08 07:47:15 -07:00
Fred Park cf0797790f
Support max increment of VMs in scenario autoscale
- Allow definition of weekdays/workhours
- Resolves #210
2018-06-08 07:20:41 -07:00
Fred Park e65dd9c196
Add CentOS-HPC 7.4 Support
- Fixup CentOS 7.4 GPU support
- Update LIS to 4.2.5
- Update packer scripts
- Resolves #184
2018-06-07 13:31:01 -07:00
Fred Park 520fe45c9d
Install into virutalenv by default
- Add ludicrous speed quickstart
- Resolves #200
2018-06-07 11:15:30 -07:00
Fred Park 911bcc8593
Fix blobxfer script regression from shellcheck 2018-06-07 10:50:42 -07:00
Fred Park a7a513b804
Don't allow docker only on cAdvisor
- Breaks grafana dashboard
2018-06-07 10:50:42 -07:00
Fred Park 9f61db12c3
Autoprovision Grafana Dashboard
- Add default dashboard
- Allow arbitrary provisioning of additional dashboards
- Add monitor list command
- Add RemoteFS monitoring support
- Compact cadvisor
2018-06-07 10:50:37 -07:00
Fred Park b77a147766
Continue Prometheus integration support
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park ba9dc76f7c
Update gluster to 4.0 2018-06-04 09:03:03 -07:00
Fred Park d4c6aa99ae
Support CentOS 7.4 GPU
- Add LIS installation support
- Update NC driver to 396.26
- Update dependencies
- Resolves #199
2018-06-04 09:03:03 -07:00
Fred Park 95382bf639
Fix appveyor build 2018-06-04 09:03:02 -07:00
Fred Park b7266d2c96
Fix conf load from keyvault
- Improve error messages
- Add missing aad and keyvault decorators to commands
2018-06-04 09:03:02 -07:00
Fred Park ae86b92be2
Start Prometheus monitoring integration
- Refactor package uploader for pool
- Auto install node exporter and cadvisor for prom enabled pools
- Add configuration
- Create monitoring resource
- Start work on picket monitor
2018-06-04 08:56:44 -07:00
Fred Park 32fa9fcaa1
account_key and aad in batch interaction
- These options are now mutually exclusive
- Addresses #197
2018-05-17 09:38:13 -07:00
Fred Park 0bf9399061
Fix some post-release issues
- Shellcheck-induced regressions in containers
- Split Docker image build into own build env on appveyor
- Update to Python 3.6.5 for cargo Docker image build on Windows
2018-05-02 10:24:38 -07:00
Fred Park bbcb479e62
Tag for 3.5.0b1 release 2018-05-02 08:16:04 -07:00
Fred Park 5168320335
Improve site extension installation robustness
- Fix errorlevel checking
- Allow nuget package on pre-releases
2018-05-02 08:15:59 -07:00
Fred Park c84f3b62b3
Update docs 2018-05-02 07:47:26 -07:00
Fred Park d7c3a779b9
Update Dockerfiles to reduce clone depth 2018-05-02 07:46:53 -07:00
Fred Park 414d1ed9fd
Update dependencies
- Update 3rd party notices
- Unpin pip for travis
2018-05-01 13:45:11 -07:00
Fred Park 648941371c
Extend retry policy for all clients
- Minor fix in node prep
2018-05-01 13:18:50 -07:00
Fred Park 7c5ca646dc
Update Singularity to 2.5.0 2018-05-01 10:57:15 -07:00
Fred Park c94508891b
Pin nvidia-docker2 installations
- Use data-root instead of graph in daemon config
- Restart docker service instead of sighup after nvidia-docker2 install
- Update Docker CE to 18.03.1
2018-05-01 09:31:45 -07:00
Fred Park accc48773b
Remove Ubuntu 14.04 support
- Resolves #164
2018-04-27 13:01:40 -07:00
Fred Park e2e8f90b62
Add boot diagnostics support
- Move to Ubuntu 18.04-LTS stable
- Enables serial console access
- Resolves #193
2018-04-27 12:59:58 -07:00
Fred Park 69235ba2c1
Add default_working_dir option
- Clarify where a container runs by default in the jobs config doc
- Resolves #190
2018-04-25 08:23:17 -07:00
Fred Park 8cd1884ac9
Fix documentation for registry creds (#189) 2018-04-25 07:40:02 -07:00
Fred Park 3a90ebda87
Async support for task file mover (#188) 2018-04-24 08:03:35 -07:00
Fred Park 2f800e1df8
Async add task for recurrent job manager (#188) 2018-04-24 07:54:23 -07:00
Fred Park a676a5a8d9
Refactor max workers call (#188)
- Add async to del certs
2018-04-20 14:41:18 -07:00
Fred Park 72484b510b
Add product_iterables support for task factory
- Resolves #187
2018-04-20 11:21:34 -07:00
Fred Park 1756e57e92
Call concurrent actions asynchronously
- Resolves #188
- Add -no-generate-tunnel-script option to pool nodes grls
2018-04-20 10:32:47 -07:00
Fred Park 97d9ca09ce
Relax zip iterable type
- Partially addresses #187
2018-04-19 14:34:57 -07:00
Fred Park 073a37b1a3
Update Nvidia drivers
- CUDA 9.1 support for NCv1
- Update NV driver for CUDA 9.1 support and work with latest Ubuntu
  16.04 releases
- Fix blacklisting of nouveau on CentOS
- Ensure persistence mode on reboot
2018-04-19 12:24:06 -07:00
Fred Park 32029bbe82
Enable nvidia persistence mode through reboots 2018-04-19 08:47:52 -07:00
Fred Park 54aea32a20
Update blobxfer to 1.2.0 2018-04-19 08:44:04 -07:00
Fred Park 935805155f
Update to Py3.6.5 for Windows Docker
- Minor build fixes in Appveyor
2018-04-19 08:40:55 -07:00
Fred Park 5d4ede9acc
Migrate RemoteFS clusters to 18.04
- Resolves #185
2018-04-18 12:35:41 -07:00
Fred Park 7f2200a31d
Update recipes to refer to platform image docs
- Resolves #186
2018-04-18 12:35:26 -07:00
Fred Park 350f1185d9
Allow AAD on storage credentials
- Resolves #179
2018-04-18 08:09:13 -07:00
Fred Park 4c099034f5
Add Windows file version info to build 2018-04-17 10:49:11 -07:00
Fred Park ed477bbc22
Support JSON output for certain commands
- Resolves #177
2018-04-17 10:49:06 -07:00
Fred Park 727292f902
Fix task submission speed regression
- Resolves #183
2018-04-10 10:21:45 -07:00
Fred Park fc6c5969b5
Unify nodeprep scripts
- Resolves #176
2018-04-09 11:01:44 -07:00
Fred Park 8660d27abd
Fix env var export for tasks without env vars
- Fixes the cat message and declare dumps into stderr and stdout,
  respectively (#180)
2018-04-04 19:04:18 -07:00
Fred Park 383bcfd7c0
Update Singularity to 2.4.5 2018-04-04 13:34:35 -07:00