Fred Park
c2cd8f5e67
Update TensorFlow Docker image ref
2018-06-11 13:37:24 -07:00
Fred Park
4573a5c95e
Support native mode platform images
...
- Auto convert if possible with native enablement
- Update docs
- Resolves #204
2018-06-11 13:32:33 -07:00
Fred Park
ca1e9504a7
Cache container/file share creation calls
...
- Resolves #211
2018-06-11 07:43:15 -07:00
Fred Park
612e2a50e5
Move Prometheus/Grafana config to separate file
...
- Move grafana admin login info to credentials
- Update documentation for Prometheus/Grafana integration
- Resolves #205
2018-06-08 16:42:52 -07:00
Fred Park
449e621a66
Add suspend/restart support for monitoring
2018-06-08 07:47:15 -07:00
Fred Park
cf0797790f
Support max increment of VMs in scenario autoscale
...
- Allow definition of weekdays/workhours
- Resolves #210
2018-06-08 07:20:41 -07:00
Fred Park
e65dd9c196
Add CentOS-HPC 7.4 Support
...
- Fixup CentOS 7.4 GPU support
- Update LIS to 4.2.5
- Update packer scripts
- Resolves #184
2018-06-07 13:31:01 -07:00
Fred Park
520fe45c9d
Install into virutalenv by default
...
- Add ludicrous speed quickstart
- Resolves #200
2018-06-07 11:15:30 -07:00
Fred Park
911bcc8593
Fix blobxfer script regression from shellcheck
2018-06-07 10:50:42 -07:00
Fred Park
a7a513b804
Don't allow docker only on cAdvisor
...
- Breaks grafana dashboard
2018-06-07 10:50:42 -07:00
Fred Park
9f61db12c3
Autoprovision Grafana Dashboard
...
- Add default dashboard
- Allow arbitrary provisioning of additional dashboards
- Add monitor list command
- Add RemoteFS monitoring support
- Compact cadvisor
2018-06-07 10:50:37 -07:00
Fred Park
b77a147766
Continue Prometheus integration support
...
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park
ba9dc76f7c
Update gluster to 4.0
2018-06-04 09:03:03 -07:00
Fred Park
d4c6aa99ae
Support CentOS 7.4 GPU
...
- Add LIS installation support
- Update NC driver to 396.26
- Update dependencies
- Resolves #199
2018-06-04 09:03:03 -07:00
Fred Park
95382bf639
Fix appveyor build
2018-06-04 09:03:02 -07:00
Fred Park
b7266d2c96
Fix conf load from keyvault
...
- Improve error messages
- Add missing aad and keyvault decorators to commands
2018-06-04 09:03:02 -07:00
Fred Park
ae86b92be2
Start Prometheus monitoring integration
...
- Refactor package uploader for pool
- Auto install node exporter and cadvisor for prom enabled pools
- Add configuration
- Create monitoring resource
- Start work on picket monitor
2018-06-04 08:56:44 -07:00
Fred Park
32fa9fcaa1
account_key and aad in batch interaction
...
- These options are now mutually exclusive
- Addresses #197
2018-05-17 09:38:13 -07:00
Fred Park
0bf9399061
Fix some post-release issues
...
- Shellcheck-induced regressions in containers
- Split Docker image build into own build env on appveyor
- Update to Python 3.6.5 for cargo Docker image build on Windows
2018-05-02 10:24:38 -07:00
Fred Park
bbcb479e62
Tag for 3.5.0b1 release
2018-05-02 08:16:04 -07:00
Fred Park
5168320335
Improve site extension installation robustness
...
- Fix errorlevel checking
- Allow nuget package on pre-releases
2018-05-02 08:15:59 -07:00
Fred Park
c84f3b62b3
Update docs
2018-05-02 07:47:26 -07:00
Fred Park
d7c3a779b9
Update Dockerfiles to reduce clone depth
2018-05-02 07:46:53 -07:00
Fred Park
414d1ed9fd
Update dependencies
...
- Update 3rd party notices
- Unpin pip for travis
2018-05-01 13:45:11 -07:00
Fred Park
648941371c
Extend retry policy for all clients
...
- Minor fix in node prep
2018-05-01 13:18:50 -07:00
Fred Park
7c5ca646dc
Update Singularity to 2.5.0
2018-05-01 10:57:15 -07:00
Fred Park
c94508891b
Pin nvidia-docker2 installations
...
- Use data-root instead of graph in daemon config
- Restart docker service instead of sighup after nvidia-docker2 install
- Update Docker CE to 18.03.1
2018-05-01 09:31:45 -07:00
Fred Park
accc48773b
Remove Ubuntu 14.04 support
...
- Resolves #164
2018-04-27 13:01:40 -07:00
Fred Park
e2e8f90b62
Add boot diagnostics support
...
- Move to Ubuntu 18.04-LTS stable
- Enables serial console access
- Resolves #193
2018-04-27 12:59:58 -07:00
Fred Park
69235ba2c1
Add default_working_dir option
...
- Clarify where a container runs by default in the jobs config doc
- Resolves #190
2018-04-25 08:23:17 -07:00
Fred Park
8cd1884ac9
Fix documentation for registry creds ( #189 )
2018-04-25 07:40:02 -07:00
Fred Park
3a90ebda87
Async support for task file mover ( #188 )
2018-04-24 08:03:35 -07:00
Fred Park
2f800e1df8
Async add task for recurrent job manager ( #188 )
2018-04-24 07:54:23 -07:00
Fred Park
a676a5a8d9
Refactor max workers call ( #188 )
...
- Add async to del certs
2018-04-20 14:41:18 -07:00
Fred Park
72484b510b
Add product_iterables support for task factory
...
- Resolves #187
2018-04-20 11:21:34 -07:00
Fred Park
1756e57e92
Call concurrent actions asynchronously
...
- Resolves #188
- Add -no-generate-tunnel-script option to pool nodes grls
2018-04-20 10:32:47 -07:00
Fred Park
97d9ca09ce
Relax zip iterable type
...
- Partially addresses #187
2018-04-19 14:34:57 -07:00
Fred Park
073a37b1a3
Update Nvidia drivers
...
- CUDA 9.1 support for NCv1
- Update NV driver for CUDA 9.1 support and work with latest Ubuntu
16.04 releases
- Fix blacklisting of nouveau on CentOS
- Ensure persistence mode on reboot
2018-04-19 12:24:06 -07:00
Fred Park
32029bbe82
Enable nvidia persistence mode through reboots
2018-04-19 08:47:52 -07:00
Fred Park
54aea32a20
Update blobxfer to 1.2.0
2018-04-19 08:44:04 -07:00
Fred Park
935805155f
Update to Py3.6.5 for Windows Docker
...
- Minor build fixes in Appveyor
2018-04-19 08:40:55 -07:00
Fred Park
5d4ede9acc
Migrate RemoteFS clusters to 18.04
...
- Resolves #185
2018-04-18 12:35:41 -07:00
Fred Park
7f2200a31d
Update recipes to refer to platform image docs
...
- Resolves #186
2018-04-18 12:35:26 -07:00
Fred Park
350f1185d9
Allow AAD on storage credentials
...
- Resolves #179
2018-04-18 08:09:13 -07:00
Fred Park
4c099034f5
Add Windows file version info to build
2018-04-17 10:49:11 -07:00
Fred Park
ed477bbc22
Support JSON output for certain commands
...
- Resolves #177
2018-04-17 10:49:06 -07:00
Fred Park
727292f902
Fix task submission speed regression
...
- Resolves #183
2018-04-10 10:21:45 -07:00
Fred Park
fc6c5969b5
Unify nodeprep scripts
...
- Resolves #176
2018-04-09 11:01:44 -07:00
Fred Park
8660d27abd
Fix env var export for tasks without env vars
...
- Fixes the cat message and declare dumps into stderr and stdout,
respectively (#180 )
2018-04-04 19:04:18 -07:00
Fred Park
383bcfd7c0
Update Singularity to 2.4.5
2018-04-04 13:34:35 -07:00