Fred Park
e65dd9c196
Add CentOS-HPC 7.4 Support
...
- Fixup CentOS 7.4 GPU support
- Update LIS to 4.2.5
- Update packer scripts
- Resolves #184
2018-06-07 13:31:01 -07:00
Fred Park
911bcc8593
Fix blobxfer script regression from shellcheck
2018-06-07 10:50:42 -07:00
Fred Park
9f61db12c3
Autoprovision Grafana Dashboard
...
- Add default dashboard
- Allow arbitrary provisioning of additional dashboards
- Add monitor list command
- Add RemoteFS monitoring support
- Compact cadvisor
2018-06-07 10:50:37 -07:00
Fred Park
b77a147766
Continue Prometheus integration support
...
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park
ba9dc76f7c
Update gluster to 4.0
2018-06-04 09:03:03 -07:00
Fred Park
d4c6aa99ae
Support CentOS 7.4 GPU
...
- Add LIS installation support
- Update NC driver to 396.26
- Update dependencies
- Resolves #199
2018-06-04 09:03:03 -07:00
Fred Park
ae86b92be2
Start Prometheus monitoring integration
...
- Refactor package uploader for pool
- Auto install node exporter and cadvisor for prom enabled pools
- Add configuration
- Create monitoring resource
- Start work on picket monitor
2018-06-04 08:56:44 -07:00
Fred Park
648941371c
Extend retry policy for all clients
...
- Minor fix in node prep
2018-05-01 13:18:50 -07:00
Fred Park
7c5ca646dc
Update Singularity to 2.5.0
2018-05-01 10:57:15 -07:00
Fred Park
c94508891b
Pin nvidia-docker2 installations
...
- Use data-root instead of graph in daemon config
- Restart docker service instead of sighup after nvidia-docker2 install
- Update Docker CE to 18.03.1
2018-05-01 09:31:45 -07:00
Fred Park
accc48773b
Remove Ubuntu 14.04 support
...
- Resolves #164
2018-04-27 13:01:40 -07:00
Fred Park
073a37b1a3
Update Nvidia drivers
...
- CUDA 9.1 support for NCv1
- Update NV driver for CUDA 9.1 support and work with latest Ubuntu
16.04 releases
- Fix blacklisting of nouveau on CentOS
- Ensure persistence mode on reboot
2018-04-19 12:24:06 -07:00
Fred Park
32029bbe82
Enable nvidia persistence mode through reboots
2018-04-19 08:47:52 -07:00
Fred Park
5d4ede9acc
Migrate RemoteFS clusters to 18.04
...
- Resolves #185
2018-04-18 12:35:41 -07:00
Fred Park
fc6c5969b5
Unify nodeprep scripts
...
- Resolves #176
2018-04-09 11:01:44 -07:00
Fred Park
383bcfd7c0
Update Singularity to 2.4.5
2018-04-04 13:34:35 -07:00
Fred Park
c1a92e4138
Fix scripts to be Shellcheck clean ( #178 )
2018-04-04 13:34:09 -07:00
Fred Park
a98bbb5242
Disable kernel unattended upgrade on custom/native
...
- Resolves #174
2018-03-29 08:46:54 -07:00
Fred Park
a3dfa8c35f
Ensure nvidia driver is avail through upgrades
...
- Resolves #174
2018-03-28 10:14:12 -07:00
Fred Park
f1c27c366e
Update to Docker CE 18.03.0 for Ubuntu/CentOS
2018-03-26 09:26:38 -07:00
Fred Park
287c86ce0c
Fix NFS exports multi-target parsing
2018-03-26 09:25:06 -07:00
Fred Park
330c193422
Improve prep scripts
...
- Add timestamps for logging
- Add more Docker and nvidia details
- Save prior startup logs
- Update dependencies
2018-03-22 10:45:28 -07:00
Fred Park
2b3ecac70b
Update Singularity to 2.4.4
2018-03-19 14:07:12 -07:00
Fred Park
088f2d5e34
Add support for arbitrary exports config for NFS
...
- Add user agent for ARM clients
- Update README
2018-03-13 15:03:31 -07:00
Fred Park
78b3342666
Update Docker CE to 17.12.1
2018-03-05 13:50:11 -08:00
Fred Park
f0c9656ca2
Fix nvidia-docker overwriting daemon.json
...
- Update packer scripts
2018-02-28 15:09:22 -08:00
Fred Park
a195d7e242
Update dependencies
2018-02-28 15:08:33 -08:00
Fred Park
8664c6cfa6
Add additional TLS modes in powershell
...
- Resolves #171
2018-02-28 09:54:31 -08:00
Fred Park
370da96ed5
Do not automatically mount added fstab entries
...
- Disable Docker service install in non-native mode and always manually
start
2018-02-28 09:54:30 -08:00
Fred Park
cb04700f08
Improve node prep scripts
...
- Migrate to daemon.json files
- Fix missing blobfuse mount in native mode
- Ensure docker check happens every boot
2018-02-23 12:25:08 -08:00
Fred Park
53fcd2c313
Fix interaction between custom image and native
...
- Enable/start docker service on custom image in native mode if not
found
2018-02-22 18:51:33 -08:00
Fred Park
f3ab5ef489
Fix image update/list
...
- Fix issue with pure docker pools
- Fix updates/list over SSH for older distros requiring pseudo-tty
2018-02-16 09:18:41 -08:00
Fred Park
b4e6e4320d
Various updates
...
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park
ae20b27643
Add Custom Linux Mount support
...
- Add pre/post support for additional node prep commands
2018-02-12 11:20:34 -08:00
Fred Park
b3b98162c6
Add support for native mode image update
...
- Fix custom image + native pool startup for Linux
- pool images update over SSH fix
2018-02-08 14:25:28 -08:00
Fred Park
b40a812d3d
Tag for 3.1.0 release
2018-01-30 13:06:10 -08:00
Fred Park
b59d42022c
Upgrade nvidia-docker to nvidia-docker2
2018-01-29 12:42:00 -08:00
Fred Park
4d5b704905
Add support for Azure blob container mounts
...
- Support via blobfuse
- Resolves #159
2018-01-23 14:29:38 -08:00
Fred Park
20a2324eb7
Update Docker CE and blobxfer
2018-01-22 16:39:31 -08:00
Fred Park
5853b4787f
Update Docker images
...
- Update to alpine 3.7
- Update Windows images to Python 3.6.4
- Update libtorrent image
- Update to Singularity 2.4.2
2018-01-22 14:14:39 -08:00
Fred Park
a731ecc5f5
Support more than 16 disks per fileserver
2017-11-17 09:12:42 -08:00
Fred Park
954275696c
Ensure persistence daemon/mode
2017-11-13 09:25:25 -08:00
Fred Park
90283298e6
Update dependencies
2017-11-10 09:23:15 -08:00
Fred Park
2e8b43df55
Add Azure File mount support for Windows pools
...
- Fix coordination command of None issue
2017-11-05 10:38:22 -08:00
Fred Park
3e94831dc3
Build cargo Docker image for Windows
2017-11-03 16:20:50 -07:00
Fred Park
0411de5828
Windows task execution support
...
- Blobxfer on windows support
- Disable all native updateimages/udi
2017-11-03 16:20:27 -07:00
Fred Park
c9443fc91a
Initial Windows Server Container support
...
- pool updateimages command supporting singularity images
- fix aad mfa token cache on python2
2017-11-03 16:20:10 -07:00
Fred Park
5b2af24f00
Retry image configuration errors
...
- Add TensorFlow-GPU Singularity recipe
2017-10-29 20:29:34 -07:00
Fred Park
edd602aed4
Add HPCG Singularity recipe
2017-10-29 09:38:46 -07:00
Fred Park
6da607c9b9
Multi-instance/IB support for Singularity tasks
...
- Make cascade work in Docker container
2017-10-22 13:59:35 -07:00