Fred Park
7060366213
Update dependencies
2018-07-17 11:18:20 -07:00
Fred Park
66e77ac397
Update dependencies
...
- Fixes in scripts for cascade and monitor cert renewal
2018-06-27 15:32:22 -07:00
Fred Park
3f30ba8d07
Support a fallback registry for system images
...
- Resolves #217
- Add misc mirror-images command
- Pass Singularity version to bootstrap
- Fix GlusterFS on compute provisioning, resolves #220
2018-06-26 12:22:09 -07:00
Fred Park
b77a147766
Continue Prometheus integration support
...
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park
0bf9399061
Fix some post-release issues
...
- Shellcheck-induced regressions in containers
- Split Docker image build into own build env on appveyor
- Update to Python 3.6.5 for cargo Docker image build on Windows
2018-05-02 10:24:38 -07:00
Fred Park
7c5ca646dc
Update Singularity to 2.5.0
2018-05-01 10:57:15 -07:00
Fred Park
383bcfd7c0
Update Singularity to 2.4.5
2018-04-04 13:34:35 -07:00
Fred Park
c1a92e4138
Fix scripts to be Shellcheck clean ( #178 )
2018-04-04 13:34:09 -07:00
Fred Park
2b3ecac70b
Update Singularity to 2.4.4
2018-03-19 14:07:12 -07:00
Fred Park
a195d7e242
Update dependencies
2018-02-28 15:08:33 -08:00
Fred Park
b4e6e4320d
Various updates
...
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park
5853b4787f
Update Docker images
...
- Update to alpine 3.7
- Update Windows images to Python 3.6.4
- Update libtorrent image
- Update to Singularity 2.4.2
2018-01-22 14:14:39 -08:00
Fred Park
f9765c98a2
Fix file naming for Docker image under singularity
2017-11-15 10:01:13 -08:00
Fred Park
17497ff06d
Fix non-Ubuntu/CentOS cascade failures
2017-11-13 09:25:25 -08:00
Fred Park
90283298e6
Update dependencies
2017-11-10 09:23:15 -08:00
Fred Park
5afa97de45
Fix default Singularity image names with tags
2017-11-09 13:07:10 -08:00
Fred Park
5b2af24f00
Retry image configuration errors
...
- Add TensorFlow-GPU Singularity recipe
2017-10-29 20:29:34 -07:00
Fred Park
6da607c9b9
Multi-instance/IB support for Singularity tasks
...
- Make cascade work in Docker container
2017-10-22 13:59:35 -07:00
Fred Park
48172e115e
Add initial Singularity task support
...
- Auto-GPU
- Fix ownership issues with Singularity image pre-load
2017-10-20 23:10:12 -07:00
Fred Park
4e5d5abf6b
Add Singularity support into cascade
...
- Remove singularity suport in native container support pools as it's
impossible to execute a singularity container in this mode
2017-10-17 18:51:27 -07:00
Fred Park
607bfd252e
Migrate to storage split library
...
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park
01995e97b6
Tag for 3.0.0a1 release
2017-10-04 09:26:01 -07:00
Fred Park
e783744e00
Container registry logic overhaul
...
- Remove private registry back to Azure storage blob support (#44 )
- Require fully qualified Docker image names (#106 )
- Support multiple public/private registries on a single pool (#127 )
2017-10-03 18:24:42 -07:00
Fred Park
9602608871
Tag for 2.9.4 release
2017-09-12 08:56:03 -07:00
Fred Park
832a32e375
Use multi-stage build for cascade
2017-08-15 19:14:37 -07:00
Fred Park
e32fc4d93e
Add Autopool support
...
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
detected
2017-07-21 11:10:03 -07:00
Fred Park
5291ff1130
Move to blob leasing for download ticketing
...
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park
de45b18a67
Add backoff to cascade docker image pull retries
2017-07-01 01:25:30 -07:00
Fred Park
2a48885da1
More improvements for scale out robustness
...
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park
06188c1944
Tag for 2.8.0rc2 release
...
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00
Fred Park
54422ce2eb
Add retry handling for cascade docker pull
...
- Add cascade.log download for start up failures
2017-06-27 09:28:03 -07:00
Fred Park
94bd35e21c
Update Dockerfiles to Alpine 3.6
...
- Resolves #65
2017-06-26 11:12:11 -07:00
Fred Park
a17d6b64c9
Update dependencies
...
- Fix breaking changes in keyvault library
- Fix inverted order for fs cluster ssh and optional command
2017-05-11 09:21:20 -07:00
Fred Park
9acdc2e000
Dockerfile updates
2017-04-30 00:08:02 -07:00
Fred Park
7c7fac238c
Minor doc updates
2017-04-04 07:26:32 -07:00
Fred Park
96395fa68a
Allow docker_images to be empty
2017-04-03 14:20:20 -07:00
Fred Park
cb7b42a231
Support glusterfs <-> pool autolinking
...
- Support glusterfs expand (additional disks)
- Provide `mount_options` for `file_server` which applies to local mount
on the file server of the disks
- Allow gluster volume name to be specified
- Provide stronger cross-checking between pool virtual network and
storage cluster virtual network
- Increase ud/fd in AS to maximums
- Install acl tools for nfsv4 and glusterfs
2017-03-11 15:23:55 -08:00
Fred Park
9088cde886
Add python precompile to Dockerfiles
2017-03-08 09:52:21 -08:00
Fred Park
453ae98a65
Terminate cascade on thread failures
2016-11-19 10:39:58 -08:00
Fred Park
c7744f95bf
Support for internet accessible private registries
2016-11-19 09:00:01 -08:00
Fred Park
4399dbf4db
Tag for 2.0.0rc3 release
...
- Fix flake8 issues
2016-11-14 11:10:59 -08:00
Fred Park
03ced70c38
Continue settings refactor
...
- Credentials
- Some of global config
2016-11-11 21:08:58 -08:00
Fred Park
da573524de
Preliminary steps for ACR support
...
- Fix update docker images with private registry
- Automatically clean dangling image refs on update
- Remove private registry file/image id support
- Refactor fleet initialization steps to one entry point
- Simplify shipyard context init
2016-11-10 09:48:00 -08:00
Fred Park
efb8c3105f
Add wait option for pool resize
...
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park
80931c544f
Minor fixes/typos
2016-10-28 11:03:40 -07:00
Fred Park
0a702d1f8b
Prep for multi image Batch-Shipyard docker repo
2016-10-25 15:02:28 -07:00
Fred Park
92464b3b54
Add Azure Batch Task data ingress
...
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park
4ce2f1d6c2
Add HPN-SSH support for Ubuntu
...
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park
edec6f0584
Add scp and multinode_scp ingress support
2016-10-09 11:51:02 -07:00
Fred Park
646cff6631
Add TensorFlow-Distributed recipe
...
- Fix SSH user expiry within 1 day
- Fix some README/dockerfile typos
2016-09-13 11:43:28 -07:00