Граф коммитов

58 Коммитов

Автор SHA1 Сообщение Дата
Fred Park 7060366213
Update dependencies 2018-07-17 11:18:20 -07:00
Fred Park 66e77ac397
Update dependencies
- Fixes in scripts for cascade and monitor cert renewal
2018-06-27 15:32:22 -07:00
Fred Park 3f30ba8d07
Support a fallback registry for system images
- Resolves #217
- Add misc mirror-images command
- Pass Singularity version to bootstrap
- Fix GlusterFS on compute provisioning, resolves #220
2018-06-26 12:22:09 -07:00
Fred Park b77a147766
Continue Prometheus integration support
- Add nginx reverse proxy and letsencrypt cert support
- Add let's encrypt options
- Add picket into compose
- crontab cert renewal
- Add inbound rule management for temporary ACME challenge on port 80
- Update to node exporter 0.16
- Fixup various issues
2018-06-04 09:04:30 -07:00
Fred Park 0bf9399061
Fix some post-release issues
- Shellcheck-induced regressions in containers
- Split Docker image build into own build env on appveyor
- Update to Python 3.6.5 for cargo Docker image build on Windows
2018-05-02 10:24:38 -07:00
Fred Park 7c5ca646dc
Update Singularity to 2.5.0 2018-05-01 10:57:15 -07:00
Fred Park 383bcfd7c0
Update Singularity to 2.4.5 2018-04-04 13:34:35 -07:00
Fred Park c1a92e4138
Fix scripts to be Shellcheck clean (#178) 2018-04-04 13:34:09 -07:00
Fred Park 2b3ecac70b
Update Singularity to 2.4.4 2018-03-19 14:07:12 -07:00
Fred Park a195d7e242
Update dependencies 2018-02-28 15:08:33 -08:00
Fred Park b4e6e4320d
Various updates
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park 5853b4787f
Update Docker images
- Update to alpine 3.7
- Update Windows images to Python 3.6.4
- Update libtorrent image
- Update to Singularity 2.4.2
2018-01-22 14:14:39 -08:00
Fred Park f9765c98a2 Fix file naming for Docker image under singularity 2017-11-15 10:01:13 -08:00
Fred Park 17497ff06d Fix non-Ubuntu/CentOS cascade failures 2017-11-13 09:25:25 -08:00
Fred Park 90283298e6 Update dependencies 2017-11-10 09:23:15 -08:00
Fred Park 5afa97de45 Fix default Singularity image names with tags 2017-11-09 13:07:10 -08:00
Fred Park 5b2af24f00 Retry image configuration errors
- Add TensorFlow-GPU Singularity recipe
2017-10-29 20:29:34 -07:00
Fred Park 6da607c9b9 Multi-instance/IB support for Singularity tasks
- Make cascade work in Docker container
2017-10-22 13:59:35 -07:00
Fred Park 48172e115e Add initial Singularity task support
- Auto-GPU
- Fix ownership issues with Singularity image pre-load
2017-10-20 23:10:12 -07:00
Fred Park 4e5d5abf6b Add Singularity support into cascade
- Remove singularity suport in native container support pools as it's
  impossible to execute a singularity container in this mode
2017-10-17 18:51:27 -07:00
Fred Park 607bfd252e Migrate to storage split library
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park 01995e97b6 Tag for 3.0.0a1 release 2017-10-04 09:26:01 -07:00
Fred Park e783744e00 Container registry logic overhaul
- Remove private registry back to Azure storage blob support (#44)
- Require fully qualified Docker image names (#106)
- Support multiple public/private registries on a single pool (#127)
2017-10-03 18:24:42 -07:00
Fred Park 9602608871 Tag for 2.9.4 release 2017-09-12 08:56:03 -07:00
Fred Park 832a32e375 Use multi-stage build for cascade 2017-08-15 19:14:37 -07:00
Fred Park e32fc4d93e Add Autopool support
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
  detected
2017-07-21 11:10:03 -07:00
Fred Park 5291ff1130 Move to blob leasing for download ticketing
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park de45b18a67 Add backoff to cascade docker image pull retries 2017-07-01 01:25:30 -07:00
Fred Park 2a48885da1 More improvements for scale out robustness
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park 06188c1944 Tag for 2.8.0rc2 release
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00
Fred Park 54422ce2eb Add retry handling for cascade docker pull
- Add cascade.log download for start up failures
2017-06-27 09:28:03 -07:00
Fred Park 94bd35e21c Update Dockerfiles to Alpine 3.6
- Resolves #65
2017-06-26 11:12:11 -07:00
Fred Park a17d6b64c9 Update dependencies
- Fix breaking changes in keyvault library
- Fix inverted order for fs cluster ssh and optional command
2017-05-11 09:21:20 -07:00
Fred Park 9acdc2e000 Dockerfile updates 2017-04-30 00:08:02 -07:00
Fred Park 7c7fac238c Minor doc updates 2017-04-04 07:26:32 -07:00
Fred Park 96395fa68a Allow docker_images to be empty 2017-04-03 14:20:20 -07:00
Fred Park cb7b42a231 Support glusterfs <-> pool autolinking
- Support glusterfs expand (additional disks)
- Provide `mount_options` for `file_server` which applies to local mount
on the file server of the disks
- Allow gluster volume name to be specified
- Provide stronger cross-checking between pool virtual network and
storage cluster virtual network
- Increase ud/fd in AS to maximums
- Install acl tools for nfsv4 and glusterfs
2017-03-11 15:23:55 -08:00
Fred Park 9088cde886 Add python precompile to Dockerfiles 2017-03-08 09:52:21 -08:00
Fred Park 453ae98a65 Terminate cascade on thread failures 2016-11-19 10:39:58 -08:00
Fred Park c7744f95bf Support for internet accessible private registries 2016-11-19 09:00:01 -08:00
Fred Park 4399dbf4db Tag for 2.0.0rc3 release
- Fix flake8 issues
2016-11-14 11:10:59 -08:00
Fred Park 03ced70c38 Continue settings refactor
- Credentials
- Some of global config
2016-11-11 21:08:58 -08:00
Fred Park da573524de Preliminary steps for ACR support
- Fix update docker images with private registry
- Automatically clean dangling image refs on update
- Remove private registry file/image id support
- Refactor fleet initialization steps to one entry point
- Simplify shipyard context init
2016-11-10 09:48:00 -08:00
Fred Park efb8c3105f Add wait option for pool resize
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park 80931c544f Minor fixes/typos 2016-10-28 11:03:40 -07:00
Fred Park 0a702d1f8b Prep for multi image Batch-Shipyard docker repo 2016-10-25 15:02:28 -07:00
Fred Park 92464b3b54 Add Azure Batch Task data ingress
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park 4ce2f1d6c2 Add HPN-SSH support for Ubuntu
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park edec6f0584 Add scp and multinode_scp ingress support 2016-10-09 11:51:02 -07:00
Fred Park 646cff6631 Add TensorFlow-Distributed recipe
- Fix SSH user expiry within 1 day
- Fix some README/dockerfile typos
2016-09-13 11:43:28 -07:00