Fred Park
b4e6e4320d
Various updates
...
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park
5853b4787f
Update Docker images
...
- Update to alpine 3.7
- Update Windows images to Python 3.6.4
- Update libtorrent image
- Update to Singularity 2.4.2
2018-01-22 14:14:39 -08:00
Fred Park
f9765c98a2
Fix file naming for Docker image under singularity
2017-11-15 10:01:13 -08:00
Fred Park
17497ff06d
Fix non-Ubuntu/CentOS cascade failures
2017-11-13 09:25:25 -08:00
Fred Park
90283298e6
Update dependencies
2017-11-10 09:23:15 -08:00
Fred Park
5afa97de45
Fix default Singularity image names with tags
2017-11-09 13:07:10 -08:00
Fred Park
5b2af24f00
Retry image configuration errors
...
- Add TensorFlow-GPU Singularity recipe
2017-10-29 20:29:34 -07:00
Fred Park
6da607c9b9
Multi-instance/IB support for Singularity tasks
...
- Make cascade work in Docker container
2017-10-22 13:59:35 -07:00
Fred Park
48172e115e
Add initial Singularity task support
...
- Auto-GPU
- Fix ownership issues with Singularity image pre-load
2017-10-20 23:10:12 -07:00
Fred Park
4e5d5abf6b
Add Singularity support into cascade
...
- Remove singularity suport in native container support pools as it's
impossible to execute a singularity container in this mode
2017-10-17 18:51:27 -07:00
Fred Park
607bfd252e
Migrate to storage split library
...
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park
01995e97b6
Tag for 3.0.0a1 release
2017-10-04 09:26:01 -07:00
Fred Park
e783744e00
Container registry logic overhaul
...
- Remove private registry back to Azure storage blob support (#44 )
- Require fully qualified Docker image names (#106 )
- Support multiple public/private registries on a single pool (#127 )
2017-10-03 18:24:42 -07:00
Fred Park
9602608871
Tag for 2.9.4 release
2017-09-12 08:56:03 -07:00
Fred Park
832a32e375
Use multi-stage build for cascade
2017-08-15 19:14:37 -07:00
Fred Park
e32fc4d93e
Add Autopool support
...
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
detected
2017-07-21 11:10:03 -07:00
Fred Park
5291ff1130
Move to blob leasing for download ticketing
...
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park
de45b18a67
Add backoff to cascade docker image pull retries
2017-07-01 01:25:30 -07:00
Fred Park
2a48885da1
More improvements for scale out robustness
...
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park
06188c1944
Tag for 2.8.0rc2 release
...
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00
Fred Park
54422ce2eb
Add retry handling for cascade docker pull
...
- Add cascade.log download for start up failures
2017-06-27 09:28:03 -07:00
Fred Park
94bd35e21c
Update Dockerfiles to Alpine 3.6
...
- Resolves #65
2017-06-26 11:12:11 -07:00
Fred Park
a17d6b64c9
Update dependencies
...
- Fix breaking changes in keyvault library
- Fix inverted order for fs cluster ssh and optional command
2017-05-11 09:21:20 -07:00
Fred Park
9acdc2e000
Dockerfile updates
2017-04-30 00:08:02 -07:00
Fred Park
7c7fac238c
Minor doc updates
2017-04-04 07:26:32 -07:00
Fred Park
96395fa68a
Allow docker_images to be empty
2017-04-03 14:20:20 -07:00
Fred Park
cb7b42a231
Support glusterfs <-> pool autolinking
...
- Support glusterfs expand (additional disks)
- Provide `mount_options` for `file_server` which applies to local mount
on the file server of the disks
- Allow gluster volume name to be specified
- Provide stronger cross-checking between pool virtual network and
storage cluster virtual network
- Increase ud/fd in AS to maximums
- Install acl tools for nfsv4 and glusterfs
2017-03-11 15:23:55 -08:00
Fred Park
9088cde886
Add python precompile to Dockerfiles
2017-03-08 09:52:21 -08:00
Fred Park
453ae98a65
Terminate cascade on thread failures
2016-11-19 10:39:58 -08:00
Fred Park
c7744f95bf
Support for internet accessible private registries
2016-11-19 09:00:01 -08:00
Fred Park
4399dbf4db
Tag for 2.0.0rc3 release
...
- Fix flake8 issues
2016-11-14 11:10:59 -08:00
Fred Park
03ced70c38
Continue settings refactor
...
- Credentials
- Some of global config
2016-11-11 21:08:58 -08:00
Fred Park
da573524de
Preliminary steps for ACR support
...
- Fix update docker images with private registry
- Automatically clean dangling image refs on update
- Remove private registry file/image id support
- Refactor fleet initialization steps to one entry point
- Simplify shipyard context init
2016-11-10 09:48:00 -08:00
Fred Park
efb8c3105f
Add wait option for pool resize
...
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park
80931c544f
Minor fixes/typos
2016-10-28 11:03:40 -07:00
Fred Park
0a702d1f8b
Prep for multi image Batch-Shipyard docker repo
2016-10-25 15:02:28 -07:00
Fred Park
92464b3b54
Add Azure Batch Task data ingress
...
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park
4ce2f1d6c2
Add HPN-SSH support for Ubuntu
...
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park
edec6f0584
Add scp and multinode_scp ingress support
2016-10-09 11:51:02 -07:00
Fred Park
646cff6631
Add TensorFlow-Distributed recipe
...
- Fix SSH user expiry within 1 day
- Fix some README/dockerfile typos
2016-09-13 11:43:28 -07:00
Fred Park
028768a61a
Add CNTK recipes
...
- Add TCP optimization
- Fix job autocompletion
- Update azure-storage requirement to 0.33.0
2016-09-07 21:40:57 -07:00
Fred Park
eea254aeb9
Add job auto-completion for multi-instance tasks
2016-09-06 10:32:42 -07:00
Fred Park
e52e30cf0c
Fix docker images issue with non-p2p transfer
...
- Fix node prep cascade timing issues
- Update various READMEs
2016-09-02 15:02:07 -07:00
Fred Park
f9dac5bd93
Add GPU documentation
...
- Fix node prep issues with GPU
- Correct node prep finished file location
- Add TensorFlow-GPU recipe
2016-09-02 09:39:35 -07:00
Fred Park
7f49641074
First part of the guide/docs
...
- Modify placement of some configuration settings
2016-08-31 15:35:33 -07:00
Fred Park
35fb3f588b
Add support for more host OSes
...
- Ubuntu 14.04, Debian 8, CentOS 7.x, RHEL 7.x, OpenSUSE 13.2/42.1,
SLES 12/12-sp1
- Improve graphing
- Prevent metadata clear on existing pool
2016-08-28 19:43:53 -07:00
Fred Park
4cb1f79c07
Update docstrings and typing information
...
- Add MIT license text to all py files
2016-08-27 17:42:36 -07:00
Fred Park
703e9e7fd1
Reorganize project
2016-08-27 11:35:32 -07:00