Граф коммитов

48 Коммитов

Автор SHA1 Сообщение Дата
Fred Park b4e6e4320d
Various updates
- Fix image update to work in multi-instance mode with registry logins
- Allow CentOS 7.3 provisioning to continue to work
- Allow CentOS-HPC 7.1 provisioning
- Add CentOS 7.4 support
- Add Debian 9 support
- Update dependencies
2018-02-16 09:18:34 -08:00
Fred Park 5853b4787f
Update Docker images
- Update to alpine 3.7
- Update Windows images to Python 3.6.4
- Update libtorrent image
- Update to Singularity 2.4.2
2018-01-22 14:14:39 -08:00
Fred Park f9765c98a2 Fix file naming for Docker image under singularity 2017-11-15 10:01:13 -08:00
Fred Park 17497ff06d Fix non-Ubuntu/CentOS cascade failures 2017-11-13 09:25:25 -08:00
Fred Park 90283298e6 Update dependencies 2017-11-10 09:23:15 -08:00
Fred Park 5afa97de45 Fix default Singularity image names with tags 2017-11-09 13:07:10 -08:00
Fred Park 5b2af24f00 Retry image configuration errors
- Add TensorFlow-GPU Singularity recipe
2017-10-29 20:29:34 -07:00
Fred Park 6da607c9b9 Multi-instance/IB support for Singularity tasks
- Make cascade work in Docker container
2017-10-22 13:59:35 -07:00
Fred Park 48172e115e Add initial Singularity task support
- Auto-GPU
- Fix ownership issues with Singularity image pre-load
2017-10-20 23:10:12 -07:00
Fred Park 4e5d5abf6b Add Singularity support into cascade
- Remove singularity suport in native container support pools as it's
  impossible to execute a singularity container in this mode
2017-10-17 18:51:27 -07:00
Fred Park 607bfd252e Migrate to storage split library
- Remove queue deletion code
- Resolves #133
2017-10-05 21:40:50 -07:00
Fred Park 01995e97b6 Tag for 3.0.0a1 release 2017-10-04 09:26:01 -07:00
Fred Park e783744e00 Container registry logic overhaul
- Remove private registry back to Azure storage blob support (#44)
- Require fully qualified Docker image names (#106)
- Support multiple public/private registries on a single pool (#127)
2017-10-03 18:24:42 -07:00
Fred Park 9602608871 Tag for 2.9.4 release 2017-09-12 08:56:03 -07:00
Fred Park 832a32e375 Use multi-stage build for cascade 2017-08-15 19:14:37 -07:00
Fred Park e32fc4d93e Add Autopool support
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
  detected
2017-07-21 11:10:03 -07:00
Fred Park 5291ff1130 Move to blob leasing for download ticketing
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park de45b18a67 Add backoff to cascade docker image pull retries 2017-07-01 01:25:30 -07:00
Fred Park 2a48885da1 More improvements for scale out robustness
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park 06188c1944 Tag for 2.8.0rc2 release
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00
Fred Park 54422ce2eb Add retry handling for cascade docker pull
- Add cascade.log download for start up failures
2017-06-27 09:28:03 -07:00
Fred Park 94bd35e21c Update Dockerfiles to Alpine 3.6
- Resolves #65
2017-06-26 11:12:11 -07:00
Fred Park a17d6b64c9 Update dependencies
- Fix breaking changes in keyvault library
- Fix inverted order for fs cluster ssh and optional command
2017-05-11 09:21:20 -07:00
Fred Park 9acdc2e000 Dockerfile updates 2017-04-30 00:08:02 -07:00
Fred Park 7c7fac238c Minor doc updates 2017-04-04 07:26:32 -07:00
Fred Park 96395fa68a Allow docker_images to be empty 2017-04-03 14:20:20 -07:00
Fred Park cb7b42a231 Support glusterfs <-> pool autolinking
- Support glusterfs expand (additional disks)
- Provide `mount_options` for `file_server` which applies to local mount
on the file server of the disks
- Allow gluster volume name to be specified
- Provide stronger cross-checking between pool virtual network and
storage cluster virtual network
- Increase ud/fd in AS to maximums
- Install acl tools for nfsv4 and glusterfs
2017-03-11 15:23:55 -08:00
Fred Park 9088cde886 Add python precompile to Dockerfiles 2017-03-08 09:52:21 -08:00
Fred Park 453ae98a65 Terminate cascade on thread failures 2016-11-19 10:39:58 -08:00
Fred Park c7744f95bf Support for internet accessible private registries 2016-11-19 09:00:01 -08:00
Fred Park 4399dbf4db Tag for 2.0.0rc3 release
- Fix flake8 issues
2016-11-14 11:10:59 -08:00
Fred Park 03ced70c38 Continue settings refactor
- Credentials
- Some of global config
2016-11-11 21:08:58 -08:00
Fred Park da573524de Preliminary steps for ACR support
- Fix update docker images with private registry
- Automatically clean dangling image refs on update
- Remove private registry file/image id support
- Refactor fleet initialization steps to one entry point
- Simplify shipyard context init
2016-11-10 09:48:00 -08:00
Fred Park efb8c3105f Add wait option for pool resize
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park 80931c544f Minor fixes/typos 2016-10-28 11:03:40 -07:00
Fred Park 0a702d1f8b Prep for multi image Batch-Shipyard docker repo 2016-10-25 15:02:28 -07:00
Fred Park 92464b3b54 Add Azure Batch Task data ingress
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park 4ce2f1d6c2 Add HPN-SSH support for Ubuntu
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park edec6f0584 Add scp and multinode_scp ingress support 2016-10-09 11:51:02 -07:00
Fred Park 646cff6631 Add TensorFlow-Distributed recipe
- Fix SSH user expiry within 1 day
- Fix some README/dockerfile typos
2016-09-13 11:43:28 -07:00
Fred Park 028768a61a Add CNTK recipes
- Add TCP optimization
- Fix job autocompletion
- Update azure-storage requirement to 0.33.0
2016-09-07 21:40:57 -07:00
Fred Park eea254aeb9 Add job auto-completion for multi-instance tasks 2016-09-06 10:32:42 -07:00
Fred Park e52e30cf0c Fix docker images issue with non-p2p transfer
- Fix node prep cascade timing issues
- Update various READMEs
2016-09-02 15:02:07 -07:00
Fred Park f9dac5bd93 Add GPU documentation
- Fix node prep issues with GPU
- Correct node prep finished file location
- Add TensorFlow-GPU recipe
2016-09-02 09:39:35 -07:00
Fred Park 7f49641074 First part of the guide/docs
- Modify placement of some configuration settings
2016-08-31 15:35:33 -07:00
Fred Park 35fb3f588b Add support for more host OSes
- Ubuntu 14.04, Debian 8, CentOS 7.x, RHEL 7.x, OpenSUSE 13.2/42.1,
  SLES 12/12-sp1
- Improve graphing
- Prevent metadata clear on existing pool
2016-08-28 19:43:53 -07:00
Fred Park 4cb1f79c07 Update docstrings and typing information
- Add MIT license text to all py files
2016-08-27 17:42:36 -07:00
Fred Park 703e9e7fd1 Reorganize project 2016-08-27 11:35:32 -07:00