Fred Park
8cb2ba9583
Allow GPU property to be optional for NC VMs
- Update all GPU compute recipes to omit gpu driver
2016-11-20 09:00:19 -08:00
Fred Park
453ae98a65
Terminate cascade on thread failures
2016-11-19 10:39:58 -08:00
Fred Park
c7744f95bf
Support for internet accessible private registries
2016-11-19 09:00:01 -08:00
Fred Park
4399dbf4db
Tag for 2.0.0rc3 release
- Fix flake8 issues
2016-11-14 11:10:59 -08:00
Fred Park
0ae05f2d84
Add CUDA_CACHE_ vars for GPU tasks
2016-11-13 11:42:56 -08:00
Fred Park
ff43949d0e
Add Keras+Theano recipes
2016-11-13 01:43:01 -08:00
Fred Park
b8fcdede8f
Add --tail option for jobs add
- Simplify quickstart with --tail
2016-11-12 22:35:36 -08:00
Fred Park
e6593281eb
Refactor direct config access out of data/storage
2016-11-12 12:35:56 -08:00
Fred Park
03ced70c38
Continue settings refactor
- Credentials
- Some of global config
2016-11-11 21:08:58 -08:00
Fred Park
e700ee05b7
Add docker login prior to image update
- Move docker hub creds to credentials json
- Begin refactor of configuration settings retrieval
2016-11-11 09:30:14 -08:00
Fred Park
da573524de
Preliminary steps for ACR support
- Fix update docker images with private registry
- Automatically clean dangling image refs on update
- Remove private registry file/image id support
- Refactor fleet initialization steps to one entry point
- Simplify shipyard context init
2016-11-10 09:48:00 -08:00
Fred Park
f8ac2ccc40
Add support for single node direct ingress
- Add missing support for relative_destination_path in single node transfers
2016-11-09 15:38:19 -08:00
Fred Park
ec60383723
Add pool udi command and listtasks job scoping
- Update docs
2016-11-09 11:59:01 -08:00
Fred Park
bbd948fdab
Add jobid/taskid scoping to data listfiles
- Update some docs
2016-11-08 15:06:40 -08:00
Fred Park
47f12b0bcb
Fix pool resize down with wait
2016-11-07 18:34:35 -08:00
Fred Park
9db24df307
Add relative destination path
- Check for vm_count for glusterfs setup
2016-11-07 10:23:43 -08:00
Fred Park
4742b77b17
Minor updates
2016-11-04 07:25:55 -07:00
Fred Park
162219f1b8
Add HPCG benchmark recipe
- Prepare for 2.0.0rc2 release
2016-11-02 08:47:39 -07:00
Fred Park
9c604d2572
More comprehensive pool resize logic with SSH user
2016-11-01 17:10:03 -07:00
Fred Park
155fabfb8d
Add HPL benchmark recipe
- Add SSH users on pool resize
- More improvements to install doc
2016-11-01 14:14:22 -07:00
Fred Park
f2a8baf00d
Jobs termtasks force and jobs add recreate options
- Add docker rm to termtasks command
- Clean up installation doc
2016-11-01 01:15:43 -07:00
Fred Park
fa1024d191
Improve Python2/3 compatibility
- Add generated sas key expiry config option
2016-10-31 08:45:05 -07:00
Fred Park
148e8e7a22
Add install and exec helper scripts
2016-10-29 23:12:44 -07:00
Fred Park
fa135bf9d7
Automatically download stdout/err for pool failure
2016-10-29 11:00:31 -07:00
Fred Park
6c95312115
Update CHANGELOG for 2.0.0rc1 tag
2016-10-28 10:16:57 -07:00
Fred Park
72f9c90baf
Remove name requirement for multi-instance tasks
- Update TensorFlow-Distributed gpu launcher script to autodetect gpus
- Separate config scripts for TensorFlow-Distributed into CPU and GPU
2016-10-27 23:42:57 -07:00
Fred Park
5f160e8938
Update docs for new CLI
- CLI fixes
- Add more convenience subcommands
2016-10-27 22:11:46 -07:00
Fred Park
3a03a0d378
CLI conversion to click
- Move version to init
- Remove version from tfm
- Add more subcommands
2016-10-27 09:56:18 -07:00
Fred Park
beeb118b19
Add generated_file_export_path option
- Add Dockerfile for cli
- Update docs for docker cli
- Update travis build to include tfm
2016-10-25 22:15:35 -07:00
Fred Park
2266d062a1
Add MXNet recipes
- Disable mounting /opt/intel for SLES-HPC hosts
2016-10-24 22:31:14 -07:00
Fred Park
705ae40065
Add support for pool resize up with GlusterFS
- Update azure-batch dependency to 1.1.0
2016-10-24 10:08:13 -07:00
Fred Park
9a13dbe83b
Update CNTK to 1.7.2 and recipes
- Fix python2+Windows file encoding issue
- Add deljobswait action
2016-10-22 22:43:48 -07:00
Fred Park
92464b3b54
Add Azure Batch Task data ingress
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park
481d298e7c
Add credential encryption guide
2016-10-20 10:48:25 -07:00
Fred Park
74d3eea339
Add Encrypted Credential support
2016-10-19 21:14:53 -07:00
Fred Park
bb515e3812
Add Data Movement guide
2016-10-17 13:33:12 -07:00
Fred Park
1436cd4378
Add compute node to Azure storage egress support
2016-10-16 16:57:48 -07:00
Fred Park
bd7101df16
Rename generate_tunnel_script property
- Add Torch-CPU to quickstart
2016-10-15 17:54:16 -07:00
Fred Park
dc1c8d46a3
Add include pattern support for gettaskallfiles
2016-10-15 14:33:12 -07:00
Fred Park
ed84011383
Add Azure File ingress support
2016-10-15 13:58:49 -07:00
Fred Park
33300c551c
Add pool/job/task-level data ingress support
2016-10-14 15:49:20 -07:00
Fred Park
b0d3b9ba69
Data ingress support to Azure Blob Storage
2016-10-14 07:42:53 -07:00
Fred Park
d88475baa7
Add listjobs, listtasks, gettaskallfiles actions
2016-10-13 14:11:52 -07:00
Fred Park
4ce2f1d6c2
Add HPN-SSH support for Ubuntu
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park
a4ec217f66
First stage in shipyard modularization
- Update configuration docs for new data ingress spec
2016-10-09 15:22:15 -07:00
Fred Park
edec6f0584
Add scp and multinode_scp ingress support
2016-10-09 11:51:02 -07:00
Fred Park
487223e8fa
Change pool config ssh_docker_tunnel to ssh
2016-10-06 11:03:10 -07:00
Fred Park
d83d39f64c
Add version
2016-10-05 09:39:02 -07:00
Fred Park
3590f7bf2d
Add Torch-CPU and Torch-GPU recipes
2016-10-03 20:02:29 -07:00
Fred Park
b7a5335874
Add preliminary SUSE SLES-HPC support for IB
2016-09-30 22:00:16 -07:00
Fred Park
88862ad57d
Fix GlusterFS setup on Ubuntu
- OpenFOAM default swap from v1606+ to 4.0
2016-09-29 19:15:38 -07:00
Fred Park
be36e2face
Add OpenFOAM-Infiniband-IntelMPI recipe
- Add real NAMD-Infiniband-IntelMPI image
- GlusterFS mountpoint now inside AZ_BATCH_NODE_SHARED_DIR
2016-09-28 21:03:36 -07:00
Fred Park
2ac48b846d
Add NAMD-GPU recipe
2016-09-26 11:17:50 -07:00
Fred Park
c57abc8636
Add OpenFOAM-TCP-OpenMPI recipe
2016-09-22 13:39:18 -07:00
Fred Park
6aea6782ce
Update Azure File DVD to 0.5.1
- Update quickstart to accommodate choice
- Change STANDARD_F1 to STANDARD_D1_V2 for some recipes
2016-09-21 09:23:00 -07:00
Fred Park
a3d19ddb9d
Add Caffe-CPU, TensorFlow-CPU recipes
2016-09-19 14:26:57 -07:00
Fred Park
1188a1e885
Fix shipyard container detach/cleanup
- Add @FIRSTRUNNING task id for streamfile/gettaskfile
2016-09-18 17:01:42 -07:00
Fred Park
cd9a4e5bb5
Deterministic remote login settings output
2016-09-16 22:20:07 -07:00
Fred Park
d4d32fb699
Fix inter-node comm omitted keyerror
2016-09-16 17:54:49 -07:00
Fred Park
a2f99720f7
Add --configdir argument for convenience
2016-09-16 14:39:44 -07:00
Fred Park
ea702d3a43
Add quickstart doc
- Disable ssh tunnel creation without public key on Windows
2016-09-16 13:45:12 -07:00
Fred Park
9d12dab8be
Fix cascade start issue without private registry
- Add GlusterFS support for ubuntu and opensuse/sles
- Add --filespec and --verbose parameters
2016-09-16 11:46:59 -07:00
Fred Park
08204092be
Add CentOS GlusterFS support
- Update recipes
2016-09-15 12:47:43 -07:00
Fred Park
646cff6631
Add TensorFlow-Distributed recipe
- Fix SSH user expiry within 1 day
- Fix some README/dockerfile typos
2016-09-13 11:43:28 -07:00
Fred Park
af98bdfb57
Add sample configs for all existing recipes
- Fix temp file creation for cross-platform
2016-09-09 13:49:59 -07:00
Fred Park
e8d5e7a8a3
Automatically detect nvidia driver version
- Fix azure-storage dependencies for non-shipyard docker image setup
- Add no-install-recommends to apt-gets in node prep
2016-09-08 21:06:21 -07:00
Fred Park
0338fc612f
Add gettaskfile/getnodefile actions
- Add .gitattributes to designate text files eol as LF
- Update configuration doc with required/optional tags
2016-09-08 20:15:11 -07:00
Fred Park
b4e6e90f1d
Add FFmpeg GPU recipe
- Fix NV-series provisioning
- Fix up various READMEs
- Add maintained by tags in Dockerfiles
- Add missing config flag in jobs json
- Fix non-Docker shipyard azure-storage req
2016-09-08 11:52:21 -07:00
Fred Park
028768a61a
Add CNTK recipes
- Add TCP optimization
- Fix job autocompletion
- Update azure-storage requirement to 0.33.0
2016-09-07 21:40:57 -07:00
Fred Park
eea254aeb9
Add job auto-completion for multi-instance tasks
2016-09-06 10:32:42 -07:00
Fred Park
e52e30cf0c
Fix docker images issue with non-p2p transfer
- Fix node prep cascade timing issues
- Update various READMEs
2016-09-02 15:02:07 -07:00
Fred Park
cc0a3401ff
Update Changelog for tag
2016-09-01 09:40:24 -07:00
Fred Park
a5387fc904
Update README
2016-08-31 21:43:03 -07:00
Fred Park
dad22994bc
Rename sample configs as config templates
- Add Changelog file
2016-08-31 09:38:33 -07:00