Fred Park
162219f1b8
Add HPCG benchmark recipe
...
- Prepare for 2.0.0rc2 release
2016-11-02 08:47:39 -07:00
Fred Park
9c604d2572
More comprehensive pool resize logic with SSH user
2016-11-01 17:10:03 -07:00
Fred Park
155fabfb8d
Add HPL benchmark recipe
...
- Add SSH users on pool resize
- More improvements to install doc
2016-11-01 14:14:22 -07:00
Fred Park
f2a8baf00d
Jobs termtasks force and jobs add recreate options
...
- Add docker rm to termtasks command
- Clean up installation doc
2016-11-01 01:15:43 -07:00
Fred Park
8f002d9510
Migrate to from . import syntax
2016-10-31 21:05:07 -07:00
Fred Park
07efc73344
Fix install script and python version
2016-10-31 10:27:39 -07:00
Fred Park
1f0cbcb6ca
Fix GlusterFS direct ingress with one directory
2016-10-31 09:31:04 -07:00
Fred Park
fa1024d191
Improve Python2/3 compatibility
...
- Add generated sas key expiry config option
2016-10-31 08:45:05 -07:00
Fred Park
e7b9faae68
Refactor json config loading
...
- Catch ValueError on json load and give more precise issue text
2016-10-30 15:45:58 -07:00
Fred Park
efb8c3105f
Add wait option for pool resize
...
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park
eb2f108e86
Add TMPDIR redirect
...
- Fix Debian Jessie docker opts not loading
2016-10-29 23:27:23 -07:00
Fred Park
148e8e7a22
Add install and exec helper scripts
2016-10-29 23:12:44 -07:00
Fred Park
fa135bf9d7
Automatically download stdout/err for pool failure
2016-10-29 11:00:31 -07:00
Fred Park
8c8f42b452
Various fixes
...
- Fix RDMA instances set check during jobs add
- Remove unicode_literals import from cli
- Fix json file loading to use pathlib open
- Fix docker container termination issues in termtasks and job release
2016-10-28 17:44:39 -07:00
Fred Park
80931c544f
Minor fixes/typos
2016-10-28 11:03:40 -07:00
Fred Park
6c95312115
Update CHANGELOG for 2.0.0rc1 tag
2016-10-28 10:16:57 -07:00
Fred Park
231d5dfbe1
Add deltasks, termtasks subcommands for jobs
...
- Use docker kill to terminate tasks
- Add scoping to jobs del/term
2016-10-28 10:08:59 -07:00
Fred Park
72f9c90baf
Remove name requirement for multi-instance tasks
...
- Update TensorFlow-Distributed gpu launcher script to autodetect gpus
- Separate config scripts for TensorFlow-Distributed into CPU and GPU
2016-10-27 23:42:57 -07:00
Fred Park
5f160e8938
Update docs for new CLI
...
- CLI fixes
- Add more convenience subcommands
2016-10-27 22:11:46 -07:00
Fred Park
818a1cda03
Add a contributing recipes guide
2016-10-27 12:53:12 -07:00
Fred Park
3a03a0d378
CLI conversion to click
...
- Move version to init
- Remove version from tfm
- Add more subcommands
2016-10-27 09:56:18 -07:00
Fred Park
ef37280616
Update MXNet recipes
2016-10-26 11:18:52 -07:00
Fred Park
beeb118b19
Add generated_file_export_path option
...
- Add Dockerfile for cli
- Update docs for docker cli
- Update travis build to include tfm
2016-10-25 22:15:35 -07:00
Fred Park
0a702d1f8b
Prep for multi image Batch-Shipyard docker repo
2016-10-25 15:02:28 -07:00
Fred Park
2266d062a1
Add MXNet recipes
...
- Disable mounting /opt/intel for SLES-HPC hosts
2016-10-24 22:31:14 -07:00
Fred Park
180627a229
Default SLES docker install to module
2016-10-24 12:51:25 -07:00
Fred Park
de1f3e39b2
reboot_on_start_task_failed to false in recipes
2016-10-24 10:13:46 -07:00
Fred Park
705ae40065
Add support for pool resize up with GlusterFS
...
- Update azure-batch dependency to 1.1.0
2016-10-24 10:08:13 -07:00
Fred Park
9a13dbe83b
Update CNTK to 1.7.2 and recipes
...
- Fix python2+Windows file encoding issue
- Add deljobswait action
2016-10-22 22:43:48 -07:00
Fred Park
3e5873b5df
Add ingress/egress overview picture to DM guide
...
- Prevent glusterfs pool allocation if internode comm not enabled
2016-10-21 09:04:28 -07:00
Fred Park
92464b3b54
Add Azure Batch Task data ingress
...
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park
481d298e7c
Add credential encryption guide
2016-10-20 10:48:25 -07:00
Fred Park
74d3eea339
Add Encrypted Credential support
2016-10-19 21:14:53 -07:00
Fred Park
bb515e3812
Add Data Movement guide
2016-10-17 13:33:12 -07:00
Fred Park
1436cd4378
Add compute node to Azure storage egress support
2016-10-16 16:57:48 -07:00
Fred Park
bd7101df16
Rename generate_tunnel_script property
...
- Add Torch-CPU to quickstart
2016-10-15 17:54:16 -07:00
Fred Park
dc1c8d46a3
Add include pattern support for gettaskallfiles
2016-10-15 14:33:12 -07:00
Fred Park
ed84011383
Add Azure File ingress support
2016-10-15 13:58:49 -07:00
Fred Park
33300c551c
Add pool/job/task-level data ingress support
2016-10-14 15:49:20 -07:00
Fred Park
9d0c6d3ca6
Integrate blob storage ingress with pool creation
2016-10-14 09:36:38 -07:00
Fred Park
b0d3b9ba69
Data ingress support to Azure Blob Storage
2016-10-14 07:42:53 -07:00
Fred Park
d88475baa7
Add listjobs, listtasks, gettaskallfiles actions
2016-10-13 14:11:52 -07:00
Fred Park
4ce2f1d6c2
Add HPN-SSH support for Ubuntu
...
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park
8101a1407f
Add arbitrary file split support
2016-10-12 15:23:27 -07:00
Fred Park
1bc8e60fe2
Add include/exclude filter support for source path
2016-10-12 09:30:06 -07:00
Fred Park
261984020e
Add rsync transfer methods
...
- Refactor data transfer functions into single/multinode
- Expand docs for data ingress
2016-10-11 10:39:54 -07:00
Fred Park
ae1e3c19d9
Further refactor shipyard into components
...
- Add convoy path for travis
2016-10-09 21:01:11 -07:00
Fred Park
a4ec217f66
First stage in shipyard modularization
...
- Update configuration docs for new data ingress spec
2016-10-09 15:22:15 -07:00
Fred Park
edec6f0584
Add scp and multinode_scp ingress support
2016-10-09 11:51:02 -07:00
Fred Park
487223e8fa
Change pool config ssh_docker_tunnel to ssh
2016-10-06 11:03:10 -07:00