Commit Graph

140 Commits

Author SHA1 Message Date
Fred Park 162219f1b8 Add HPCG benchmark recipe
- Prepare for 2.0.0rc2 release
2016-11-02 08:47:39 -07:00
Fred Park 9c604d2572 More comprehensive pool resize logic with SSH user 2016-11-01 17:10:03 -07:00
Fred Park 155fabfb8d Add HPL benchmark recipe
- Add SSH users on pool resize
- More improvements to install doc
2016-11-01 14:14:22 -07:00
Fred Park f2a8baf00d Jobs termtasks force and jobs add recreate options
- Add docker rm to termtasks command
- Clean up installation doc
2016-11-01 01:15:43 -07:00
Fred Park 8f002d9510 Migrate to from . import syntax 2016-10-31 21:05:07 -07:00
Fred Park 07efc73344 Fix install script and python version 2016-10-31 10:27:39 -07:00
Fred Park 1f0cbcb6ca Fix GlusterFS direct ingress with one directory 2016-10-31 09:31:04 -07:00
Fred Park fa1024d191 Improve Python2/3 compatibility
- Add generated sas key expiry config option
2016-10-31 08:45:05 -07:00
Fred Park e7b9faae68 Refactor json config loading
- Catch ValueError on json load and give more precise issue text
2016-10-30 15:45:58 -07:00
Fred Park efb8c3105f Add wait option for pool resize
- Fix TMPDIR sed command
- Add generated shipyard script to gitignore
2016-10-30 01:44:57 -07:00
Fred Park eb2f108e86 Add TMPDIR redirect
- Fix Debian Jessie docker opts not loading
2016-10-29 23:27:23 -07:00
Fred Park 148e8e7a22 Add install and exec helper scripts 2016-10-29 23:12:44 -07:00
Fred Park fa135bf9d7 Automatically download stdout/err for pool failure 2016-10-29 11:00:31 -07:00
Fred Park 8c8f42b452 Various fixes
- Fix RDMA instances set check during jobs add
- Remove unicode_literals import from cli
- Fix json file loading to use pathlib open
- Fix docker container termination issues in termtasks and job release
2016-10-28 17:44:39 -07:00
Fred Park 80931c544f Minor fixes/typos 2016-10-28 11:03:40 -07:00
Fred Park 6c95312115 Update CHANGELOG for 2.0.0rc1 tag 2016-10-28 10:16:57 -07:00
Fred Park 231d5dfbe1 Add deltasks, termtasks subcommands for jobs
- Use docker kill to terminate tasks
- Add scoping to jobs del/term
2016-10-28 10:08:59 -07:00
Fred Park 72f9c90baf Remove name requirement for multi-instance tasks
- Update TensorFlow-Distributed gpu launcher script to autodetect gpus
- Separate config scripts for TensorFlow-Distributed into CPU and GPU
2016-10-27 23:42:57 -07:00
Fred Park 5f160e8938 Update docs for new CLI
- CLI fixes
- Add more convenience subcommands
2016-10-27 22:11:46 -07:00
Fred Park 818a1cda03 Add a contributing recipes guide 2016-10-27 12:53:12 -07:00
Fred Park 3a03a0d378 CLI conversion to click
- Move version to init
- Remove version from tfm
- Add more subcommands
2016-10-27 09:56:18 -07:00
Fred Park ef37280616 Update MXNet recipes 2016-10-26 11:18:52 -07:00
Fred Park beeb118b19 Add generated_file_export_path option
- Add Dockerfile for cli
- Update docs for docker cli
- Update travis build to include tfm
2016-10-25 22:15:35 -07:00
Fred Park 0a702d1f8b Prep for multi image Batch-Shipyard docker repo 2016-10-25 15:02:28 -07:00
Fred Park 2266d062a1 Add MXNet recipes
- Disable mounting /opt/intel for SLES-HPC hosts
2016-10-24 22:31:14 -07:00
Fred Park 180627a229 Default SLES docker install to module 2016-10-24 12:51:25 -07:00
Fred Park de1f3e39b2 reboot_on_start_task_failed to false in recipes 2016-10-24 10:13:46 -07:00
Fred Park 705ae40065 Add support for pool resize up with GlusterFS
- Update azure-batch dependency to 1.1.0
2016-10-24 10:08:13 -07:00
Fred Park 9a13dbe83b Update CNTK to 1.7.2 and recipes
- Fix python2+Windows file encoding issue
- Add deljobswait action
2016-10-22 22:43:48 -07:00
Fred Park 3e5873b5df Add ingress/egress overview picture to DM guide
- Prevent glusterfs pool allocation if internode comm not enabled
2016-10-21 09:04:28 -07:00
Fred Park 92464b3b54 Add Azure Batch Task data ingress
- Rearrange Dockerfiles
- Update TensorFlow-Distributed recipe
- Rename CASCADE env vars to SHIPYARD
2016-10-20 21:18:31 -07:00
Fred Park 481d298e7c Add credential encryption guide 2016-10-20 10:48:25 -07:00
Fred Park 74d3eea339 Add Encrypted Credential support 2016-10-19 21:14:53 -07:00
Fred Park bb515e3812 Add Data Movement guide 2016-10-17 13:33:12 -07:00
Fred Park 1436cd4378 Add compute node to Azure storage egress support 2016-10-16 16:57:48 -07:00
Fred Park bd7101df16 Rename generate_tunnel_script property
- Add Torch-CPU to quickstart
2016-10-15 17:54:16 -07:00
Fred Park dc1c8d46a3 Add include pattern support for gettaskallfiles 2016-10-15 14:33:12 -07:00
Fred Park ed84011383 Add Azure File ingress support 2016-10-15 13:58:49 -07:00
Fred Park 33300c551c Add pool/job/task-level data ingress support 2016-10-14 15:49:20 -07:00
Fred Park 9d0c6d3ca6 Integrate blob storage ingress with pool creation 2016-10-14 09:36:38 -07:00
Fred Park b0d3b9ba69 Data ingress support to Azure Blob Storage 2016-10-14 07:42:53 -07:00
Fred Park d88475baa7 Add listjobs, listtasks, gettaskallfiles actions 2016-10-13 14:11:52 -07:00
Fred Park 4ce2f1d6c2 Add HPN-SSH support for Ubuntu
- Fix some issues with azure file setup and Windows
- Add some validation with container naming
- Clean up storage with delpool action
- Update .gitignore
2016-10-13 10:55:49 -07:00
Fred Park 8101a1407f Add arbitrary file split support 2016-10-12 15:23:27 -07:00
Fred Park 1bc8e60fe2 Add include/exclude filter support for source path 2016-10-12 09:30:06 -07:00
Fred Park 261984020e Add rsync transfer methods
- Refactor data transfer functions into single/multinode
- Expand docs for data ingress
2016-10-11 10:39:54 -07:00
Fred Park ae1e3c19d9 Further refactor shipyard into components
- Add convoy path for travis
2016-10-09 21:01:11 -07:00
Fred Park a4ec217f66 First stage in shipyard modularization
- Update configuration docs for new data ingress spec
2016-10-09 15:22:15 -07:00
Fred Park edec6f0584 Add scp and multinode_scp ingress support 2016-10-09 11:51:02 -07:00
Fred Park 487223e8fa Change pool config ssh_docker_tunnel to ssh 2016-10-06 11:03:10 -07:00