Fred Park
e434b83cb3
Fix truncated P50 provisioning
...
- Support "s_v3" suffixed premium VM SKUs
2017-08-15 13:26:31 -07:00
Fred Park
c67225c3e9
Add Caffe2 recipes
...
- Resolves #112
2017-08-14 10:39:15 -07:00
Fred Park
7a815dff6f
Minor updates and fixes
2017-08-14 09:07:07 -07:00
Mat
203872326f
Add deep learning notebooks to contrib ( #111 )
...
* Adds single node CNTK example
* Adds initial set of notebooks and readme
* Updates the notebooks and reorganises them
* Adds missing notebooks
* Updates Readme and next steps
* Fixes links to notebooks
* Moves notebooks to new location.
Updates notebooks to latest shipyard release.
* Adds readme to notebooksdirectory
* Adds notebook description to contrib
* Moves notebooks section
Moves notebooks section so that it is in alphabetical order
* Updates README
* Removes the output from the notebooks
* alfpark: amend notebooks, clarify READMEs
2017-08-14 08:54:01 -07:00
Fred Park
1e4cd777be
Fix division by zero in pool stats
...
- Fix flake8 issues
2017-08-10 15:08:39 -07:00
Fred Park
7ed0db7f78
Add AppVeyor build
...
- Automated site extensions nuget builds
- Resolves #98
2017-08-10 10:29:22 -07:00
Fred Park
745082029f
Misc doc updates
...
- Update requests
- Check task id length
- Drop Python 3.3 support due to cryptography
2017-08-10 08:40:07 -07:00
Fred Park
284c4d9c23
Tag for 2.9.0rc1 release
2017-08-09 08:57:33 -07:00
Fred Park
4573180293
Validate and prompt certain job schedule adds
2017-08-09 08:39:26 -07:00
Fred Park
44a1f14b31
Add monitor_task_completion for recurring jobs
2017-08-09 07:57:56 -07:00
Fred Park
5ae9001716
Add job schedule support to commands
...
- Resolves #19
2017-08-08 15:02:53 -07:00
Fred Park
9add2444ec
Change autogen task id property to complex
...
- Update job recurrence docs
2017-08-08 08:45:15 -07:00
Fred Park
be530e63c0
Job recurrence support
2017-08-07 19:42:09 -07:00
Fred Park
99e72c0c3f
Add custom task factory support ( #93 )
2017-08-07 10:38:08 -07:00
Fred Park
754b5ee5a6
Tag for 2.9.0b2 release
2017-08-04 14:51:27 -07:00
Fred Park
8a396f0e18
Add pool and jobs stats
...
- Resolves #110
2017-08-04 14:47:16 -07:00
Fred Park
c5fa85adcb
Add file task factory ( #93 )
...
- Split out task factory settings into separate file
- Change uniform to be a, b instead of min, max
- Update blobxfer script for single target ingress to place file
directly to destination
2017-08-04 11:02:33 -07:00
Fred Park
1650ce4a95
Add random task factory ( #93 )
2017-08-03 20:10:56 -07:00
Fred Park
05e9773741
Update recipes
...
- `remove_container_after_exit` is now defaulted enabled
- Move to CentOS-HPC 7.3 for ib recipes
2017-08-03 19:13:57 -07:00
Fred Park
e5ffd492ab
Update CNTK CPU infiniband recipe to 2.1
2017-08-03 16:28:23 -07:00
Fred Park
a8f721a752
Add CNTK GPU ib recipe
2017-08-03 13:40:32 -07:00
Fred Park
4d09a09a80
Add --all-unused to pool delnode
2017-08-03 10:41:38 -07:00
Fred Park
d539e2923a
Fix pool udi terminal mangling
2017-08-03 08:52:20 -07:00
Fred Park
dadf574691
Update CNTK-GPU-OpenMPI to 2.1
2017-08-03 08:44:48 -07:00
Fred Park
fee0d0e4ae
Update CNTK-CPU-OpenMPI recipes for 2.1
2017-08-02 15:12:45 -07:00
Fred Park
9eb8fd4c55
Support CentOS-HPC 7.3
...
- Update misc tensorboard to latest
- Fix term tasks in disable jobs
- Update NVIDIA driver
- Doc updates
2017-08-02 15:12:45 -07:00
Fred Park
804169d48b
Update TensorFlow recipes to 1.2.1
...
- Fix Distributed TF recipe and make launcher generalized
2017-08-02 11:00:53 -07:00
Fred Park
b09b024b11
Fix task factory doc
2017-07-31 15:17:54 -07:00
Fred Park
bab8628ed5
Tag for 2.9.0b1 release
2017-07-31 15:05:58 -07:00
Fred Park
ed8ca2d225
Add autogen task id setting
2017-07-31 13:40:18 -07:00
Fred Park
196a36336e
Add rebalance based on preempted node count
2017-07-31 13:40:15 -07:00
Fred Park
4105acc2f8
Add task factory (parameter sweep) support
...
- Resolves #93
2017-07-28 14:36:42 -07:00
Fred Park
23a753a110
RemoteFS fixes
2017-07-27 08:12:34 -07:00
Fred Park
7a9177b16b
Fix pool deletion with poolid arg
2017-07-26 10:38:29 -07:00
Fred Park
5fef683af4
Universally increase SAS expiry time
2017-07-21 13:28:56 -07:00
Fred Park
e32fc4d93e
Add Autopool support
...
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
detected
2017-07-21 11:10:03 -07:00
Fred Park
30ea8c280f
Add autoscale guide
2017-07-21 11:10:03 -07:00
Fred Park
7ba85e7496
Add job migration support
...
- Add enable/disable job support too
- Resolves #108
2017-07-21 11:10:03 -07:00
Fred Park
3b65ba684f
Support job priorities
...
- Resolves #109
2017-07-21 11:10:03 -07:00
Fred Park
23e9584852
Add compute node fill type support
...
- Resolves #107
2017-07-21 11:10:03 -07:00
Fred Park
82a46a615a
Basic Autoscale functionality
...
- Allow pools to be added with zero target nodes
- Add pool autoscale commands
2017-07-21 11:10:03 -07:00
Fred Park
5291ff1130
Move to blob leasing for download ticketing
...
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park
1a941648fe
Add Azure Cloud Shell info
2017-07-21 11:05:54 -07:00
Fred Park
8a0d40eeee
Minor doc update
2017-07-18 08:16:16 -07:00
Fred Park
f12ae16414
Add from scratch guide
2017-07-17 14:27:59 -07:00
Fred Park
d197c9be28
Minor fixups
2017-07-07 09:07:40 -07:00
Fred Park
03fe791171
Tag for 2.8.0 release
2017-07-06 11:12:24 -07:00
Fred Park
8eb2197d23
Allow CentOS 7.3 on NC/NV
2017-07-06 11:12:05 -07:00
Fred Park
de45b18a67
Add backoff to cascade docker image pull retries
2017-07-01 01:25:30 -07:00
Fred Park
2a48885da1
More improvements for scale out robustness
...
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00