Fred Park
|
d539e2923a
|
Fix pool udi terminal mangling
|
2017-08-03 08:52:20 -07:00 |
Fred Park
|
dadf574691
|
Update CNTK-GPU-OpenMPI to 2.1
|
2017-08-03 08:44:48 -07:00 |
Fred Park
|
fee0d0e4ae
|
Update CNTK-CPU-OpenMPI recipes for 2.1
|
2017-08-02 15:12:45 -07:00 |
Fred Park
|
9eb8fd4c55
|
Support CentOS-HPC 7.3
- Update misc tensorboard to latest
- Fix term tasks in disable jobs
- Update NVIDIA driver
- Doc updates
|
2017-08-02 15:12:45 -07:00 |
Fred Park
|
804169d48b
|
Update TensorFlow recipes to 1.2.1
- Fix Distributed TF recipe and make launcher generalized
|
2017-08-02 11:00:53 -07:00 |
Fred Park
|
b09b024b11
|
Fix task factory doc
|
2017-07-31 15:17:54 -07:00 |
Fred Park
|
bab8628ed5
|
Tag for 2.9.0b1 release
|
2017-07-31 15:05:58 -07:00 |
Fred Park
|
ed8ca2d225
|
Add autogen task id setting
|
2017-07-31 13:40:18 -07:00 |
Fred Park
|
196a36336e
|
Add rebalance based on preempted node count
|
2017-07-31 13:40:15 -07:00 |
Fred Park
|
4105acc2f8
|
Add task factory (parameter sweep) support
- Resolves #93
|
2017-07-28 14:36:42 -07:00 |
Fred Park
|
23a753a110
|
RemoteFS fixes
|
2017-07-27 08:12:34 -07:00 |
Fred Park
|
7a9177b16b
|
Fix pool deletion with poolid arg
|
2017-07-26 10:38:29 -07:00 |
Fred Park
|
5fef683af4
|
Universally increase SAS expiry time
|
2017-07-21 13:28:56 -07:00 |
Fred Park
|
e32fc4d93e
|
Add Autopool support
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
detected
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
30ea8c280f
|
Add autoscale guide
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
7ba85e7496
|
Add job migration support
- Add enable/disable job support too
- Resolves #108
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
3b65ba684f
|
Support job priorities
- Resolves #109
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
23e9584852
|
Add compute node fill type support
- Resolves #107
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
82a46a615a
|
Basic Autoscale functionality
- Allow pools to be added with zero target nodes
- Add pool autoscale commands
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
5291ff1130
|
Move to blob leasing for download ticketing
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
|
2017-07-21 11:10:03 -07:00 |
Fred Park
|
1a941648fe
|
Add Azure Cloud Shell info
|
2017-07-21 11:05:54 -07:00 |
Fred Park
|
8a0d40eeee
|
Minor doc update
|
2017-07-18 08:16:16 -07:00 |
Fred Park
|
f12ae16414
|
Add from scratch guide
|
2017-07-17 14:27:59 -07:00 |
Fred Park
|
d197c9be28
|
Minor fixups
|
2017-07-07 09:07:40 -07:00 |
Fred Park
|
03fe791171
|
Tag for 2.8.0 release
|
2017-07-06 11:12:24 -07:00 |
Fred Park
|
8eb2197d23
|
Allow CentOS 7.3 on NC/NV
|
2017-07-06 11:12:05 -07:00 |
Fred Park
|
de45b18a67
|
Add backoff to cascade docker image pull retries
|
2017-07-01 01:25:30 -07:00 |
Fred Park
|
2a48885da1
|
More improvements for scale out robustness
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
|
2017-06-30 23:50:21 -07:00 |
Fred Park
|
06188c1944
|
Tag for 2.8.0rc2 release
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
|
2017-06-30 11:45:26 -07:00 |
Fred Park
|
afde52abe6
|
Update docs add WSL support
- Resolves #101
|
2017-06-29 08:58:41 -07:00 |
Fred Park
|
3f2bd678c5
|
Add Mac OS X install.sh support
|
2017-06-29 07:33:52 -07:00 |
Fred Park
|
ade6a27b60
|
Tag for 2.8.0rc1 release
|
2017-06-27 11:38:36 -07:00 |
Fred Park
|
5830209041
|
Add cloudshell installation support
|
2017-06-27 10:46:47 -07:00 |
Fred Park
|
54422ce2eb
|
Add retry handling for cascade docker pull
- Add cascade.log download for start up failures
|
2017-06-27 09:28:03 -07:00 |
Fred Park
|
dca5473504
|
Improve robustness of package downloads
|
2017-06-27 07:14:31 -07:00 |
Fred Park
|
cefa72e443
|
Add version metadata to pool and jobs
- Resolves #89
|
2017-06-26 13:20:49 -07:00 |
Fred Park
|
94bd35e21c
|
Update Dockerfiles to Alpine 3.6
- Resolves #65
|
2017-06-26 11:12:11 -07:00 |
Fred Park
|
a61449ec9c
|
Fix tensorboard command with custom image changes
- Fix ref during exception handling for invalid platform image
- Remove max size note for remote fs managed disks
|
2017-06-26 10:51:53 -07:00 |
Fred Park
|
7fcc8b5aa1
|
Add conda-forge detection to windows script
|
2017-06-26 07:58:28 -07:00 |
David Wallin
|
643a79549f
|
extend Anaconda check to include conda-forge
|
2017-06-26 07:27:13 -07:00 |
Fred Park
|
35c9779d68
|
Fix job auto_complete overwrite of job properties
- Resolves #97
|
2017-06-09 11:34:26 -07:00 |
Fred Park
|
e53a5bb88d
|
Fix pathing for detecting docker graph location
|
2017-06-09 11:33:41 -07:00 |
Fred Park
|
b84d1f522b
|
Update docs
|
2017-06-09 11:33:15 -07:00 |
Fred Park
|
798f0ca3c8
|
Add port info to custom image doc
|
2017-06-07 09:33:54 -07:00 |
Fred Park
|
887c597fab
|
Tag for 2.8.0b1 release
|
2017-06-07 08:30:04 -07:00 |
Fred Park
|
a41713c5ee
|
Add custom image guide
- Update recipes for vm_configuration
- Fix some issues with platform pools with new changes
|
2017-06-06 12:41:42 -07:00 |
Fred Park
|
8397b411c5
|
Initial custom image support
|
2017-06-06 08:43:33 -07:00 |
Fred Park
|
549f50aac5
|
Tag for 2.7.0 release
|
2017-05-31 07:43:14 -07:00 |
Fred Park
|
31fbe22c42
|
Split out low-pri considerations into own doc
- Regarding #92
|
2017-05-29 13:08:51 -07:00 |
Fred Park
|
004413e36e
|
Fix pool udi with no logins/encryption over SSH
- Resolves #92
|
2017-05-28 15:18:32 -07:00 |