Граф коммитов

513 Коммитов

Автор SHA1 Сообщение Дата
Fred Park d539e2923a Fix pool udi terminal mangling 2017-08-03 08:52:20 -07:00
Fred Park dadf574691 Update CNTK-GPU-OpenMPI to 2.1 2017-08-03 08:44:48 -07:00
Fred Park fee0d0e4ae Update CNTK-CPU-OpenMPI recipes for 2.1 2017-08-02 15:12:45 -07:00
Fred Park 9eb8fd4c55 Support CentOS-HPC 7.3
- Update misc tensorboard to latest
- Fix term tasks in disable jobs
- Update NVIDIA driver
- Doc updates
2017-08-02 15:12:45 -07:00
Fred Park 804169d48b Update TensorFlow recipes to 1.2.1
- Fix Distributed TF recipe and make launcher generalized
2017-08-02 11:00:53 -07:00
Fred Park b09b024b11 Fix task factory doc 2017-07-31 15:17:54 -07:00
Fred Park bab8628ed5 Tag for 2.9.0b1 release 2017-07-31 15:05:58 -07:00
Fred Park ed8ca2d225 Add autogen task id setting 2017-07-31 13:40:18 -07:00
Fred Park 196a36336e Add rebalance based on preempted node count 2017-07-31 13:40:15 -07:00
Fred Park 4105acc2f8 Add task factory (parameter sweep) support
- Resolves #93
2017-07-28 14:36:42 -07:00
Fred Park 23a753a110 RemoteFS fixes 2017-07-27 08:12:34 -07:00
Fred Park 7a9177b16b Fix pool deletion with poolid arg 2017-07-26 10:38:29 -07:00
Fred Park 5fef683af4 Universally increase SAS expiry time 2017-07-21 13:28:56 -07:00
Fred Park e32fc4d93e Add Autopool support
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
  detected
2017-07-21 11:10:03 -07:00
Fred Park 30ea8c280f Add autoscale guide 2017-07-21 11:10:03 -07:00
Fred Park 7ba85e7496 Add job migration support
- Add enable/disable job support too
- Resolves #108
2017-07-21 11:10:03 -07:00
Fred Park 3b65ba684f Support job priorities
- Resolves #109
2017-07-21 11:10:03 -07:00
Fred Park 23e9584852 Add compute node fill type support
- Resolves #107
2017-07-21 11:10:03 -07:00
Fred Park 82a46a615a Basic Autoscale functionality
- Allow pools to be added with zero target nodes
- Add pool autoscale commands
2017-07-21 11:10:03 -07:00
Fred Park 5291ff1130 Move to blob leasing for download ticketing
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park 1a941648fe Add Azure Cloud Shell info 2017-07-21 11:05:54 -07:00
Fred Park 8a0d40eeee Minor doc update 2017-07-18 08:16:16 -07:00
Fred Park f12ae16414 Add from scratch guide 2017-07-17 14:27:59 -07:00
Fred Park d197c9be28 Minor fixups 2017-07-07 09:07:40 -07:00
Fred Park 03fe791171 Tag for 2.8.0 release 2017-07-06 11:12:24 -07:00
Fred Park 8eb2197d23 Allow CentOS 7.3 on NC/NV 2017-07-06 11:12:05 -07:00
Fred Park de45b18a67 Add backoff to cascade docker image pull retries 2017-07-01 01:25:30 -07:00
Fred Park 2a48885da1 More improvements for scale out robustness
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park 06188c1944 Tag for 2.8.0rc2 release
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00
Fred Park afde52abe6 Update docs add WSL support
- Resolves #101
2017-06-29 08:58:41 -07:00
Fred Park 3f2bd678c5 Add Mac OS X install.sh support 2017-06-29 07:33:52 -07:00
Fred Park ade6a27b60 Tag for 2.8.0rc1 release 2017-06-27 11:38:36 -07:00
Fred Park 5830209041 Add cloudshell installation support 2017-06-27 10:46:47 -07:00
Fred Park 54422ce2eb Add retry handling for cascade docker pull
- Add cascade.log download for start up failures
2017-06-27 09:28:03 -07:00
Fred Park dca5473504 Improve robustness of package downloads 2017-06-27 07:14:31 -07:00
Fred Park cefa72e443 Add version metadata to pool and jobs
- Resolves #89
2017-06-26 13:20:49 -07:00
Fred Park 94bd35e21c Update Dockerfiles to Alpine 3.6
- Resolves #65
2017-06-26 11:12:11 -07:00
Fred Park a61449ec9c Fix tensorboard command with custom image changes
- Fix ref during exception handling for invalid platform image
- Remove max size note for remote fs managed disks
2017-06-26 10:51:53 -07:00
Fred Park 7fcc8b5aa1 Add conda-forge detection to windows script 2017-06-26 07:58:28 -07:00
David Wallin 643a79549f extend Anaconda check to include conda-forge 2017-06-26 07:27:13 -07:00
Fred Park 35c9779d68 Fix job auto_complete overwrite of job properties
- Resolves #97
2017-06-09 11:34:26 -07:00
Fred Park e53a5bb88d Fix pathing for detecting docker graph location 2017-06-09 11:33:41 -07:00
Fred Park b84d1f522b Update docs 2017-06-09 11:33:15 -07:00
Fred Park 798f0ca3c8 Add port info to custom image doc 2017-06-07 09:33:54 -07:00
Fred Park 887c597fab Tag for 2.8.0b1 release 2017-06-07 08:30:04 -07:00
Fred Park a41713c5ee Add custom image guide
- Update recipes for vm_configuration
- Fix some issues with platform pools with new changes
2017-06-06 12:41:42 -07:00
Fred Park 8397b411c5 Initial custom image support 2017-06-06 08:43:33 -07:00
Fred Park 549f50aac5 Tag for 2.7.0 release 2017-05-31 07:43:14 -07:00
Fred Park 31fbe22c42 Split out low-pri considerations into own doc
- Regarding #92
2017-05-29 13:08:51 -07:00
Fred Park 004413e36e Fix pool udi with no logins/encryption over SSH
- Resolves #92
2017-05-28 15:18:32 -07:00