Commit Graph

262 Commits

Author SHA1 Message Date
hieuhc 28bfda3278 Replace clear() method when invoking pool udi with ssh (#118) 2017-09-07 09:18:46 +01:00
Fred Park 1d5cdcbbdf Tag for 2.9.3 release 2017-08-29 08:05:34 -07:00
Fred Park 010b160e43 Provide warning and note on job migration 2017-08-17 13:08:31 -07:00
Fred Park 5f393beb1d Disallow resize_timeout in AS-enabled pools 2017-08-17 07:06:03 -07:00
Fred Park 9d66d75b62 Tag for 2.9.2 release
- Attempt another fix at site extension upgrade (#113)
2017-08-16 08:59:53 -07:00
Fred Park f91107e89d Tag for 2.9.1 release 2017-08-16 08:33:19 -07:00
Fred Park 9e3308ff2b Fix issues in RemoteFS
- Public ip not being assigned for resize command
- Need to wait for block device to show up for attached disks (expand
  command)
- Add more logging
2017-08-16 08:14:08 -07:00
Fred Park 71706dec6e Fail faster for storage issues in remotefs 2017-08-15 19:40:41 -07:00
Fred Park 3f244d0e4b Fix ssh private key issue in RemoteFS 2017-08-15 16:39:52 -07:00
Fred Park b16685348d Tag for 2.9.0 release 2017-08-15 13:28:55 -07:00
Fred Park 466c7d4a3b Perform client checks 2017-08-15 13:26:59 -07:00
Fred Park e434b83cb3 Fix truncated P50 provisioning
- Support "s_v3" suffixed premium VM SKUs
2017-08-15 13:26:31 -07:00
Fred Park 7a815dff6f Minor updates and fixes 2017-08-14 09:07:07 -07:00
Fred Park 1e4cd777be Fix division by zero in pool stats
- Fix flake8 issues
2017-08-10 15:08:39 -07:00
Fred Park 745082029f Misc doc updates
- Update requests
- Check task id length
- Drop Python 3.3 support due to cryptography
2017-08-10 08:40:07 -07:00
Fred Park 284c4d9c23 Tag for 2.9.0rc1 release 2017-08-09 08:57:33 -07:00
Fred Park 4573180293 Validate and prompt certain job schedule adds 2017-08-09 08:39:26 -07:00
Fred Park 44a1f14b31 Add monitor_task_completion for recurring jobs 2017-08-09 07:57:56 -07:00
Fred Park 5ae9001716 Add job schedule support to commands
- Resolves #19
2017-08-08 15:02:53 -07:00
Fred Park 9add2444ec Change autogen task id property to complex
- Update job recurrence docs
2017-08-08 08:45:15 -07:00
Fred Park be530e63c0 Job recurrence support 2017-08-07 19:42:09 -07:00
Fred Park 99e72c0c3f Add custom task factory support (#93) 2017-08-07 10:38:08 -07:00
Fred Park 754b5ee5a6 Tag for 2.9.0b2 release 2017-08-04 14:51:27 -07:00
Fred Park 8a396f0e18 Add pool and jobs stats
- Resolves #110
2017-08-04 14:47:16 -07:00
Fred Park c5fa85adcb Add file task factory (#93)
- Split out task factory settings into separate file
- Change uniform to be a, b instead of min, max
- Update blobxfer script for single target ingress to place file
  directly to destination
2017-08-04 11:02:33 -07:00
Fred Park 1650ce4a95 Add random task factory (#93) 2017-08-03 20:10:56 -07:00
Fred Park e5ffd492ab Update CNTK CPU infiniband recipe to 2.1 2017-08-03 16:28:23 -07:00
Fred Park 4d09a09a80 Add --all-unused to pool delnode 2017-08-03 10:41:38 -07:00
Fred Park d539e2923a Fix pool udi terminal mangling 2017-08-03 08:52:20 -07:00
Fred Park 9eb8fd4c55 Support CentOS-HPC 7.3
- Update misc tensorboard to latest
- Fix term tasks in disable jobs
- Update NVIDIA driver
- Doc updates
2017-08-02 15:12:45 -07:00
Fred Park bab8628ed5 Tag for 2.9.0b1 release 2017-07-31 15:05:58 -07:00
Fred Park ed8ca2d225 Add autogen task id setting 2017-07-31 13:40:18 -07:00
Fred Park 196a36336e Add rebalance based on preempted node count 2017-07-31 13:40:15 -07:00
Fred Park 4105acc2f8 Add task factory (parameter sweep) support
- Resolves #93
2017-07-28 14:36:42 -07:00
Fred Park 23a753a110 RemoteFS fixes 2017-07-27 08:12:34 -07:00
Fred Park 7a9177b16b Fix pool deletion with poolid arg 2017-07-26 10:38:29 -07:00
Fred Park 5fef683af4 Universally increase SAS expiry time 2017-07-21 13:28:56 -07:00
Fred Park e32fc4d93e Add Autopool support
- Resolves #33
- Add --poolid to storage clear and storage del
- jobs del and jobs term now cleanup storage data if autopool is
  detected
2017-07-21 11:10:03 -07:00
Fred Park 30ea8c280f Add autoscale guide 2017-07-21 11:10:03 -07:00
Fred Park 7ba85e7496 Add job migration support
- Add enable/disable job support too
- Resolves #108
2017-07-21 11:10:03 -07:00
Fred Park 3b65ba684f Support job priorities
- Resolves #109
2017-07-21 11:10:03 -07:00
Fred Park 23e9584852 Add compute node fill type support
- Resolves #107
2017-07-21 11:10:03 -07:00
Fred Park 82a46a615a Basic Autoscale functionality
- Allow pools to be added with zero target nodes
- Add pool autoscale commands
2017-07-21 11:10:03 -07:00
Fred Park 5291ff1130 Move to blob leasing for download ticketing
- Greatly increase resource file SAS expiry timedelta
- Make concurrent_source_downloads generic, remove non-p2p option
- Update Dockerfiles
- Update to latest azure-storage
2017-07-21 11:10:03 -07:00
Fred Park d197c9be28 Minor fixups 2017-07-07 09:07:40 -07:00
Fred Park 03fe791171 Tag for 2.8.0 release 2017-07-06 11:12:24 -07:00
Fred Park 8eb2197d23 Allow CentOS 7.3 on NC/NV 2017-07-06 11:12:05 -07:00
Fred Park de45b18a67 Add backoff to cascade docker image pull retries 2017-07-01 01:25:30 -07:00
Fred Park 2a48885da1 More improvements for scale out robustness
- Add --all-start-task-failed to delnode
- Reduce node output on pool allocation wait with number of nodes > 10
2017-06-30 23:50:21 -07:00
Fred Park 06188c1944 Tag for 2.8.0rc2 release
- Fix regression with private docker image pulls
- Resolves #103
- Resolves #105
2017-06-30 11:45:26 -07:00