Commit Graph

360 Commits

Author SHA1 Message Date
Zihao Chen 059e887288 Merge pull request #30 from AaronYll/fix-tasktimeout-after-network_partition: fix race condition when get stale request to StartTask 2023-03-29 14:45:13 +08:00
Liangliang Yuan 94e8108461 fix race condition when get stale request to StartTask 2023-03-15 13:21:59 +08:00
microsoft-github-policy-service[bot] 62449c5b3c Auto merge mandatory file pr: This pr is auto merged as it contains a mandatory file and is opened for more than 10 days. 2023-02-13 08:58:20 +00:00
Zihao Chen e44cf5e69c Merge pull request #29 from AaronYll/master: Support build from docker 2023-02-13 16:58:13 +08:00
Liangliang Yuan 9e3d9a55ff Support build from docker 2023-02-13 16:51:16 +08:00
microsoft-github-policy-service[bot] 2df46e0976 Microsoft mandatory file 2023-01-24 17:24:48 +00:00
Robert Zhang 36029057fd Merge pull request #14 from Azure/3rdparty: Add 3rd party licenses. 2019-04-16 13:30:09 +08:00
Robert Zhang b506187813 Add 3rd party licenses. 2019-04-16 11:36:08 +08:00
Zihao Chen 90cfd96c3f Update LICENSE 2019-03-19 11:36:51 +08:00
Evan Cui 1583eaaaf5 Support the acm node manager 2019-01-28 15:48:41 +08:00
Evan Cui 1ec9b06234 Update the version 2018-09-25 14:15:36 +08:00
Evan Cui a33ad9ca62 Merge branch 'master' of https://github.com/coolmay/whpc-linux-communicator 2018-09-25 14:13:24 +08:00
Evan Cui c285d0f12f Make job id part of the task execution id 2018-09-25 14:13:01 +08:00
Evan Cui 84b28eb9ad Merge pull request #12 from coin8086/readme: Turn Readme.txt into README.md and add notes on compiling dependencies. 2018-08-03 16:27:10 +08:00
Robert Zhang 1419e22f1c Turn Readme.txt into README.md and add notes on compiling dependencies. 2018-08-03 11:57:25 +08:00
Evan Cui c00987ffc0 Fix the script error 2018-07-05 21:10:16 +08:00
Evan Cui b08bcd49d5 Change the process key algorithm 2018-05-23 02:37:51 +08:00
Evan Cui 54e46566a0 Fix the nodemanager kill 2018-05-23 01:06:13 +08:00
Evan Cui b225e71fa4 Fix the end task method 2018-05-22 18:50:48 +08:00
FAREAST\chezhang 1b6ad50d28 Merge branch 'log' 2018-05-15 18:11:58 +08:00
FAREAST\chezhang 5d7ec8cfc6 revise nodemanager logging for docker and gpu metrics initialization command 2018-05-15 18:11:35 +08:00
Evan Cui d01a1a7013 Fix the build break 2018-05-15 00:24:14 +08:00
Evan Cui 4d9e9f8a52 Disable the metric in options 2018-05-15 00:21:48 +08:00
Evan Cui 927c201a7f change the search path of ldd 2018-05-15 00:09:10 +08:00
Evan Cui e1c04eed84 Added the customized logger 2018-05-14 23:30:07 +08:00
Evan Cui ab17941387 Upgrade casablanca version and the spdlog version, Support for HPC-ACM. 2018-05-14 23:18:22 +08:00
FAREAST\chezhang bf9d5bf9ff Generate hostfile/machinefile for various mpi applications 2018-04-20 20:27:32 +08:00
FAREAST\chezhang d7023ec4d2 Fix a bug that task would fail when cgroup is not enabled 2018-03-12 21:13:29 +08:00
FAREAST\chezhang 77567a7b0d Map Windows system account to Linux root user 2018-02-09 20:54:50 +08:00
Chenling Zhang d90f79e8da Retrieve public key from private key if its value is absent 2017-12-13 17:15:16 +08:00
Chenling Zhang 272193474d #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails - handle more exceptions 2017-11-03 18:07:09 +08:00
Chenling Zhang f2e1b90aca #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails 2017-11-03 14:44:03 +08:00
Chenling Zhang 137e4ede1a #1339398 do not monitor GPU info if initializing GPU driver fails. 2017-10-12 10:48:14 +08:00
Chenling Zhang 7ac024af9e Fix a bug so that root user could pass mutual trust in mpi docker task. 2017-09-28 22:23:59 +08:00
evanc 3b9a07ce47 Fix the unit test failure, enabled cancellation token for classes, added version info 2017-09-26 08:30:19 -07:00
Chenling Zhang 1c5753b412 some revise 2017-09-26 17:22:24 +08:00
Chenling Zhang 3cfd2f0a43 Merge branch 'branch2' 2017-09-18 22:10:15 +08:00
Chenling Zhang cf40cc19ae some revise 2017-09-18 21:34:04 +08:00
Chenling Zhang b56f10d590 Merge branch 'peektaskoutput' 2017-09-14 17:07:03 +08:00
Chenling Zhang 09dd23127b some revise 2017-09-11 17:39:42 +08:00
Chenling Zhang 0b87ef5a11 fix code defect 2017-09-04 21:12:01 +08:00
Chenling Zhang b3fcc87d75 peek task output for linux compute nodes 2017-09-04 21:06:34 +08:00
Chenling Zhang 338dbaafbb nvidia-docker 2017-09-02 18:46:20 +08:00
Chenling Zhang 2fac1c0aff more options 2017-08-03 16:27:32 +08:00
Chenling Zhang 32a40ca298 mutual trust 2017-07-30 15:09:30 +08:00
Chenling Zhang 70962cf9fb basic func 2017-07-22 00:34:42 +08:00
evanc 899827eb64 add task completion uri 2017-07-03 05:05:16 -07:00
evanc 5791fbda05 Added built in proxy support, added unit test for built in proxy 2017-03-31 01:23:44 -07:00
evanc 09a8ec0858 Enable proxy 2017-03-29 08:19:03 -07:00
Evan Cui f272dd54f5 Fix the null reference exception 2017-01-09 21:43:54 +08:00