Commit Graph

347 Commits

Author SHA1 Message Date
Evan Cui 84b28eb9ad Merge pull request #12 from coin8086/readme: Turn Readme.txt into README.md and add notes on compiling dependencies. 2018-08-03 16:27:10 +08:00
Robert Zhang 1419e22f1c Turn Readme.txt into README.md and add notes on compiling dependencies. 2018-08-03 11:57:25 +08:00
Evan Cui c00987ffc0 Fix the script error 2018-07-05 21:10:16 +08:00
Evan Cui b08bcd49d5 Change the process key algorithm 2018-05-23 02:37:51 +08:00
Evan Cui 54e46566a0 Fix the nodemanager kill 2018-05-23 01:06:13 +08:00
Evan Cui b225e71fa4 Fix the end task method 2018-05-22 18:50:48 +08:00
FAREAST\chezhang 1b6ad50d28 Merge branch 'log' 2018-05-15 18:11:58 +08:00
FAREAST\chezhang 5d7ec8cfc6 revise nodemanager logging for docker and gpu metrics initialization command 2018-05-15 18:11:35 +08:00
Evan Cui d01a1a7013 Fix the build break 2018-05-15 00:24:14 +08:00
Evan Cui 4d9e9f8a52 Disable the metric in options 2018-05-15 00:21:48 +08:00
Evan Cui 927c201a7f change the search path of ldd 2018-05-15 00:09:10 +08:00
Evan Cui e1c04eed84 Added the customized logger 2018-05-14 23:30:07 +08:00
Evan Cui ab17941387 Upgrade casablanca version and the spdlog version, Support for HPC-ACM. 2018-05-14 23:18:22 +08:00
FAREAST\chezhang bf9d5bf9ff Generate hostfile/machinefile for various mpi applications 2018-04-20 20:27:32 +08:00
FAREAST\chezhang d7023ec4d2 Fix a bug that task would fail when cgroup is not enabled 2018-03-12 21:13:29 +08:00
FAREAST\chezhang 77567a7b0d Map Windows system account to Linux root user 2018-02-09 20:54:50 +08:00
Chenling Zhang d90f79e8da Retrieve public key from private key if its value is absent 2017-12-13 17:15:16 +08:00
Chenling Zhang 272193474d #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails - handle more exceptions 2017-11-03 18:07:09 +08:00
Chenling Zhang f2e1b90aca #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails 2017-11-03 14:44:03 +08:00
Chenling Zhang 137e4ede1a #1339398 do not monitor GPU info if initializing GPU driver fails. 2017-10-12 10:48:14 +08:00
Chenling Zhang 7ac024af9e Fix a bug so that root user could pass mutual trust in mpi docker task. 2017-09-28 22:23:59 +08:00
evanc 3b9a07ce47 Fix the unit test failure, enabled cancellation token for classes, added version info 2017-09-26 08:30:19 -07:00
Chenling Zhang 1c5753b412 some revise 2017-09-26 17:22:24 +08:00
Chenling Zhang 3cfd2f0a43 Merge branch 'branch2' 2017-09-18 22:10:15 +08:00
Chenling Zhang cf40cc19ae some revise 2017-09-18 21:34:04 +08:00
Chenling Zhang b56f10d590 Merge branch 'peektaskoutput' 2017-09-14 17:07:03 +08:00
Chenling Zhang 09dd23127b some revise 2017-09-11 17:39:42 +08:00
Chenling Zhang 0b87ef5a11 fix code defect 2017-09-04 21:12:01 +08:00
Chenling Zhang b3fcc87d75 peek task output for linux compute nodes 2017-09-04 21:06:34 +08:00
Chenling Zhang 338dbaafbb nvidia-docker 2017-09-02 18:46:20 +08:00
Chenling Zhang 2fac1c0aff more options 2017-08-03 16:27:32 +08:00
Chenling Zhang 32a40ca298 mutual trust 2017-07-30 15:09:30 +08:00
Chenling Zhang 70962cf9fb basic func 2017-07-22 00:34:42 +08:00
evanc 899827eb64 add task completion uri 2017-07-03 05:05:16 -07:00
evanc 5791fbda05 Added built in proxy support, added unit test for built in proxy 2017-03-31 01:23:44 -07:00
evanc 09a8ec0858 Enable proxy 2017-03-29 08:19:03 -07:00
Evan Cui f272dd54f5 Fix the null reference exception 2017-01-09 21:43:54 +08:00
evanc 1899baafec Fix a task process never start issue 2016-11-25 03:13:04 -08:00
evanc dce83ae2de Fix the resync issue 2016-11-09 09:53:25 -08:00
evanc 6e90314fae Fix a job stuck in running issue 2016-11-09 03:42:41 -08:00
evanc 68a110497a Fix the node error when scheduler switch over 2016-11-03 12:13:44 -07:00
evanc 4bfae47057 Fix the unknown node availability issue 2016-10-27 08:36:50 -07:00
evanc cd719cea70 Fix the unit test 2016-10-24 04:55:35 -07:00
evanc afe59d4bbb Merge branch 'master' of github.com:coolmay/whpc-linux-communicator 2016-10-24 04:30:00 -07:00
evanc 546ba90b5c Hpc Pack 2016 support 2016-10-24 04:29:40 -07:00
chezhang ebf9090348 Fix the LinuxCommunicator constructor failure issue by retrying to connect to HpcMonitoringServer service 2016-07-29 17:04:54 +08:00
evanc fdb8eccf6c Fix the node manager crash issue 2016-07-26 03:18:12 -07:00
evanc f44e493463 Fix the instance metric value 2016-07-22 02:53:58 -07:00
evanc a9f62cdda9 t push; Merge branch 'master' of github.com:coolmay/whpc-linux-communicator 2016-07-21 03:56:24 -07:00
evanc 6717d8437a Fix a total memory gpu metric error 2016-07-21 03:56:07 -07:00