Граф коммитов

368 Коммитов

Автор SHA1 Сообщение Дата
zclok010 7c94523eb2 backup 2019-05-23 11:46:49 +08:00
zclok010 6a7ef893c6 git ignore .vscode 2019-05-10 12:16:27 +08:00
zclok010 b130b4fd7f Fix test case error introduced in commit b41e682 (excution filter change) 2019-05-09 15:15:48 +08:00
zclok010 d98eec6f15 Add CcpVersion and CustomProperties in register info 2019-05-07 10:43:33 +08:00
zclok010 0d44a14c3e refine code 2019-05-06 20:04:10 +08:00
zclok010 d7a8e4a371 tidy log when cleaning up zombie tasks 2019-05-05 14:57:32 +08:00
zclok010 05138060e7 Fix a bug that processes in task are not actually terminated after task canceling if cgroup is not enable 2019-04-30 20:26:31 +08:00
zclok010 844bb394b9 Fix a bug that zombie task clean up would fail when nodemanager starts 2019-04-30 20:24:27 +08:00
zclok010 42b467d5af tidy some code 2019-04-30 15:12:30 +08:00
FAREAST\chezhang 6483b16b0f Update tasks' statistics before sending heartbeat to enable showing statistics when task is running 2019-04-26 10:19:23 +08:00
zclok010 b41e682ab3 Merge branch 'executionfilter' into v2 2019-04-16 15:42:04 +08:00
zclok010 2750349ba1 Add build-in execution filters to adjust task affinity in terms of core distribution in NUMA nodes and to modify command for preparation of mpi task 2019-04-16 15:29:26 +08:00
zclok010 8deeffa1f0 get home dir by tilde expansion 2019-04-08 15:26:17 +08:00
zclok010 2fa7452e49 fix a bug that memory is limited to the first NUMA node when using cgroup 2019-03-28 17:26:35 +08:00
zclok010 4e0e4a2d1e change the default working directory to home 2019-03-28 17:02:54 +08:00
zclok010 7cfce24f5a Change the owner of home directory, which is created by Linux nodemanager, to the user instead of leaving it root. 2019-03-26 13:42:02 +08:00
FAREAST\chezhang 6df8834638 Merge branch 'disableCgroup' into v2 2019-03-18 12:17:21 +08:00
chenling 1a477f059e Fix a bug that HPC Pack environment variables will be lost if CCP_SWITCH_USER is set on CentOS 2019-03-14 17:02:11 +08:00
FAREAST\chezhang ea1a7af86b refine code 2019-03-13 17:23:23 +08:00
FAREAST\chezhang bf49363154 Add environment variable CCP_DISABLE_CGROUP to enable running a task without cgroup 2019-03-13 17:18:38 +08:00
chenling 729f94e18e fix Cores In Use not correct bug 2019-02-19 15:13:17 +08:00
chenling c09de10d5e fix Cores In Use not correct bug 2019-01-29 22:59:24 +08:00
FAREAST\chezhang 45eed2e48a Update version info 2019-01-03 15:42:06 +08:00
FAREAST\chezhang 3604242371 Merge branch 'affinity' into v2 2019-01-03 15:37:17 +08:00
FAREAST\chezhang 0ba6267b9f enable switching user and running task command by using environment variable CCP_SWITCH_USER, which is to fix a bug that command, like "ulimit -l" on Ubuntu, is executed as root even if the runas user is specified as other user 2019-01-03 15:36:31 +08:00
FAREAST\chezhang 14db4d87f3 Fix affinity bug 2019-01-02 21:11:51 +08:00
chenling 53211d728a Update version info 2018-08-28 23:13:34 +08:00
FAREAST\chezhang 6b33a068d6 Reduce naming service request interval when failing to get response from headnode so that Linux compute nodes could appear in a shorter time after ARM template deployment finishes 2018-08-28 17:04:21 +08:00
FAREAST\chezhang 1055fec2ce Fix a bug that nodemanager would delete user ssh keys 2018-07-31 16:22:59 +08:00
Evan Cui 2fea77493f Upgrade the spdlog version; Build the nodemanager in old envi 2018-07-25 18:35:35 +08:00
FAREAST\chezhang 524161c851 Get Linux compute node Azure instance metadata - revise 2018-07-24 17:43:41 +08:00
FAREAST\chezhang d9780148f7 Get Linux compute node Azure instance metadata 2018-07-17 21:13:41 +08:00
FAREAST\chezhang 710da06975 Revise a log message 2018-06-13 15:30:23 +08:00
Evan Cui 475627fd11 Fixed the version info 2018-05-16 17:58:37 +08:00
FAREAST\chezhang bf9d5bf9ff Generate hostfile/machinefile for various mpi applications 2018-04-20 20:27:32 +08:00
FAREAST\chezhang d7023ec4d2 Fix a bug that task would fail when cgroup is not enabled 2018-03-12 21:13:29 +08:00
FAREAST\chezhang 77567a7b0d Map Windows system account to Linux root user 2018-02-09 20:54:50 +08:00
Chenling Zhang d90f79e8da Retrieve public key from private key if its value is absent 2017-12-13 17:15:16 +08:00
Chenling Zhang 272193474d #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails - handle more exceptions 2017-11-03 18:07:09 +08:00
Chenling Zhang f2e1b90aca #1690111 Avoid leaving credential information on disk when Linux node execution filter task fails 2017-11-03 14:44:03 +08:00
Chenling Zhang 137e4ede1a #1339398 do not monitor GPU info if initializing GPU driver fails. 2017-10-12 10:48:14 +08:00
Chenling Zhang 7ac024af9e Fix a bug so that root user could pass mutual trust in mpi docker task. 2017-09-28 22:23:59 +08:00
evanc 3b9a07ce47 Fix the unit test failure, enabled cancellation token for classes, added version info 2017-09-26 08:30:19 -07:00
Chenling Zhang 1c5753b412 some revise 2017-09-26 17:22:24 +08:00
Chenling Zhang 3cfd2f0a43 Merge branch 'branch2' 2017-09-18 22:10:15 +08:00
Chenling Zhang cf40cc19ae some revise 2017-09-18 21:34:04 +08:00
Chenling Zhang b56f10d590 Merge branch 'peektaskoutput' 2017-09-14 17:07:03 +08:00
Chenling Zhang 09dd23127b some revise 2017-09-11 17:39:42 +08:00
Chenling Zhang 0b87ef5a11 fix code defect 2017-09-04 21:12:01 +08:00
Chenling Zhang b3fcc87d75 peek task output for linux compute nodes 2017-09-04 21:06:34 +08:00