Граф коммитов

3995 Коммитов

Автор SHA1 Сообщение Дата
yiyione e4a12008a4
Add JS SDK tests to CI (#4631)
* add test_rest_server_js_sdk.sh

* update test_rest_server_js_sdk.sh

* add to stage_test.sh

* fix

* fix

* update

* update

* update

* update

* test

* test

* test

* test

* test

* update

* test

* test

* test

* update

* update

* update

* update swagger

* update

* update

* update

* update

* update

* update

* update

* fix timeout

* update swagger version
2020-06-23 16:51:40 +08:00
yiyione bf913540bb
[Rest Server] Add examples to swagger (#4620)
* Add examples to swagger

* update swagger version
2020-06-22 15:57:16 +08:00
Zhiyuan He 0b34c1a702
Update README.md (#4633) 2020-06-19 16:11:03 +08:00
shaiic-pai 14344b6da2
merge user manual back and refactor the link inside index.md (#4628)
* merge user manual back and refactor the link inside index.md

* update SHAIIC name in docs/manual/index.md

* add empty requirements.txt for mkdocs
2020-06-19 12:51:47 +08:00
Scarlett Li 2c604ef87d
Update README.md 2020-06-19 09:33:06 +08:00
Binyang2014 f08a92c74f
change storage manager QOS class from BestEffort to Burstable (#4630)
Before this change. The storage manager QOS class is BestEffort. When the node occur some condition, such as memory pressure/disk pressure. This pod will be evicted first.

Change to Burstable QOS class to avoid such evict.
2020-06-18 14:05:05 +08:00
Scarlett Li 15d8ff3543
adding shaicc's homepage link 2020-06-17 14:55:57 +08:00
yiyione c4544885b6
[Rest Server] Update rest-server swagger (#4617)
* update api version to 2.0.0 (#4420)

* [Hotfix] Update nullable fields in job api (#4437) (#4449)

* [Docs] Update nullable fields in job api (#4437)

Update nullable fields in job api.

* Update hived schema

Update hived schema.

Co-authored-by: Yuqing Yang <yuqyang@microsoft.com>
Co-authored-by: Yifan Xiong <yifan.xiong@microsoft.com>
2020-06-12 09:07:35 +08:00
yiyione 4f0c2026db
Use JS SDK in submit-job-v2 plugin (#4613)
* fix build

* use openpai-js-sdk

* fix yarn build
2020-06-12 09:06:50 +08:00
Mingliang Tao f2588ea441
Change home page img link (#4598) 2020-06-11 15:44:11 +08:00
dependabot[bot] 41e9552e05
Bump websocket-extensions from 0.1.3 to 0.1.4 in /src/webportal (#4607) 2020-06-08 07:04:21 +00:00
dependabot[bot] 13d8e86553
Bump websocket-extensions from 0.1.3 to 0.1.4 in /contrib/submit-job-v2 (#4608) 2020-06-08 05:16:33 +00:00
Zhiyuan He a81e3055aa
Update version and release note for 1.0.1 (#4569)
* fix

* empty trigger
2020-06-02 14:26:11 +08:00
Zhiyuan He b72077c997
Fix setuptools version in dev-box (#4595)
* fix

* fix
2020-06-02 13:32:52 +08:00
Yifan Xiong 8ae0242e55
Fix incorrect query in api docs (#4585)
Fix incorrect query for `/api/v2/jobs` in api docs.
2020-05-30 01:32:04 +08:00
Yifan Xiong d0fb4d2de7
Remove default webportal plugins in quick start (#4580)
Remove default webportal plugins in quick start.
`gerhut.github.io/store` is retired now. /cc @Gerhut

Closes #4579.
2020-05-28 17:15:43 +08:00
Zhiyuan He 8898310726
[Kubespray] make requirement check not to fail immediately (#4567)
* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix
2020-05-27 15:32:07 +08:00
Yifan Xiong f68e2e0c2e
[Device Plugin] Update device plugin yaml download script (#4563)
Update device plugin yaml download script, use svn client instead.
2020-05-26 12:30:44 +08:00
Binyang2014 570a352329
[storage] add write permission for dir under home folder (#4558) 2020-05-25 18:35:43 +08:00
Scarlett Li ca08bd19ae
Update README.md 2020-05-25 14:52:22 +08:00
Zhiyuan He ffff2ee5df
Cherry pick to master - Update PAI version (#4556) 2020-05-22 15:40:54 +08:00
Scarlett Li 81e029fb01
Update readme.md (#4549)
* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md (#4554)

* Update architecture pic in readme (#4553)

* fix link

* Remove coming soon

Co-authored-by: Zhiyuan He <362583303@qq.com>
Co-authored-by: Mingliang Tao <mintao@microsoft.com>
2020-05-22 15:40:31 +08:00
Zhiyuan He 2bbfde3295
Cherry pick to master - add release note (#4525)
* init

* init

* fix

* fix

* fix

* fix

* fix

* Update RELEASE_NOTE.md

* Update RELEASE_NOTE.md

* Add marketplace advantage (#4473)

* fix

* add version

* fix

* fix

* fix

Co-authored-by: Scarlett Li <39592018+scarlett2018@users.noreply.github.com>
Co-authored-by: Mingliang Tao <mintao@microsoft.com>
2020-05-22 15:40:16 +08:00
Binyang2014 cb392c1ac0
[Rest-server] Using hash function to generate port number (#4510)
Fix #4384
To avoid always retry due to port conflict, change to use:

(int(md5(podUid + taskPortName + portIndexForPortWithSameName)[0:12] ,16) +
 int(md5(podUid + taskPortName + portIndexForPortWithSameName)[12:24] ,16) +
 int(md5(podUid + taskPortName + portIndexForPortWithSameName)[24:32] ,16)) % 
 (globalPortEnd - globalPortStart) + globalPortStart
to generate port number.

Where globalPortEnd and globalPortStart can be configured by admin.

Notice:
This method is Best Efforts to avoid port conflict. Since we use hash function to generate port the possibility for port conflict is small.

We can not guarantee:
The ports in different jobs which running on same node will not conflict. (Same as previous)
The ports which belongs to one job but in different tasks will not conflict. (Different with previous)
All allocated ports in a single task will not conflict. (Different with previous)
2020-05-22 15:24:26 +08:00
Yifan Xiong 3e56bbf33e
[Rest Server] Support read only storage (#4523)
Support read only storage.
2020-05-21 13:53:07 +08:00
Yifan Xiong 8a1c57dbad
Force https if ssl cert is configured (#4550)
Force https if ssl cert is configured.
2020-05-21 11:38:47 +08:00
Binyang2014 0f05a1772b
limit PAI worker daemon port range (#4546)
refer to issue #4384.
Work host daemon may use port range which overlap PAI job port range. Change the work node port range to avoid conflict.
2020-05-19 22:43:50 +08:00
Zhiyuan He a2acd37144
Remove previous docs (#4545)
* fix

* fix

* fix

* fix

* fix

* bulk delete

* fix broken links

* fix

* fix

* trigger

* fix
2020-05-19 10:23:14 +08:00
YundongYe bdf7f49902
Configurable image_repo and extra_vars (#4528)
* Configurable image_repo and extra_vars

* Configurable image_repo and extra_vars
2020-05-18 11:56:51 +08:00
YundongYe fdcc098e8b
kubespray issue fix (#4537) 2020-05-18 11:52:49 +08:00
Scarlett Li d99d979f6e
Update system_architecture.md 2020-05-14 15:00:11 +08:00
Scarlett Li 9281c70983
Update README.md 2020-05-14 14:55:44 +08:00
Yifan Xiong f915f6ecda
[Docs] Remove outdated docs (#4446)
Remove outdated docs, including:
* `docs/zh_CN` which contains an outdated version of English docs
* YARN version related docs
* v1 job config examples
* refine README.md (#4505)
2020-05-13 19:21:07 +08:00
YundongYe dd08930431
Change default network plugin from weave to calico (#4519) 2020-05-13 13:24:16 +08:00
Zhiyuan He 861d58a19c
Fix upgrade doc to "migrate from yarn" (#4504)
* fix doc

* fix

* fix
2020-05-12 11:52:20 +08:00
Mingliang Tao 77a28db0d0
Convert totalGpuNumber from string to int (#4513) 2020-05-11 19:59:44 +08:00
yiyione 66bbda4d3c
fix IncorrectPasswordError typo (#4508) 2020-05-11 11:29:32 +08:00
Scarlett Li c607d55086
update readme.md
to replace the out of date "call for contribution" list
2020-05-08 14:58:51 +08:00
YundongYe ed6e66b369
Fix format string error (#4501) 2020-05-08 11:47:40 +08:00
Zhiyuan He 07a7965731
Select `default` vc when add new users (#4497)
* fix

* trigger
2020-05-07 13:34:40 +08:00
Yifan Xiong cebff8be2a
Update schema in storage api (#4491)
Update schema in storage api.
2020-05-07 13:08:00 +08:00
Mingliang Tao f7e155006e
Set user info after editing (#4495) 2020-05-07 10:44:05 +08:00
YundongYe 1dc242ade8
[pylon] support nginx ssl in basic mode. (#4493) 2020-05-07 10:16:15 +08:00
Zhiyuan He a8585bfe86
fix some typo in quick installation script (#4481) 2020-05-06 13:36:31 +08:00
YundongYe d57753a8ed
Openpai-Generator issue fix. (#4487) 2020-05-06 13:34:58 +08:00
YundongYe 4acfccf89b
configurable downlaod url (#4476) 2020-04-30 17:35:30 +08:00
dependabot[bot] d09a2ad470
Bump jquery from 3.4.1 to 3.5.0 in /src/webportal (#4475) 2020-04-30 05:29:25 +00:00
Mingliang Tao 9bb28da4b9
Refine job attempt schema (#4462) 2020-04-29 14:58:12 +08:00
Zhiyuan He 8364435db1
Change help links to user manual (#4427) 2020-04-28 16:07:35 +08:00
Yifan Xiong 060e769dd8
Update schema in vc api (#4464)
Update schema in vc api.
2020-04-28 14:56:45 +08:00