dependabot[bot]
6ee196fb8f
Bump axios from 0.21.0 to 0.21.2 in /src/alert-manager/src/alert-handler ( #5691 )
2022-01-25 01:45:30 +00:00
dependabot[bot]
6482ed9e34
Bump follow-redirects from 1.14.4 to 1.14.7 in /contrib/submit-job-v2 ( #5683 )
2022-01-25 01:44:45 +00:00
dependabot[bot]
754492b185
Bump shelljs from 0.8.4 to 0.8.5 in /src/alert-manager/src/alert-handler ( #5687 )
2022-01-25 01:43:57 +00:00
dependabot[bot]
74a8bc155b
Bump follow-redirects from 1.14.4 to 1.14.7 in /src/webportal ( #5680 )
2022-01-25 01:40:14 +00:00
Yuqi Wang
b0e4703ff5
Update FC image to v1.0.0 ( #5689 )
...
Ref: https://github.com/microsoft/frameworkcontroller/releases/tag/v1.0.0
2022-01-24 13:03:30 +08:00
Yi Yi
529db900c3
Add release note for v1.8.1 ( #5656 )
...
* add release note for v1.8.1
* update the version number in docs
* update release month
2021-12-27 14:09:57 +08:00
siaimes
6a2e836ef3
add summary for usage report alerts
2021-12-03 16:15:07 +08:00
Binyang2014
ea19af183e
update webportal dependencies ( #5635 )
...
update webportal dependencies
2021-10-20 15:19:03 +08:00
Binyang2014
4839405399
update CI ( #5636 )
...
Fix #5620. Ubuntu 16.04 is not supported in GitHub Actions, so remove this OS type.
Also fix a lint issue.
2021-10-20 13:25:57 +08:00
dependabot[bot]
6c505462d2
Bump axios from 0.21.1 to 0.21.4 in /contrib/submit-job-v2 ( #5618 )
...
Bumps [axios](https://github.com/axios/axios) from 0.21.1 to 0.21.4.
- [Release notes](https://github.com/axios/axios/releases)
- [Changelog](https://github.com/axios/axios/blob/master/CHANGELOG.md)
- [Commits](https://github.com/axios/axios/compare/v0.21.1...v0.21.4)
---
updated-dependencies:
- dependency-name: axios
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-09-22 19:38:00 -07:00
Binyang2014
d9e5074111
remove log4j for fixing security issue ( #5616 )
...
Remove log4j from frameworklauncher.
Framework launcher is not used in the pure k8s version, and it depends on log4j, which has a high-severity security issue.
Remove the log4j dependency to fix the security issue.
2021-09-15 16:15:37 +08:00
dependabot[bot]
24b847dba4
Bump axios from 0.21.1 to 0.21.4 in /src/webportal ( #5617 )
...
Bumps [axios](https://github.com/axios/axios) from 0.21.1 to 0.21.4.
- [Release notes](https://github.com/axios/axios/releases)
- [Changelog](https://github.com/axios/axios/blob/master/CHANGELOG.md)
- [Commits](https://github.com/axios/axios/compare/v0.21.1...v0.21.4)
---
updated-dependencies:
- dependency-name: axios
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-09-15 15:47:26 +08:00
dependabot[bot]
5423166a7d
Bump snakeyaml from 1.18 to 1.26 in /subprojects/frameworklauncher/yarn ( #5518 )
...
Bumps [snakeyaml](https://bitbucket.org/asomov/snakeyaml) from 1.18 to 1.26.
- [Commits](https://bitbucket.org/asomov/snakeyaml/branches/compare/snakeyaml-1.26..v1.18)
---
updated-dependencies:
- dependency-name: org.yaml:snakeyaml
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-09-15 15:43:08 +08:00
dependabot[bot]
ccdb384080
Bump httpclient in /subprojects/frameworklauncher/yarn ( #5516 )
...
Bumps httpclient from 4.3.6 to 4.5.13.
---
updated-dependencies:
- dependency-name: org.apache.httpcomponents:httpclient
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-09-15 15:23:59 +08:00
dependabot[bot]
6634f21254
Bump path-parse from 1.0.6 to 1.0.7 in /src/database-controller/src ( #5604 )
2021-09-06 07:33:23 +00:00
dependabot[bot]
54b3954775
Bump path-parse from 1.0.6 to 1.0.7 in /src/database-controller/sdk ( #5603 )
2021-09-06 07:32:59 +00:00
dependabot[bot]
244e8edc14
Bump url-parse from 1.5.1 to 1.5.3 in /src/webportal ( #5602 )
2021-09-06 07:32:30 +00:00
dependabot[bot]
1500db9c72
Bump url-parse from 1.5.1 to 1.5.3 in /contrib/submit-job-v2 ( #5601 )
2021-09-06 07:32:01 +00:00
dependabot[bot]
5f949b2b7a
Bump glob-parent from 5.1.1 to 5.1.2 in /src/database-controller/sdk ( #5529 )
2021-09-06 07:27:26 +00:00
dependabot[bot]
f53130bcb9
Bump glob-parent from 5.1.1 to 5.1.2 in /src/database-controller/src ( #5530 )
2021-09-06 07:22:16 +00:00
dependabot[bot]
981f17c5f0
Bump glob-parent from 5.1.1 to 5.1.2 in /src/rest-server ( #5526 )
2021-09-06 07:16:02 +00:00
dependabot[bot]
815ba939e6
Bump path-parse from 1.0.6 to 1.0.7 in /contrib/submit-job-v2 ( #5599 )
2021-08-12 08:31:42 +00:00
dependabot[bot]
71caba55a2
Bump path-parse from 1.0.6 to 1.0.7 in /src/webportal ( #5598 )
2021-08-12 08:31:17 +00:00
dependabot[bot]
64ed3af85d
Bump color-string from 1.5.3 to 1.6.0 in /src/webportal ( #5594 )
2021-08-12 08:26:58 +00:00
dependabot[bot]
5ef2b63f6f
Bump path-parse from 1.0.6 to 1.0.7 in /src/rest-server ( #5597 )
2021-08-12 08:25:49 +00:00
dependabot[bot]
6f6e01cd3d
Bump postcss from 7.0.14 to 7.0.36 in /contrib/submit-job-v2 ( #5532 )
2021-08-12 08:24:58 +00:00
dependabot[bot]
d1d206b4ba
Bump postcss from 7.0.17 to 7.0.36 in /src/webportal ( #5531 )
2021-08-12 08:24:55 +00:00
dependabot[bot]
aa8626a0e5
Bump merge-deep from 3.0.2 to 3.0.3 in /src/webportal ( #5524 )
2021-08-12 08:23:41 +00:00
Zhiyuan He
cba9e46145
fix link in readme ( #5595 )
2021-08-12 11:17:52 +08:00
Binyang2014
6409d891e8
Bump runtime version ( #5600 )
2021-08-12 10:39:40 +08:00
Zhiyuan He
19e11d88e5
fix doc related to china deployment ( #5593 )
...
* fix
* fix
2021-08-10 16:02:12 +08:00
Guoxin
bf9290e8db
adjust grafana to fit more metrics ( #5591 )
...
- support more metrics, including
  - node_memory_bytes with `type` label
  - node_disk_other_bytes_total, task_block_other_byte
  - get task cpu utilization with `task_cpu_seconds_total`
  - task_network_receive_bytes_total, task_network_transmit_bytes_total
- avoid wrongly computed 100% cpu utilization by using `idelta`
- use `irate` instead of `rate` for fast-moving metrics & change the computing interval
- set `editable` as true in all the dashboards
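
To illustrate why `irate` suits fast-moving metrics better than `rate`, here is a minimal sketch in Python. It is a simplification, not Prometheus's implementation: the real `rate` also extrapolates to the window boundaries. The function names and the sample data are hypothetical.

```python
from typing import List, Tuple

# One scraped sample: (unix timestamp in seconds, counter value).
Sample = Tuple[float, float]


def _increase(prev: float, curr: float) -> float:
    """Increase between two counter readings; a drop is treated as a reset to zero."""
    return curr if curr < prev else curr - prev


def simple_rate(samples: List[Sample]) -> float:
    """Average per-second increase over the whole window (no extrapolation)."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    total = sum(_increase(a[1], b[1]) for a, b in zip(samples, samples[1:]))
    return total / (samples[-1][0] - samples[0][0])


def simple_irate(samples: List[Sample]) -> float:
    """Per-second increase computed from the last two samples only."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    (t0, v0), (t1, v1) = samples[-2], samples[-1]
    return _increase(v0, v1) / (t1 - t0)


# Hypothetical counter: flat for 45 s, then a burst in the final 15 s.
window = [(0.0, 100.0), (15.0, 100.0), (30.0, 100.0), (45.0, 100.0), (60.0, 160.0)]
print(simple_rate(window))   # 1.0  (burst is averaged over the full minute)
print(simple_irate(window))  # 4.0  (burst is visible at full height)
```

The averaged value smooths the burst away, while the last-two-samples value reacts immediately, which is the trade-off the dashboard change relies on.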
2021-08-10 14:29:09 +08:00
Guoxin
2f1fcab9dc
Add Prometheus Pushgateway as an optional service ( #5590 )
...
- Add an optional service Prometheus Pushgateway
- add a container `metrics-cleaner` to clean Pushgateway metrics at a fixed interval
- add prometheus-pushgateway in job-exporter
- set `honor_labels` as true in Prometheus
2021-08-10 14:26:31 +08:00
siaimes
e5aef0d344
fix missing `WEBPORTAL_URL` issue when installing services ( #5538 )
...
[issue comment](https://github.com/microsoft/pai/issues/5445#issuecomment-827309308)
2021-08-09 10:46:15 +08:00
siaimes
4eef17effc
Use sed instead of pip to change ansible version ( #5573 )
...
Signed-off-by: siaimes <34199488+siaimes@users.noreply.github.com>
2021-07-29 10:21:48 +08:00
Zhiyuan He
b4ab39cc55
make enable_docker_cache effective ( #5574 )
2021-07-28 10:30:31 +08:00
Binyang2014
8cd594733c
Fix: change tail log to 16KB ( #5575 )
2021-07-27 17:18:22 +08:00
siaimes
b9e16c78ab
Fix update docker cache error ( #5539 )
...
Fix update docker cache error: [issue comment](https://github.com/microsoft/pai/issues/5445#issuecomment-826238676).
If /etc/docker/daemon.json doesn't exist or is an empty file, the script will fail.
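
The failure mode described above (a missing or empty JSON file) can be guarded against with a small helper. This is a sketch only, not the script changed in this commit; the helper name `load_json_or_empty` is hypothetical.

```python
import json
from pathlib import Path


def load_json_or_empty(path: str) -> dict:
    """Return the parsed JSON content, or {} if the file is missing or blank."""
    p = Path(path)
    if not p.is_file():
        # File does not exist: start from an empty configuration.
        return {}
    text = p.read_text(encoding="utf-8")
    if not text.strip():
        # File exists but holds only whitespace: json.loads would raise here.
        return {}
    return json.loads(text)
```

A caller could then run `config = load_json_or_empty("/etc/docker/daemon.json")`, update the keys it needs, and write the result back, without special-casing a fresh host.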
2021-07-18 18:19:04 +08:00
Yi Yi
68d29eff32
Add release note for v1.8.0 ( #5556 )
...
* Add release note for v1.8.0
* update
* update installation-guide
* update image tag in kubespray
2021-07-14 17:22:12 +08:00
Yi Yi
f34c672222
Fix alerts data error in webportal ( #5562 )
2021-07-14 17:21:24 +08:00
Guoxin
46c56243a2
Fix get alerts API issue ( #5560 )
2021-07-13 15:40:03 +08:00
Guoxin
89525cd543
fix cluster utilization pylint issue ( #5551 )
2021-07-02 20:02:24 +08:00
Starmie@Choice Specs
baf35d9a93
turn on all nodes if has_pending_pods ( #5545 )
...
Co-authored-by: Chengruidong Zhang (FA Talent) <v-chenzhang@microsoft.com>
2021-06-25 13:38:25 +08:00
Starmie@Choice Specs
34fc8f600d
Autoscaler in the main docs ( #5523 )
...
* Autoscaler in the main doc
* autoscaler in the catalogue
* reindex
Co-authored-by: Chengruidong Zhang (FA Talent) <v-chenzhang@microsoft.com>
2021-06-21 10:55:01 +08:00
Guoxin
bcdeb64ca7
fix alert-manager config generation issue ( #5534 )
2021-06-16 14:37:03 +08:00
Binyang2014
650a90d3a1
set cluster.advertise-address to avoid docker use non-private ip ( #5533 )
...
If Docker uses a non-private IP, Alertmanager fails with the error: alertmanager: no private IP found.
Set cluster.advertise-address to avoid this issue.
Refer: https://github.com/prometheus/alertmanager/issues/2284#issuecomment-640044282
2021-06-16 14:36:16 +08:00
Guoxin
4a2c46b5b0
[job status change notif] bug fix ( #5521 )
...
* fix link issue in email templates
* fix stopped / running status check issue
* refine doc
2021-06-10 14:59:36 +08:00
Yi Yi
5153a4d9a6
[Webportal] Support Job Priority in job-list ( #5525 )
...
* update hivedScheduler.jobPriorityClass spelling in db controller
* add jobPriority to frameworkConverter
* update
* update
* update
* update
* update swagger
* add default jobPriority support
* fix
* fix
* add job priority to table and ordering
* add priority filter
* update
* fix
* fix
* fix lint
2021-06-10 14:23:50 +08:00
Yi Yi
6625542870
[Rest-server] Update type of TaskUid in swagger ( #5517 )
...
* Update type of TaskUid in swagger
* update
* update version
* update
2021-06-08 14:12:55 +08:00
dependabot[bot]
1e19e83eb5
Bump ws from 6.2.1 to 6.2.2 in /src/webportal ( #5514 )
2021-06-07 08:46:07 +00:00