Граф коммитов

1555 Коммитов

Автор SHA1 Сообщение Дата
Marco Castelluccio dc66faa4a0 Add Taskcluster index fallback URL in utils.get_last_modified too 2019-11-10 20:22:34 +01:00
Marco Castelluccio bcca240251 Remove unused pytest fixtures 2019-11-10 20:22:12 +01:00
Marco Castelluccio b5b53a29d5 Version 0.0.144 2019-11-10 20:05:11 +01:00
Marco Castelluccio d06f768c85 Bump Bugzilla DB version to refresh bugs
As there were recently some changes in DOM components
2019-11-10 20:02:38 +01:00
Marco Castelluccio dcbe49e967 Consider all commits in a given push when calculating past failure stats
Also store all revisions in the test scheduling data, so we can use them
in the test selection model too.

First part of #1017
2019-11-10 20:01:20 +01:00
Arjun Krishna f487893f7f Output prediction class name in bug_classifier script (#1094)
Fixes #1070
2019-11-10 18:17:39 +01:00
dependabot-preview[bot] b61e8aca7b Bump taskcluster from 22.1.0 to 22.1.1 (#1096)
Bumps [taskcluster](https://github.com/taskcluster/taskcluster) from 22.1.0 to 22.1.1.
- [Release notes](https://github.com/taskcluster/taskcluster/releases)
- [Changelog](https://github.com/taskcluster/taskcluster/blob/master/CHANGELOG.md)
- [Commits](https://github.com/taskcluster/taskcluster/compare/v22.1.0...v22.1.1)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-10 18:13:35 +01:00
dependabot-preview[bot] 78f8c3e54c Bump scipy from 1.3.1 to 1.3.2 (#1095)
Bumps [scipy](https://github.com/scipy/scipy) from 1.3.1 to 1.3.2.
- [Release notes](https://github.com/scipy/scipy/releases)
- [Commits](https://github.com/scipy/scipy/commits)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-10 18:12:58 +01:00
dependabot-preview[bot] 2124f60a9a Bump gunicorn from 19.9.0 to 20.0.0 (#1091)
Bumps [gunicorn](https://github.com/benoitc/gunicorn) from 19.9.0 to 20.0.0.
- [Release notes](https://github.com/benoitc/gunicorn/releases)
- [Commits](https://github.com/benoitc/gunicorn/compare/19.9.0...20.0.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-09 11:09:20 +01:00
dependabot-preview[bot] 7f1e1146d8 Bump tqdm from 4.37.0 to 4.38.0 (#1090)
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.37.0 to 4.38.0.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.37.0...v4.38.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-09 11:09:11 +01:00
dependabot-preview[bot] 7a2cb2f3ba Bump taskcluster from 22.0.0 to 22.1.0 (#1089)
Bumps [taskcluster](https://github.com/taskcluster/taskcluster) from 22.0.0 to 22.1.0.
- [Release notes](https://github.com/taskcluster/taskcluster/releases)
- [Changelog](https://github.com/taskcluster/taskcluster/blob/master/CHANGELOG.md)
- [Commits](https://github.com/taskcluster/taskcluster/compare/v22.0.0...v22.1.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-09 01:04:08 +01:00
Marco Castelluccio 039881e0a9 Add a few tests for utils.download_check_etag
The main objective was to make sure that the fallback introduced
by f54149cd90 is working correctly.
2019-11-08 23:49:36 +01:00
Marco Castelluccio a65fa07f8c Add mocks for fallback Taskcluster URLs
Follow-up to f54149cd90
2019-11-08 23:48:27 +01:00
Marco Castelluccio 612848e7a7 Fix linting issue in utils module
Follow-up to f54149cd90
2019-11-08 22:15:57 +01:00
Brian Stack f54149cd90 Update .taskcluster.yml for community cluster (#1076) 2019-11-08 22:13:10 +01:00
Marco Castelluccio abff8d3fb5 Version 0.0.143 2019-11-08 15:50:50 +01:00
dependabot-preview[bot] 715fd61a0c Bump orjson from 2.1.1 to 2.1.2 (#1088)
Bumps [orjson](https://github.com/ijl/orjson) from 2.1.1 to 2.1.2.
- [Release notes](https://github.com/ijl/orjson/releases)
- [Changelog](https://github.com/ijl/orjson/blob/master/CHANGELOG.md)
- [Commits](https://github.com/ijl/orjson/compare/2.1.1...2.1.2)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-08 11:25:03 +01:00
Marco Castelluccio 145a69e295 Use db module to handle the ADR cache
Also switch from xz compression to zst
2019-11-07 14:29:25 +01:00
Marco Castelluccio 9cb721af38 Version 0.0.142 2019-11-07 01:42:11 +01:00
Marco Castelluccio ec3bde2a11 No need to install the development requirements for the integration test 2019-11-07 01:41:47 +01:00
Marco Castelluccio 146b8dfbe5 Use versioned image for the integrationt test that runs as part of the data pipeline 2019-11-07 01:40:24 +01:00
Marco Castelluccio 015497076c No need to install dependencies in the integration test that runs as part of the data pipeline
As we are using the latest commit-retrieval image, which already
has all dependencies installed.
2019-11-07 01:39:56 +01:00
Marco Castelluccio a0c7731c19 Always use --disable-pip-version-check and --no-cache-dir pip options 2019-11-07 01:34:34 +01:00
Marco Castelluccio 8805e9aa9d Install python-pip for the integration test 2019-11-07 01:32:40 +01:00
Marco Castelluccio de664499ae Version 0.0.141 2019-11-07 01:23:56 +01:00
Marco Castelluccio 42e544e5af The first column of push_data.json rows now contain a list of revisions
Since f5059c4bd4.
2019-11-07 01:09:10 +01:00
Marco Castelluccio 601e551ea3 Stop fallbacking on old ADR cache URL 2019-11-07 01:09:10 +01:00
dependabot-preview[bot] 764ade7e3e Bump shap from 0.31.0 to 0.32.0 (#1083)
Bumps [shap](https://github.com/slundberg/shap) from 0.31.0 to 0.32.0.
- [Release notes](https://github.com/slundberg/shap/releases)
- [Commits](https://github.com/slundberg/shap/compare/0.31.0...0.32.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-06 18:27:44 +01:00
Ayush Shridhar 71789adf00 Add number of words in summary as a feature for the spambug model (#1079) 2019-11-06 16:26:47 +01:00
dependabot-preview[bot] 4b45f3292b Bump rq-dashboard from 0.5.3 to 0.6.0 (#1078)
Bumps [rq-dashboard](https://github.com/Parallels/rq-dashboard) from 0.5.3 to 0.6.0.
- [Release notes](https://github.com/Parallels/rq-dashboard/releases)
- [Commits](https://github.com/Parallels/rq-dashboard/compare/0.5.3...0.6.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-05 18:48:20 +01:00
Vishal b9f236216e Add 'arch' as a feature for test scheduling (#1040)
Fixes #1018
2019-11-05 18:03:08 +01:00
Marco Castelluccio 0778950295 Version 0.0.140 2019-11-05 13:13:07 +01:00
Marco Castelluccio 94d1262ffb Ignore build-docker-image-* jobs 2019-11-05 12:31:04 +01:00
Marco Castelluccio f6c022a087 Calculate median, percentages and correlation after ignoring 0 values 2019-11-05 12:28:55 +01:00
dependabot-preview[bot] 0941d52cd2 Bump taskcluster from 21.3.0 to 22.0.0 (#1073)
Bumps [taskcluster](https://github.com/taskcluster/taskcluster) from 21.3.0 to 22.0.0.
- [Release notes](https://github.com/taskcluster/taskcluster/releases)
- [Changelog](https://github.com/taskcluster/taskcluster/blob/master/CHANGELOG.md)
- [Commits](https://github.com/taskcluster/taskcluster/compare/v21.3.0...v22.0.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-05 00:54:20 +01:00
dependabot-preview[bot] b249365ff0 Bump apispec from 3.0.0 to 3.1.0 (#1074)
Bumps [apispec](https://github.com/marshmallow-code/apispec) from 3.0.0 to 3.1.0.
- [Release notes](https://github.com/marshmallow-code/apispec/releases)
- [Changelog](https://github.com/marshmallow-code/apispec/blob/dev/CHANGELOG.rst)
- [Commits](https://github.com/marshmallow-code/apispec/compare/3.0.0...3.1.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-05 00:54:06 +01:00
dependabot-preview[bot] 8588ef5c92 Bump marshmallow from 3.2.1 to 3.2.2 (#1075)
Bumps [marshmallow](https://github.com/marshmallow-code/marshmallow) from 3.2.1 to 3.2.2.
- [Release notes](https://github.com/marshmallow-code/marshmallow/releases)
- [Changelog](https://github.com/marshmallow-code/marshmallow/blob/dev/CHANGELOG.rst)
- [Commits](https://github.com/marshmallow-code/marshmallow/compare/3.2.1...3.2.2)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-05 00:51:44 +01:00
Marco Castelluccio fc67592888 Version 0.0.139 2019-11-04 16:07:30 +01:00
dependabot-preview[bot] b3ae6925bd Bump libmozdata from 0.1.59 to 0.1.60 (#1071)
Bumps [libmozdata](https://github.com/mozilla/libmozdata) from 0.1.59 to 0.1.60.
- [Release notes](https://github.com/mozilla/libmozdata/releases)
- [Commits](https://github.com/mozilla/libmozdata/commits)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-04 16:02:58 +01:00
dependabot-preview[bot] c7dd055bb3 Bump python-dateutil from 2.8.0 to 2.8.1 (#1072)
Bumps [python-dateutil](https://github.com/dateutil/dateutil) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/dateutil/dateutil/releases)
- [Changelog](https://github.com/dateutil/dateutil/blob/master/NEWS)
- [Commits](https://github.com/dateutil/dateutil/compare/2.8.0...2.8.1)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-11-04 16:02:40 +01:00
Marco Castelluccio cfc6e8b0cc Use a different background dataset according to the predicted class
If we predict a "risky" commit, compare it with a background dataset
of "clean" commits.
If we predict a "clean" commit, compare it with a background dataset
of "risky" commits.
2019-11-04 15:56:01 +01:00
Marco Castelluccio b571c4a843 Print a maximum of three features per row to avoid overflow 2019-11-04 15:56:01 +01:00
Marco Castelluccio eb3d56286e Improve calculation of shap values (and thus feature importance) in the binary classification case
Making it more in line with the multiclass case.

In particular, this is improving the case where the predicted class is 0 (e.g. a "clean" commit).
2019-11-04 15:54:26 +01:00
Marco Castelluccio 46e3b2c00c Add more defect/enhancement/task labels gathered from changed made by users on Bugzilla 2019-11-04 15:29:50 +01:00
Marco Castelluccio 409c06a593 Don't use internal label encoder, but the model interface to it 2019-11-04 12:19:51 +01:00
Marco Castelluccio 9756f4ff89 force_plot output was removed, so don't use it anymore in the bug classifier script 2019-11-04 12:19:12 +01:00
Marco Castelluccio c913f88302 Capitalize feature names after removing the common prefixes 2019-11-03 13:37:02 +01:00
Marco Castelluccio a5a39fbc4a Lower importance threshold to have more features to show 2019-11-03 13:26:02 +01:00
Marco Castelluccio a7b6a49b74 Find max directly instead of sorting 2019-11-03 13:14:24 +01:00
Marco Castelluccio 321076298e Actually get most explainable feature for each group
We were reverse sorting and then getting the last element, so we
were getting the lowest.

Follow-up for #1058
2019-11-03 12:53:19 +01:00