Commit graph

1442 commits

Author SHA1 Message Date
dependabot-preview[bot] 6b6bb726a9 Bump wmd from 1.3.1 to 1.3.2 (#1031)
Bumps [wmd](https://github.com/src-d/wmd-relax) from 1.3.1 to 1.3.2.
- [Release notes](https://github.com/src-d/wmd-relax/releases)
- [Commits](https://github.com/src-d/wmd-relax/commits)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-21 14:02:05 +01:00
dependabot-preview[bot] a36edbbe14 Bump shap from 0.30.2 to 0.31.0 (#1030)
Bumps [shap](https://github.com/slundberg/shap) from 0.30.2 to 0.31.0.
- [Release notes](https://github.com/slundberg/shap/releases)
- [Commits](https://github.com/slundberg/shap/compare/0.30.2...0.31.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-21 10:48:00 +01:00
Marco Castelluccio 518155a7b9 Specify matplotlib in the requirements
It was indirectly installed thanks to shap (until 0.31.0)
2019-10-21 10:33:39 +01:00
Marco Castelluccio 8becafa293 Version 0.0.134 2019-10-20 14:52:54 +01:00
Marco Castelluccio 0cfacecb57 Fix push_data.json.zst artifact path 2019-10-20 14:04:20 +01:00
Marco Castelluccio f2ff1c5a14 Version 0.0.133 2019-10-19 23:07:28 +01:00
Marco Castelluccio 35b4544718 Fix ADR cache fallback URL 2019-10-19 23:07:12 +01:00
Marco Castelluccio 6546cb0682 Fix log message 2019-10-19 23:06:01 +01:00
Marco Castelluccio cba3227296 Version 0.0.132 2019-10-19 21:41:36 +01:00
Marco Castelluccio 585741af22 Try downloading ADR cache from previous index URL if it's not available at the current index URL
We can remove this fallback after the first successful run of the new task.
2019-10-19 21:26:42 +01:00
Marco Castelluccio 940e97cdcf Be quiet when installing bugbug package in the test scheduling history push data retrieval task 2019-10-19 21:22:42 +01:00
dependabot-preview[bot] 0595eb99ad Bump pandas from 0.25.1 to 0.25.2 (#1027)
Bumps [pandas](https://github.com/pandas-dev/pandas) from 0.25.1 to 0.25.2.
- [Release notes](https://github.com/pandas-dev/pandas/releases)
- [Changelog](https://github.com/pandas-dev/pandas/blob/master/RELEASE.md)
- [Commits](https://github.com/pandas-dev/pandas/compare/v0.25.1...v0.25.2)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-19 11:42:17 +01:00
Marco Castelluccio 39f4a0509b Version 0.0.131 2019-10-18 14:21:19 +01:00
Marco Castelluccio 86a6d0a6b9 Fix dependency name
Regressed by dc3c3b83da
2019-10-18 14:20:57 +01:00
Marco Castelluccio db0486513b Version 0.0.130 2019-10-18 14:03:07 +01:00
Marco Castelluccio 5713425500 Use relman-svc compute for the ADR task
Since the tasks were split with dc3c3b83da,
the ADR task is not bounded by performance yet.
2019-10-18 13:38:13 +01:00
Marco Castelluccio dc3c3b83da Split test scheduling history retriever task into two 2019-10-18 13:33:53 +01:00
Marco Castelluccio 916f6569a3 Refactor test scheduling history retriever script a bit
Split the ADR retrieval code from the processing of ADR-retrieved data
2019-10-18 13:08:12 +01:00
Marco Castelluccio 5180afd05c Use previous test scheduling history DB when generating a new one with new pushes
Fixes #983
2019-10-18 13:01:18 +01:00
Caitlin White ef1cdc8394 Add basic test for defect_enhancement_task model (#1019)
Fixes #438
2019-10-18 11:56:23 +02:00
dependabot-preview[bot] 13973a6cc0 Bump taskcluster from 20.0.0 to 21.0.0 (#1026)
Bumps [taskcluster](https://github.com/taskcluster/taskcluster) from 20.0.0 to 21.0.0.
- [Release notes](https://github.com/taskcluster/taskcluster/releases)
- [Changelog](https://github.com/taskcluster/taskcluster/blob/master/CHANGELOG.md)
- [Commits](https://github.com/taskcluster/taskcluster/compare/v20.0.0...v21.0.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-18 11:55:23 +02:00
dependabot-preview[bot] ecf037cde8 Bump numpy from 1.17.2 to 1.17.3 (#1025)
Bumps [numpy](https://github.com/numpy/numpy) from 1.17.2 to 1.17.3.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/master/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.17.2...v1.17.3)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-18 11:54:50 +02:00
Marco Castelluccio bf95cd48c4 Version 0.0.129 2019-10-18 01:25:31 +01:00
Marco Castelluccio 3324f91f00 Keep using standard json library in the test scheduling history retrieval task
The orjson library doesn't support stream reading from a file, so here
the standard json library is preferable.
2019-10-18 01:25:31 +01:00
dependabot-preview[bot] 25e29d4fe0 Bump rs-parsepatch from 0.2.3 to 0.2.4 (#1024)
Bumps rs-parsepatch from 0.2.3 to 0.2.4.

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-17 11:27:47 +02:00
Marco Castelluccio b4a56e7dcd Version 0.0.128 2019-10-16 16:28:06 +02:00
Marco Castelluccio 517519d7d6 Add a test for the removal of a non-existent DB 2019-10-16 16:20:53 +02:00
Marco Castelluccio 72e8961675 Don't fail when trying to remove a DB which does not exist
For example, the bug retriever is trying to delete old bugs, but
the deletion fails when we increase the version of the bugs DB and
we need to download a completely new one.
2019-10-16 16:19:11 +02:00
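The behaviour described in this commit message can be illustrated with a minimal sketch. It assumes the DB is a single file on disk; the helper name `remove_if_exists` is hypothetical and is not taken from the repository.

```python
import os


def remove_if_exists(path):
    """Delete the file at `path`.

    Returns True if a file was deleted and False if it was already absent,
    so callers do not fail when the DB has not been downloaded yet.
    (Hypothetical helper, for illustration only.)
    """
    try:
        os.remove(path)
    except FileNotFoundError:
        # Nothing to delete: treat it as a no-op instead of an error.
        return False
    return True
```

Catching `FileNotFoundError` rather than checking `os.path.exists` first avoids a race between the check and the deletion.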
Marco Castelluccio ef767034df orjson.dumps returns bytes, not str, so no encoding step is needed 2019-10-16 16:18:50 +02:00
Marco Castelluccio 89e88981b7 Use orjson in cases where we need high performance 2019-10-16 16:10:29 +02:00
Marco Castelluccio deed153210 Version 0.0.127 2019-10-16 15:18:36 +02:00
dependabot-preview[bot] 90c676a4ce Bump taskcluster from 19.0.0 to 20.0.0 (#1020)
Bumps [taskcluster](https://github.com/taskcluster/taskcluster) from 19.0.0 to 20.0.0.
- [Release notes](https://github.com/taskcluster/taskcluster/releases)
- [Changelog](https://github.com/taskcluster/taskcluster/blob/master/CHANGELOG.md)
- [Commits](https://github.com/taskcluster/taskcluster/compare/v19.0.0...v20.0.0)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-16 14:47:04 +02:00
dependabot-preview[bot] 94d6d5cb4c Bump rs-parsepatch from 0.2.2 to 0.2.3 (#1022)
Bumps rs-parsepatch from 0.2.2 to 0.2.3.

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>
2019-10-16 14:36:49 +02:00
Marco Castelluccio b2ab81d3e8 Pin CI image to Python 3.7 2019-10-16 14:07:03 +02:00
Marco Castelluccio 898dea8bdb Count total number of commits while iterating to get the file counts 2019-10-16 10:09:43 +02:00
Marco Castelluccio 7007118e88 Update Bugzilla DB version, as there are some component changes 2019-10-16 10:07:30 +02:00
Marco Castelluccio b3a3763a86 Version 0.0.126 2019-10-15 23:25:26 +02:00
Marco Castelluccio e230508a30 Extractor's transform argument must be a function returning an iterable
Regression from c9f8613d79
2019-10-15 23:04:44 +02:00
Marco Castelluccio a6a4b471dd Version 0.0.125 2019-10-15 14:20:26 +02:00
Marco Castelluccio 3d36786b58 Install libffi-dev, required for cffi 2019-10-15 14:20:09 +02:00
Marco Castelluccio 91ea555aca Version 0.0.124 2019-10-15 12:09:51 +02:00
Marco Castelluccio 7f8e08c20d Add a task to train the test selection model 2019-10-12 17:31:28 +01:00
Marco Castelluccio c16bc7a362 Rename test_select.py to testselect.py
To avoid issues with the way the trainer script stores models on disk.
2019-10-12 17:31:03 +01:00
Marco Castelluccio 795acbcbc9 Specify DBs required by the test selection model 2019-10-12 17:30:50 +01:00
Marco Castelluccio 22a26e0873 Define required DBs in an attribute of the model
This way the trainer script doesn't have to rely on a brittle
check on the base class of the model

Fixes #520
2019-10-12 16:10:46 +01:00
Marco Castelluccio 863c1fbc48 Implement a WIP model for test selection 2019-10-12 00:58:39 +01:00
Marco Castelluccio 1bf5a8edf3 Make train/test split configurable by subclasses of Model
Some models might want to implement a different way to split between
train and test sets. For example, for test scheduling we'll need to
use the more distant past as the training set, and the recent past as the
test set. We can't mix them or the evaluation will be falsified.
2019-10-12 00:58:39 +01:00
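A time-ordered split of the kind this commit message describes might look like the sketch below. It assumes the items are already sorted from oldest to newest; the function name `time_ordered_split` and the `test_fraction` parameter are hypothetical and do not reflect the actual Model API.

```python
def time_ordered_split(items, test_fraction=0.1):
    """Split chronologically sorted items into (train, test).

    The oldest items form the training set and the most recent ones form
    the test set, so no future information leaks into training.
    (Hypothetical helper; assumes `items` is sorted oldest to newest.)
    """
    if not 0 < test_fraction < 1:
        raise ValueError("test_fraction must be between 0 and 1")
    n_test = int(len(items) * test_fraction)
    split_at = len(items) - n_test
    return items[:split_at], items[split_at:]
```

A random shuffle before splitting would mix old and recent pushes, which is the situation the commit message warns against.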
Marco Castelluccio c68d469489 Make it possible to use the test job related features with the commit extractor 2019-10-12 00:58:39 +01:00
Marco Castelluccio ef0e4baa16 Add a module to extract test job related features 2019-10-12 00:58:39 +01:00
Marco Castelluccio c9f8613d79 During training, don't build a list with all trainable elements, but use a generator
It could use too much memory if there are too many elements.
2019-10-12 00:58:39 +01:00