Граф коммитов

131 Коммитов

Автор SHA1 Сообщение Дата
Marco Castelluccio 7c81a5ece9 Split regressor finder task in four separate tasks
A task to find commits to ignore, a task to classify commits between
bug-fixing vs not-bug-fixing, a task to find regressors using the
normal repo, and a task to find regressors using the tokenized
repo.

This way we can also find regressors for the two kinds of repos
in parallel.

Fixes #1273

Make the past bugs by function task depend on the task to classify
commits between bug-fixing and not-bug-fixing.

Fixes #1274
2020-02-19 12:25:18 +01:00
Marco Castelluccio e4320c4cd0 Use a smaller worker type when possible 2020-02-17 14:07:08 +01:00
Marco Castelluccio 7e0bf44c4a Add required s3 scopes to the task scheduling history retriever task 2020-02-16 02:50:24 +01:00
Marco Castelluccio 49e57e66a3 Upload ADR cache periodically while the task runs
This way we don't lose the work done so far if the (very long running) task fails
for unknown reasons.
2020-02-15 20:16:26 +01:00
Marco Castelluccio d891d901dd Increase max runtime and deadline for test scheduling tasks 2020-02-13 20:11:36 +01:00
Marco Castelluccio 5d4346911f Remove artifacts from the regressor finder task, as they are not used anymore
The artifacts are now uploaded to S3
2020-02-13 13:23:34 +01:00
Marco Castelluccio 1d5cd27b88 Remove artifacts from microannotate generator tasks, as they are not used anymore 2020-02-13 13:22:25 +01:00
Marco Castelluccio 52a5a7335c Make microanntate generator task upload the DB version to S3
This way, even if the task times out, we don't lose the work done so far.
N.B.: We'll need to also update the DB URLs, when the bugbug S3 bucket is made public.
2020-02-12 13:43:54 +01:00
Marco Castelluccio caa10dc2e8 Upload regressor finder results to S3 periodically while the task is running
This way, even if the task times out, we don't lose the work done so far.
N.B.: We'll need to also update the DB URLs, when the bugbug S3 bucket is made public.
2020-02-12 13:42:21 +01:00
Marco Castelluccio f2b9ebb304 Make deadlines 5 days again, as the Taskcluster change has been reverted
See https://bugzilla.mozilla.org/show_bug.cgi?id=1604175
2020-02-12 01:19:25 +01:00
Marco Castelluccio aaa67b3b0a Move past bugs by function task after its dependencies 2020-02-12 00:39:19 +01:00
Marco Castelluccio 26aca814be Run push_data recipe to retrieve test scheduling info both at the label- and at the group- level
As part of this, update to latest ci-recipes, including a plethora of changes and improvements.

Part of #1125
2020-02-11 16:33:21 +01:00
Marco Castelluccio 9e0a3357ed Update deadline of the function to retrieve past bugs by function to be longer than the deadlines of its dependencies 2020-02-11 11:28:58 +01:00
Marco Castelluccio d6786a8b95 Make the past bugs by function task depend on the regressor finder task
Since it is using the database of bug-fixing commits, which is currently
generated by the regressor finder task.
2020-01-30 14:46:50 +01:00
Marco Castelluccio 244df4e361 Increase deadlines and max runtimes for the microannotate tasks 2020-01-29 10:28:14 +01:00
Marco Castelluccio cd2d816903 Increase deadline of the regressor finder task 2020-01-29 10:25:51 +01:00
Marco Castelluccio cca43b2b51 Allow the regressor finder task to run for longer 2020-01-28 15:40:13 +01:00
Marco Castelluccio 1d5ea94735 Use a bigger worker type for the regressor finder task 2020-01-13 00:40:10 +01:00
Marco Castelluccio b7c917f169 Use a smaller worker type for the task to generate past test scheduling history 2020-01-13 00:40:05 +01:00
Marco Castelluccio 0f9e3b786a Use a smaller worker type for the task to generate past bugs by function 2020-01-13 00:39:58 +01:00
Marco Castelluccio 8c6813c094 Update dependencies of the HTTP service Docker build task to match the models the HTTP service needs 2020-01-12 13:12:21 +01:00
Ayush Shridhar 1752c8ed25 add spambug to http service (#1213)
Co-authored-by: Marco Castelluccio <mcastelluccio@mozilla.com>
2020-01-03 16:30:45 +01:00
Marco Castelluccio 10135b6ef5 Make the microannotate_generator script generate a version file and upload it as an artifact
This is a pre-requisite to regenerating the repos when there are meaningful changes in
rust-code-analysis or microannotate.
2019-12-21 14:56:15 +01:00
Marco Castelluccio 6d74a6bc40 Index microannotate tasks 2019-12-21 14:56:15 +01:00
Ayush Shridhar fa5c68ed3b Train the SpamBug model automatically on Taskcluster (#1192) 2019-12-21 12:01:23 +01:00
Marco Castelluccio e0a9579695 Silence detachedHead warning in prod integration test 2019-12-17 22:49:51 +01:00
Marco Castelluccio ea6e697e69 Move regressor-finder task after its dependencies 2019-12-17 10:51:33 +01:00
Marco Castelluccio 110914f226 Make regressor-finder task depend on the tasks training the DefectEnhancementTask and Regression models
Since these models are being used in the regressor finder script.
2019-12-16 19:55:25 +01:00
Marco Castelluccio ea75d40804 Use 86400 as the maximum runtime for tasks 2019-12-16 18:43:41 +01:00
Marco Castelluccio a177f67fb5 Make annotate tasks part of the normal pipeline 2019-12-16 17:55:47 +01:00
Marco Castelluccio ce2e355fe4 Update classify patch hook only after gathering past bugs by function 2019-12-16 17:55:47 +01:00
Marco Castelluccio 010e129039 Make past-bugs-by-function task depend on commit-retrieval and bugs-retrieval tasks 2019-12-16 17:50:51 +01:00
Marco Castelluccio d5cb3c4cb6 Move past-bugs-by-function task directly in the normal data pipeline
And remove the hook which was previously running it.
2019-12-16 16:42:25 +01:00
Marco Castelluccio 55e9dc00be Adjust tasks' deadlines 2019-12-16 16:32:46 +01:00
Marco Castelluccio d9536d24b8 Define deadlines in days instead of hours, when it is many hours 2019-12-16 16:24:55 +01:00
Marco Castelluccio 768cdfc13a Deploy hooks after we're done training the models they depend on 2019-12-12 15:28:55 +01:00
Marco Castelluccio dc030357b0 Deploy hooks after the training pipeline is done
Fixes #1168
2019-12-11 21:40:27 +01:00
Marco Castelluccio 1d0d1216a3 Use larger instance for the commit retrieval task 2019-12-11 14:19:54 +01:00
Marco Castelluccio a5c0bb1f43 Add a task to train the TestFailure model
First part of #1127
2019-12-03 23:09:11 +01:00
Marco Castelluccio b5e812d96b Use a bigger instance for the task to train the test selection model 2019-12-03 23:06:09 +01:00
Marco Castelluccio 50e86b38f0 Use LMDB as backend for shelve
LMDB is considerably faster than GDBM and NDBM.

Fixes #1035
2019-11-18 15:26:21 +01:00
Marco Castelluccio f6f9358eba Use a larger worker type for the test scheduling history generator 2019-11-12 18:46:20 +01:00
Marco Castelluccio da317d0c0f Use new compute-small and compute-large worker types according to task needs 2019-11-12 18:24:45 +01:00
Marco Castelluccio 38dbc22e4b Fix notify routes 2019-11-11 21:48:58 +01:00
Brian Stack f54149cd90 Update .taskcluster.yml for community cluster (#1076) 2019-11-08 22:13:10 +01:00
Marco Castelluccio 145a69e295 Use db module to handle the ADR cache
Also switch from xz compression to zst
2019-11-07 14:29:25 +01:00
Marco Castelluccio 146b8dfbe5 Use versioned image for the integrationt test that runs as part of the data pipeline 2019-11-07 01:40:24 +01:00
Marco Castelluccio 015497076c No need to install dependencies in the integration test that runs as part of the data pipeline
As we are using the latest commit-retrieval image, which already
has all dependencies installed.
2019-11-07 01:39:56 +01:00
Marco Castelluccio a0c7731c19 Always use --disable-pip-version-check and --no-cache-dir pip options 2019-11-07 01:34:34 +01:00
Marco Castelluccio 8805e9aa9d Install python-pip for the integration test 2019-11-07 01:32:40 +01:00