Marco Castelluccio
7c81a5ece9
Split regressor finder task in four separate tasks
...
A task to find commits to ignore, a task to classify commits between
bug-fixing vs not-bug-fixing, a task to find regressors using the
normal repo, and a task to find regressors using the tokenized
repo.
This way we can also find regressors for the two kinds of repos
in parallel.
Fixes #1273
Make the past bugs by function task depend on the task to classify
commits between bug-fixing and not-bug-fixing.
Fixes #1274
2020-02-19 12:25:18 +01:00
Marco Castelluccio
e4320c4cd0
Use a smaller worker type when possible
2020-02-17 14:07:08 +01:00
Marco Castelluccio
7e0bf44c4a
Add required s3 scopes to the task scheduling history retriever task
2020-02-16 02:50:24 +01:00
Marco Castelluccio
49e57e66a3
Upload ADR cache periodically while the task runs
...
This way we don't lose the work done so far if the (very long running) task fails
for unknown reasons.
2020-02-15 20:16:26 +01:00
Marco Castelluccio
d891d901dd
Increase max runtime and deadline for test scheduling tasks
2020-02-13 20:11:36 +01:00
Marco Castelluccio
5d4346911f
Remove artifacts from the regressor finder task, as they are not used anymore
...
The artifacts are now uploaded to S3
2020-02-13 13:23:34 +01:00
Marco Castelluccio
1d5cd27b88
Remove artifacts from microannotate generator tasks, as they are not used anymore
2020-02-13 13:22:25 +01:00
Marco Castelluccio
52a5a7335c
Make microanntate generator task upload the DB version to S3
...
This way, even if the task times out, we don't lose the work done so far.
N.B.: We'll need to also update the DB URLs, when the bugbug S3 bucket is made public.
2020-02-12 13:43:54 +01:00
Marco Castelluccio
caa10dc2e8
Upload regressor finder results to S3 periodically while the task is running
...
This way, even if the task times out, we don't lose the work done so far.
N.B.: We'll need to also update the DB URLs, when the bugbug S3 bucket is made public.
2020-02-12 13:42:21 +01:00
Marco Castelluccio
f2b9ebb304
Make deadlines 5 days again, as the Taskcluster change has been reverted
...
See https://bugzilla.mozilla.org/show_bug.cgi?id=1604175
2020-02-12 01:19:25 +01:00
Marco Castelluccio
aaa67b3b0a
Move past bugs by function task after its dependencies
2020-02-12 00:39:19 +01:00
Marco Castelluccio
26aca814be
Run push_data recipe to retrieve test scheduling info both at the label- and at the group- level
...
As part of this, update to latest ci-recipes, including a plethora of changes and improvements.
Part of #1125
2020-02-11 16:33:21 +01:00
Marco Castelluccio
9e0a3357ed
Update deadline of the function to retrieve past bugs by function to be longer than the deadlines of its dependencies
2020-02-11 11:28:58 +01:00
Marco Castelluccio
d6786a8b95
Make the past bugs by function task depend on the regressor finder task
...
Since it is using the database of bug-fixing commits, which is currently
generated by the regressor finder task.
2020-01-30 14:46:50 +01:00
Marco Castelluccio
244df4e361
Increase deadlines and max runtimes for the microannotate tasks
2020-01-29 10:28:14 +01:00
Marco Castelluccio
cd2d816903
Increase deadline of the regressor finder task
2020-01-29 10:25:51 +01:00
Marco Castelluccio
cca43b2b51
Allow the regressor finder task to run for longer
2020-01-28 15:40:13 +01:00
Marco Castelluccio
1d5ea94735
Use a bigger worker type for the regressor finder task
2020-01-13 00:40:10 +01:00
Marco Castelluccio
b7c917f169
Use a smaller worker type for the task to generate past test scheduling history
2020-01-13 00:40:05 +01:00
Marco Castelluccio
0f9e3b786a
Use a smaller worker type for the task to generate past bugs by function
2020-01-13 00:39:58 +01:00
Marco Castelluccio
8c6813c094
Update dependencies of the HTTP service Docker build task to match the models the HTTP service needs
2020-01-12 13:12:21 +01:00
Ayush Shridhar
1752c8ed25
add spambug to http service ( #1213 )
...
Co-authored-by: Marco Castelluccio <mcastelluccio@mozilla.com>
2020-01-03 16:30:45 +01:00
Marco Castelluccio
10135b6ef5
Make the microannotate_generator script generate a version file and upload it as an artifact
...
This is a pre-requisite to regenerating the repos when there are meaningful changes in
rust-code-analysis or microannotate.
2019-12-21 14:56:15 +01:00
Marco Castelluccio
6d74a6bc40
Index microannotate tasks
2019-12-21 14:56:15 +01:00
Ayush Shridhar
fa5c68ed3b
Train the SpamBug model automatically on Taskcluster ( #1192 )
2019-12-21 12:01:23 +01:00
Marco Castelluccio
e0a9579695
Silence detachedHead warning in prod integration test
2019-12-17 22:49:51 +01:00
Marco Castelluccio
ea6e697e69
Move regressor-finder task after its dependencies
2019-12-17 10:51:33 +01:00
Marco Castelluccio
110914f226
Make regressor-finder task depend on the tasks training the DefectEnhancementTask and Regression models
...
Since these models are being used in the regressor finder script.
2019-12-16 19:55:25 +01:00
Marco Castelluccio
ea75d40804
Use 86400 as the maximum runtime for tasks
2019-12-16 18:43:41 +01:00
Marco Castelluccio
a177f67fb5
Make annotate tasks part of the normal pipeline
2019-12-16 17:55:47 +01:00
Marco Castelluccio
ce2e355fe4
Update classify patch hook only after gathering past bugs by function
2019-12-16 17:55:47 +01:00
Marco Castelluccio
010e129039
Make past-bugs-by-function task depend on commit-retrieval and bugs-retrieval tasks
2019-12-16 17:50:51 +01:00
Marco Castelluccio
d5cb3c4cb6
Move past-bugs-by-function task directly in the normal data pipeline
...
And remove the hook which was previously running it.
2019-12-16 16:42:25 +01:00
Marco Castelluccio
55e9dc00be
Adjust tasks' deadlines
2019-12-16 16:32:46 +01:00
Marco Castelluccio
d9536d24b8
Define deadlines in days instead of hours, when it is many hours
2019-12-16 16:24:55 +01:00
Marco Castelluccio
768cdfc13a
Deploy hooks after we're done training the models they depend on
2019-12-12 15:28:55 +01:00
Marco Castelluccio
dc030357b0
Deploy hooks after the training pipeline is done
...
Fixes #1168
2019-12-11 21:40:27 +01:00
Marco Castelluccio
1d0d1216a3
Use larger instance for the commit retrieval task
2019-12-11 14:19:54 +01:00
Marco Castelluccio
a5c0bb1f43
Add a task to train the TestFailure model
...
First part of #1127
2019-12-03 23:09:11 +01:00
Marco Castelluccio
b5e812d96b
Use a bigger instance for the task to train the test selection model
2019-12-03 23:06:09 +01:00
Marco Castelluccio
50e86b38f0
Use LMDB as backend for shelve
...
LMDB is considerably faster than GDBM and NDBM.
Fixes #1035
2019-11-18 15:26:21 +01:00
Marco Castelluccio
f6f9358eba
Use a larger worker type for the test scheduling history generator
2019-11-12 18:46:20 +01:00
Marco Castelluccio
da317d0c0f
Use new compute-small and compute-large worker types according to task needs
2019-11-12 18:24:45 +01:00
Marco Castelluccio
38dbc22e4b
Fix notify routes
2019-11-11 21:48:58 +01:00
Brian Stack
f54149cd90
Update .taskcluster.yml for community cluster ( #1076 )
2019-11-08 22:13:10 +01:00
Marco Castelluccio
145a69e295
Use db module to handle the ADR cache
...
Also switch from xz compression to zst
2019-11-07 14:29:25 +01:00
Marco Castelluccio
146b8dfbe5
Use versioned image for the integrationt test that runs as part of the data pipeline
2019-11-07 01:40:24 +01:00
Marco Castelluccio
015497076c
No need to install dependencies in the integration test that runs as part of the data pipeline
...
As we are using the latest commit-retrieval image, which already
has all dependencies installed.
2019-11-07 01:39:56 +01:00
Marco Castelluccio
a0c7731c19
Always use --disable-pip-version-check and --no-cache-dir pip options
2019-11-07 01:34:34 +01:00
Marco Castelluccio
8805e9aa9d
Install python-pip for the integration test
2019-11-07 01:32:40 +01:00