Граф коммитов

18 Коммитов

Автор SHA1 Сообщение Дата
Anurag Aggarwal 656d6e844b Remove bugs_retrieval image and use the base image instead in its place (#691)
* Fixes #633
2019-07-12 14:17:41 +02:00
Boris Feld 7459f79317 Use the base image for training models (#656)
Fixes #350
2019-06-29 00:01:51 +02:00
Boris Feld 417f2aae69 Use a single spawn image for both hooks (#594)
This removes the need to keep two different Docker images for spawning the
data pipeline and the check pipeline.

Fixes #365
2019-06-14 18:54:21 +02:00
Boris Feld a8faa48d8a Support classifying batches of bugs with a background worker (#321) 2019-06-07 21:22:14 +02:00
Marco Castelluccio 72ddfea2e3 Add a Docker image for the task to train the Regressor model 2019-06-03 21:46:35 +02:00
Marco Castelluccio aebc3c4414 Add bugbug-train-duplicate to docker-compose.yml 2019-05-31 18:03:03 +02:00
Marco Castelluccio cd0405d223 Add a Docker image for the Backout model 2019-05-30 13:24:55 +02:00
Marco Castelluccio d4042a6eba Add train-regressionrange and train-stepstoreproduce to docker-compose.yml 2019-05-25 11:42:13 +02:00
Boris Feld 1bae5834ab Implement deployment to Heroku (#458) 2019-05-23 20:39:02 +02:00
Boris Feld f4b2b938be
Add basic check method and check script (#341)
* Add basic check method and check script

* Ensure the check of component will correctly use super result

* Add required infra to schedule model checks

* Add scheduling bits for the model checks

* Remove the filtering on classification

* Extract counting bugs to a new function in bugzilla.py

* Also checks conflated components

* Fix new hook id

* Call bugzilla with the count_only param to speed up the check

* Fix the new hook scope to match the hook id

* Fix component model check after previous refactoring

* Fix component model check method

* Use a bugzilla report for even faster component model check

* Clarify get_product_component_count docstring

We are already filtering out full component with 0 bugs

* Update conflated components mapping check

A conflated component could also be part of the conflated components mapping

* Distinguish between non-existing full components and empty full components

* Remove the filter on resolution and unnecessary url params

* Update component check method

Keep checks as separate as possible for clarity, we could merge them or makes
them faster later

* Generate dynamically the CSV report url

* Fix Docker image name the hook

* Implement component check number 5

Get the meaningful components for the last 6 months

* Handle reviews comments

* Remove extraneous print

* Removes TODO

* Use a different threshold ration when checking for new meaningful components

As we are only checking new bugs for 6 months, adjust the threshold ration to
be less sensitive to occasional burst ob bugs for q given component.

* Reduce the threshold ratio

As we check on a disjoint time window, reduce the chance of false positives

* Handle review nits

* Fix last nits
2019-05-10 12:20:23 +02:00
Marco 6f09488573
Rename mozilla/bugbug-train-defect image to mozilla/bugbug-train-defectenhancementtask (#375)
Fixes #364.
2019-05-09 23:36:38 +02:00
Marco Castelluccio c3f55e682a Rename train-defect to train-defectenhancementtask 2019-05-07 13:16:22 +02:00
Boris Feld 297963e4ce Skip checking models while building the http service image, and only push it as part of the pipeline (#331)
* Add a way to skip checking models while building the http service image

* Don't push the http service on release

It isn't built with the real models on release

* Use taskboot 0.1.1
2019-05-02 23:18:51 +02:00
Boris Feld 369b44ea02 Update the index URLs in bugbug (#328)
* Update the index URLs in bugbug

* Split the http service Docker image in two

This way we can both:
- Build the first half (code + dependencies) in the usual CI.
- Build the second half at the end of the data pipeline with updated models.

Taskboot build-compose doesn't support building all services except a
specific one and it might be cumbersome to add this feature so move the second
half of the Docker image to a separate docker-compose file.
2019-05-02 17:00:32 +02:00
Boris Feld 6e7ca892cd Introduce a new Docker image for data-pipeline spawning (#320) 2019-05-02 14:36:50 +02:00
Boris Feld 706b316a9c Add basic HTTP service (#262)
* Add basic HTTP service

It doesn't support batch requests or authentication yet

* Add very basic authentication

* Update dockerignore in http-service
2019-04-10 17:57:07 +02:00
Boris Feld 6af6e8b927 Import Trainer class from release-services repository (#254)
* Import Trainer class from release-services repository

This basically import the `trainer.py` file from the `release-services`
repository at hash 77cdddd. I removed imports and reference to cli-common
helpers that will likely need to be reimplemented, like the raven support.

Also defines 4 docker images, one per model to train.

* Remove unused imports
2019-04-09 17:49:56 +02:00
Boris Feld b651744b18 Import retriever services and add Docker image definition (#251)
* Import Retriever class from release-services repository

This basically import the `retriever.py` file from the `release-services`
repository at hash 77cdddd. I removed imports and reference to cli-common
helpers that will likely needs to be reimplemented, like the raven support.

The next commit will defines some Dockerfiles that will use the imported file.

* Add docker image definition

Build three Docker image, one is for bugbug itself. It is just installing
bugbug and its dependencies.

One is for retrieving information from the mozilla-central Mercurial
repository, it depends on the first one and install the right Mercurial
version.

The last one is for retrieving information from the Bugzilla instance, it
depends in the first one and needs a valid Bugzilla token.

* Separate the two tasks into separate script files

They share almost no code at all so they don't need to be in the same file

* Apply Black on the scripts to makes Flake8 happy
2019-04-09 16:30:09 +02:00