Граф коммитов

14 Коммитов

Автор SHA1 Сообщение Дата
Marco 025e3f4da2
Fix command to train defect/enhancement/task model (#476)
* Fix command to train defect/enhancement/task model

Fixes #475

* Add more logging in the trainer script, and assert the model is generated
2019-05-21 14:46:57 +02:00
Boris Feld dd00d7b9ec Add the support for downloading the model before checking it (#452)
Also put the right configuration in the check pipeline
2019-05-17 11:45:42 +02:00
Boris Feld 0a5e37439d Add a central place where the models are defined (#398)
* Add a central place where the models are defined

Also add some helpers to load a model.

* Add missing tensorflow dependency in extra-nn-requirements.txt
2019-05-16 15:34:38 +02:00
Marco 2d249793e2
Try regenerating the pushlog using pull and update (#444) 2019-05-16 11:33:14 +02:00
Marco 9223954520
Remove training tasks' unneeded dependencies on commit retrieval task (#407)
Fixes #390
2019-05-14 15:22:44 +02:00
Boris Feld f4b2b938be
Add basic check method and check script (#341)
* Add basic check method and check script

* Ensure the check of component will correctly use super result

* Add required infra to schedule model checks

* Add scheduling bits for the model checks

* Remove the filtering on classification

* Extract counting bugs to a new function in bugzilla.py

* Also checks conflated components

* Fix new hook id

* Call bugzilla with the count_only param to speed up the check

* Fix the new hook scope to match the hook id

* Fix component model check after previous refactoring

* Fix component model check method

* Use a bugzilla report for even faster component model check

* Clarify get_product_component_count docstring

We are already filtering out full component with 0 bugs

* Update conflated components mapping check

A conflated component could also be part of the conflated components mapping

* Distinguish between non-existing full components and empty full components

* Remove the filter on resolution and unnecessary url params

* Update component check method

Keep checks as separate as possible for clarity, we could merge them or makes
them faster later

* Generate dynamically the CSV report url

* Fix Docker image name the hook

* Implement component check number 5

Get the meaningful components for the last 6 months

* Handle reviews comments

* Remove extraneous print

* Removes TODO

* Use a different threshold ration when checking for new meaningful components

As we are only checking new bugs for 6 months, adjust the threshold ration to
be less sensitive to occasional burst ob bugs for q given component.

* Reduce the threshold ratio

As we check on a disjoint time window, reduce the chance of false positives

* Handle review nits

* Fix last nits
2019-05-10 12:20:23 +02:00
Boris Feld 369b44ea02 Update the index URLs in bugbug (#328)
* Update the index URLs in bugbug

* Split the http service Docker image in two

This way we can both:
- Build the first half (code + dependencies) in the usual CI.
- Build the second half at the end of the data pipeline with updated models.

Taskboot build-compose doesn't support building all services except a
specific one and it might be cumbersome to add this feature so move the second
half of the Docker image to a separate docker-compose file.
2019-05-02 17:00:32 +02:00
Marco Castelluccio 3105acef95 Add script to gather defect/enhancement/task labels 2019-04-24 14:15:40 +02:00
Boris Feld 4b55b7f4f3 Add support to get secrets from taskcluster (#294) 2019-04-19 16:49:07 +02:00
Boris Feld 6af6e8b927 Import Trainer class from release-services repository (#254)
* Import Trainer class from release-services repository

This basically import the `trainer.py` file from the `release-services`
repository at hash 77cdddd. I removed imports and reference to cli-common
helpers that will likely need to be reimplemented, like the raven support.

Also defines 4 docker images, one per model to train.

* Remove unused imports
2019-04-09 17:49:56 +02:00
Boris Feld b651744b18 Import retriever services and add Docker image definition (#251)
* Import Retriever class from release-services repository

This basically import the `retriever.py` file from the `release-services`
repository at hash 77cdddd. I removed imports and reference to cli-common
helpers that will likely needs to be reimplemented, like the raven support.

The next commit will defines some Dockerfiles that will use the imported file.

* Add docker image definition

Build three Docker image, one is for bugbug itself. It is just installing
bugbug and its dependencies.

One is for retrieving information from the mozilla-central Mercurial
repository, it depends on the first one and install the right Mercurial
version.

The last one is for retrieving information from the Bugzilla instance, it
depends in the first one and needs a valid Bugzilla token.

* Separate the two tasks into separate script files

They share almost no code at all so they don't need to be in the same file

* Apply Black on the scripts to makes Flake8 happy
2019-04-09 16:30:09 +02:00
Boris Feld bad6a50d8b Pre commit setup (#252)
* Add pre-commit configuration

Add auto-formatting configuration using the https://pre-commit.com/ project.
Having auto-formatting setup and automatically enforced helps speeding up
development and review process.

* Apply the auto-formatting on all files in the repository

* Removes flake8-quotes as it conflicts with Black formatting

* Disable some Flake8 rules

Disable Flake8 rules that are handled by Black. The list comes from
https://github.com/ambv/black/issues/429#issuecomment-472687803.
2019-04-09 15:57:29 +02:00
Marco Castelluccio 41f1aa3b1e Calculate important components based on their past occurrences rather than having a hardcoded list
Fixes #220
2019-03-18 20:18:25 +01:00
John Giannelos d29621b84d Add script to compute success rate for component models (#190) 2019-02-26 15:16:39 +01:00