Граф коммитов

588 Коммитов

Автор SHA1 Сообщение Дата
Ayush Shridhar 3000a974e6 Move Contributing section to CONTRIBUTING.md (#406)
Also add docs about bug type classifier to README.md
2019-05-14 15:51:31 +02:00
Marco 9223954520
Remove training tasks' unneeded dependencies on commit retrieval task (#407)
Fixes #390
2019-05-14 15:22:44 +02:00
Ayush Shridhar add9a937b3 Multilabel classifier for detecting type of bug (#395) 2019-05-14 12:17:53 +02:00
Marco 97d514e1db
Collect size of files modified by the patch (#394)
This can be considered as one of the complexity measures (see #372)
2019-05-14 11:42:13 +02:00
Marco ed992d57ec
Initial tests for repository module (#377)
* Test repository.get_revs function

* Test repository.get_directories function

* Split _hg_log function called by ProcessPoolExecutor in two to make it more easily testable

* Add logging when downloading file->component mapping

* Move the experience calculation code in a separate function

* Don't break experiences when there are days without commits

* Add tests for repository.calculate_experiences

Fixes #382

*  When a commit changes multiple files in the same component, don't overcount the experience

The commit itself was being considered as a previous commit touching the same components
2019-05-14 11:30:56 +02:00
pyup.io bot f6082dbb57 Update tqdm from 4.32.0 to 4.32.1 (#402) 2019-05-14 01:40:38 +02:00
Marco Castelluccio 2f5c47af1e Store pushdate too in commit DB 2019-05-14 01:28:26 +02:00
Marco Castelluccio 1edfa395cf Add links to good-first-bug and reserved-for-beginners issues 2019-05-14 01:28:16 +02:00
pyup.io bot b67d7b2381 Update tqdm from 4.31.1 to 4.32.0 (#401) 2019-05-14 00:29:49 +02:00
Marco Castelluccio be84bff898 Change good-first-bug label name in README 2019-05-13 17:40:37 +02:00
Marco c4bd01278e
Add 'expires' to all tasks to avoid them expiring in a too long time (#393)
Fixes #391.
2019-05-12 21:46:58 +02:00
pyup.io bot 6f7c5b4cde Update spacy from 2.1.3 to 2.1.4 (#396) 2019-05-12 12:17:30 +02:00
pyup.io bot a4ebec8c59 Update pytest from 4.4.2 to 4.5.0 (#397) 2019-05-12 12:17:13 +02:00
Marco e3230ca999
Increase deadline of data pipeline tasks (#389)
Fixes #388.
2019-05-10 16:12:46 +02:00
Boris Feld f4b2b938be
Add basic check method and check script (#341)
* Add basic check method and check script

* Ensure the check of component will correctly use super result

* Add required infra to schedule model checks

* Add scheduling bits for the model checks

* Remove the filtering on classification

* Extract counting bugs to a new function in bugzilla.py

* Also checks conflated components

* Fix new hook id

* Call bugzilla with the count_only param to speed up the check

* Fix the new hook scope to match the hook id

* Fix component model check after previous refactoring

* Fix component model check method

* Use a bugzilla report for even faster component model check

* Clarify get_product_component_count docstring

We are already filtering out full component with 0 bugs

* Update conflated components mapping check

A conflated component could also be part of the conflated components mapping

* Distinguish between non-existing full components and empty full components

* Remove the filter on resolution and unnecessary url params

* Update component check method

Keep checks as separate as possible for clarity, we could merge them or makes
them faster later

* Generate dynamically the CSV report url

* Fix Docker image name the hook

* Implement component check number 5

Get the meaningful components for the last 6 months

* Handle reviews comments

* Remove extraneous print

* Removes TODO

* Use a different threshold ration when checking for new meaningful components

As we are only checking new bugs for 6 months, adjust the threshold ration to
be less sensitive to occasional burst ob bugs for q given component.

* Reduce the threshold ratio

As we check on a disjoint time window, reduce the chance of false positives

* Handle review nits

* Fix last nits
2019-05-10 12:20:23 +02:00
pyup.io bot 9fa9ecec2c Update scikit-learn from 0.20.3 to 0.21.0 (#386) 2019-05-10 10:46:46 +02:00
Marco Castelluccio 0a6b9cee5d Version 0.0.23 2019-05-10 00:37:39 +02:00
Marco Castelluccio 2fd954d456 Retrieve commits from first to last, instead of from last to first
As we are relying on this order in the analysis
2019-05-10 00:20:17 +02:00
Marco 6f09488573
Rename mozilla/bugbug-train-defect image to mozilla/bugbug-train-defectenhancementtask (#375)
Fixes #364.
2019-05-09 23:36:38 +02:00
Marco Castelluccio 87bfdbbde7 Version 0.0.22 2019-05-09 22:57:54 +02:00
Marco Castelluccio ca2abf0c9b Subtracted value should be EXPERIENCE_TIMESPAN and not EXPERIENCE_TIMESPAN_TEXT 2019-05-09 22:57:34 +02:00
Marco Castelluccio d28b1be712 Version 0.0.21 2019-05-09 21:27:09 +02:00
Marco 32714305e5
Optimize history features calculation for commits (#354)
* Retrieve pushdate too for commits

* Retrieve all commits for calculating history features

Keep storing only recent commits' data in the DB

Fixes #333

* Optimize history feature calculation, and fix overcounting of number of previous commits touching the same files/directories/components

Fixes #336

* Use UTC date for author date too
2019-05-09 19:30:20 +02:00
Ayush Shridhar c440db7315 Use re.compile to speedup feature cleanups (#351)
Fixes #338.
2019-05-09 15:09:26 +02:00
Boris Feld 4954fe38d4 Change scheduling of the data pipeline to run once every two weeks (#361)
Fixes #356.
2019-05-09 11:44:53 +02:00
Boris Feld 8eb5bcab92 Make docker push depends on test tasks (#363) 2019-05-09 11:40:59 +02:00
Boris Feld f68535c055 Update the hook before triggering the route (#362) 2019-05-09 11:27:12 +02:00
Marco a779560d37
Remove versioning support, as we are not really using it (#359) 2019-05-09 11:12:30 +02:00
pyup.io bot 33f5bbf2c0 Update pytest from 4.4.1 to 4.4.2 (#358) 2019-05-08 23:37:05 +02:00
Marco Castelluccio afc8553b14 Switch from Dropbox URLs to Taskcluster URLs
Fixes #257.
Fixes #352, indirectly.
2019-05-08 23:20:08 +02:00
Marco Castelluccio 54a6cbc93f Don't print error message about inconsistency if the inconsistency is due to the 'enhancement' severity
It is far too common, there is no point in logging it.
2019-05-08 22:35:26 +02:00
Marco Castelluccio b77081d8d7 Ignore some keywords that don't exist anymore 2019-05-08 22:34:55 +02:00
Marco Castelluccio e5456aa246 Ignore checkin-needed and other keywords, only when the field is actually 'keywords' 2019-05-08 22:34:32 +02:00
Marco Castelluccio eb681372e8 Ignore sec-review too 2019-05-08 22:34:05 +02:00
Marco Castelluccio 0d80939e36 Ignore some flags that have been removed 2019-05-08 22:33:51 +02:00
Marco Castelluccio 649433ed0a Add more bugs to the exception lists for inconsistent fields and changes 2019-05-08 22:33:22 +02:00
Marco Castelluccio a3b04890b4 cf_due_date field is None when it is an empty string in the history 2019-05-08 22:32:30 +02:00
Marco Castelluccio d34c87d2e5 Add SeaMonkey product to the mapping 2019-05-08 22:31:52 +02:00
Marco Castelluccio df9aa0563b Add bug-quality keyword to the mapping 2019-05-08 22:31:36 +02:00
Marco Castelluccio c829d20d67 When rolling back boolean fields, consider an empy string as False 2019-05-08 22:31:10 +02:00
Marco Castelluccio 2997a21f6d When rolling back comments and attachments, allow a few second diff between bug opening time and first comment/attachment 2019-05-08 22:30:15 +02:00
pyup.io bot 2e88e8cea7 Update pre-commit from 1.16.0 to 1.16.1 (#357) 2019-05-08 18:38:39 +02:00
pyup.io bot ed7ddc93ab Update gensim from 3.7.2 to 3.7.3 (#353) 2019-05-08 11:23:17 +02:00
Marco Castelluccio e78a513935 Version 0.0.20 2019-05-07 13:26:40 +02:00
Marco Castelluccio c3f55e682a Rename train-defect to train-defectenhancementtask 2019-05-07 13:16:22 +02:00
pyup.io bot 080c4b7d64 Update pre-commit from 1.15.2 to 1.16.0 (#346) 2019-05-07 12:16:54 +02:00
Marco Castelluccio 09c969498e Version 0.0.19 2019-05-07 11:54:01 +02:00
Marco Castelluccio 17c3e06d94 Remove undefined dependency on rollback_test_task
Fixes #348
2019-05-07 11:53:20 +02:00
Marco Castelluccio 352b23c240 Add more defect/enhancement/task labels gathered from changed made by users on Bugzilla 2019-05-07 11:40:43 +02:00
Marco Castelluccio 2eaf90be20 Add a cache to the commit retrieval task
Fixes #347
2019-05-07 11:38:02 +02:00