Marco Castelluccio
6f489969bf
Version 0.0.58
2019-07-02 12:28:10 +02:00
Marco Castelluccio
96ebe8777a
Add a script to classify a patch
2019-07-02 12:21:35 +02:00
Marco Castelluccio
f9f59ef863
Add functions to clone and clean mozilla-central
2019-07-02 12:21:35 +02:00
Marco Castelluccio
8ef65a186e
Don't raise an exception when there are no commits to analyze
2019-07-02 12:21:35 +02:00
Marco Castelluccio
8081b722cf
Be more verbose when a day is too early
2019-07-02 12:21:35 +02:00
Marco Castelluccio
8961e22059
When the pushdate is not available because the commit is local only, use 'now' as the pushdate
2019-07-02 12:21:35 +02:00
Ayush Shridhar
1793c48c0b
Support using bigrams for NearestNeighbors similarity ( #654 )
2019-07-01 18:33:49 +02:00
Marco Castelluccio
80aa5b01b8
Generate feature importance force plot at classification time
2019-07-01 13:10:39 +02:00
Marco Castelluccio
d12a25f644
Upload feature visualization image as an artifact of the training tasks
2019-07-01 13:10:39 +02:00
Marco Castelluccio
b093164b3a
Generate feature importance summary plot after training
2019-07-01 13:10:39 +02:00
x249wang
30d19efb5b
Remove fallback on downloading xz files ( #644 )
2019-07-01 11:58:20 +02:00
Yun Seob Lee
f836f25291
Add basic test for DevDocNeeded model ( #661 )
Fixes #435
2019-07-01 11:46:52 +02:00
pyup.io bot
c187007c73
Update pytest from 4.6.3 to 5.0.0 ( #660 )
2019-07-01 11:45:39 +02:00
Marco Castelluccio
ed8da961f1
Rollback comment edits too
Fixes #650
2019-06-29 01:34:21 +02:00
Marco Castelluccio
270335bb27
Ignore comment tag changes as we don't store comment tags in bug data yet
2019-06-29 01:34:21 +02:00
Boris Feld
7459f79317
Use the base image for training models ( #656 )
Fixes #350
2019-06-29 00:01:51 +02:00
Boris Feld
28e1daf526
Avoid putting the version in the env ( #655 )
Fixes #570
2019-06-28 23:55:28 +02:00
pyup.io bot
7a2fbe5ac9
Update imbalanced-learn from 0.4.3 to 0.5.0 ( #657 )
2019-06-28 23:53:02 +02:00
Boris Feld
d24993d0ac
Remove dependency on rollbacktest in docker build. ( #653 )
Fixes #651
2019-06-28 15:32:39 +02:00
Boris Feld
2977a8af89
Start tracking the decision task of the data pipeline ( #649 )
This way we can automate the stop of a running data pipeline during the
rollback process.
2019-06-28 15:32:04 +02:00
Marco Castelluccio
7ffb419d82
Version 0.0.57
2019-06-28 11:24:54 +02:00
Boris Feld
54e41d1497
Use taskboot 0.1.8 ( #645 )
The new taskboot release solves the double build on non-tag commits and
allows the heroku deploy to be fully atomic.
2019-06-28 11:11:48 +02:00
Boris Feld
f999b3ffdf
Track imbalance report metrics too ( #639 )
Fixes #619
2019-06-27 18:42:51 +02:00
Boris Feld
2b48e153cb
Use tagged version in bugbug hooks ( #640 )
Fixes #634
2019-06-27 15:47:57 +02:00
Boris Feld
89bba8efca
Move log messages to stderr ( #635 )
As the retrieve script can output the metrics on the standard output, log
messages would pollute the output and complicate scripts that would want to
parse it. Use logging instead of passing stderr to the print statements as
it's mostly the same amount of code.
2019-06-27 10:58:07 +02:00
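The reasoning in the commit above (keep standard output clean for the metrics, send log messages elsewhere) can be illustrated with a minimal sketch. It is not the project's actual code: the logger name, the `configure_logging` and `report_metrics` helpers, and the metric names are all hypothetical. It relies only on the standard-library fact that `logging.StreamHandler` writes to `sys.stderr` when no stream is given.

```python
import json
import logging

# Hypothetical logger name, chosen for this illustration only.
logger = logging.getLogger("retrieve_sketch")


def configure_logging(stream=None):
    """Attach a handler; with stream=None, StreamHandler writes to sys.stderr."""
    handler = logging.StreamHandler(stream)
    handler.setFormatter(logging.Formatter("%(levelname)s:%(name)s:%(message)s"))
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)


def report_metrics(metrics):
    """Log progress through the logger and print only JSON on stdout."""
    logger.info("Retrieved %d metrics", len(metrics))
    # stdout carries nothing but the machine-readable payload.
    print(json.dumps(metrics, sort_keys=True))


if __name__ == "__main__":
    configure_logging()
    report_metrics({"accuracy": 0.9})  # hypothetical metric
```

With this split, a caller can pipe standard output straight into a JSON parser while the log messages still reach the terminal through standard error.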
Marco Castelluccio
53ffd31a96
Add an option to avoid storing the commits data in the DB
2019-06-27 10:37:28 +02:00
Marco Castelluccio
16ece06f64
Retry git operations multiple times
2019-06-27 10:25:38 +02:00
Marco Castelluccio
a3933a48a4
Don't fail if there's an error while pulling from the repo
2019-06-27 10:25:38 +02:00
Marco Castelluccio
647504ac98
Version 0.0.56
2019-06-27 01:26:29 +02:00
Marco Castelluccio
eb8f847d43
Install 'git' package in the commit_retrieval image
Even though it's only required for the annotate pipeline.
2019-06-27 01:25:58 +02:00
Marco Castelluccio
f65bf19ab8
Disable pip cache when installing Mercurial
2019-06-27 01:25:22 +02:00
Marco Castelluccio
afac5cbdb6
Remove apt lists after installing Python 2
2019-06-27 01:24:59 +02:00
Marco Castelluccio
ad67d76992
Use bugbug-commit-retrieval image for the microannotate-generate task, as it needs Mercurial to be installed
2019-06-27 01:14:32 +02:00
Marco Castelluccio
4a2759e9b0
Add TC_SECRET_ID env variable to microannotate-generate task
2019-06-27 01:13:51 +02:00
Marco Castelluccio
8fd3be99ce
Command must be split in two lines
2019-06-26 22:28:40 +02:00
Marco Castelluccio
1d3647ba78
Version 0.0.55
2019-06-26 21:50:13 +02:00
Marco Castelluccio
512e54c95b
Add more defect/enhancement/task labels gathered from changes made by users on Bugzilla
2019-06-26 21:49:59 +02:00
Marco Castelluccio
b969a4576c
Fall back on xz URL for Last-Modified if the zst URL is not available
2019-06-26 21:46:40 +02:00
Marco Castelluccio
252e29398d
command should be an array
2019-06-26 21:01:31 +02:00
Marco Castelluccio
b177cfad6e
Version 0.0.54
2019-06-26 20:06:11 +02:00
Marco Castelluccio
a05096541d
Don't try to extract files that are not compressed
Or we will fail downloading the DB version.
Also add a test for download_version and is_old_version.
2019-06-26 19:59:30 +02:00
Marco Castelluccio
4b56be37a9
Add missing secrets scope to annotate hook
2019-06-26 19:45:44 +02:00
Marco Castelluccio
b01eeafe44
Add annotate-pipeline.yml file to the spawn_pipeline Docker image
2019-06-26 19:45:22 +02:00
Marco Castelluccio
ea7a009450
Fix schedule value
2019-06-26 19:40:19 +02:00
Marco Castelluccio
76b27b8550
Version 0.0.53
2019-06-26 19:17:03 +02:00
Marco Castelluccio
56f224b9dc
Generate microannotate repository for mozilla-central
2019-06-26 18:57:36 +02:00
Marco Castelluccio
4c20645e04
Install required json-e and pyyaml requirements for tests
2019-06-26 18:43:15 +02:00
Marco Castelluccio
bcf405c747
Ensure pipeline yaml files are valid YAML and can be rendered with JSON-e
Part of #613
2019-06-26 18:35:46 +02:00
Ayush Shridhar
6788b2e33a
Make similarity script more generic and add nearest neighbors similarity with tf-idf encoding ( #628 )
2019-06-26 13:42:23 +02:00
Marco Castelluccio
bd118c58ab
Use with statement for hg.open
2019-06-26 11:45:02 +02:00