abstrekt
ecbf8c9ca2
Add main function to scripts/regressor_finder.py ( #940 )
...
scripts/regressor_finder.py was missed by 6e1479b
This finally fixes #831
2019-09-09 23:00:17 +02:00
Anurag Aggarwal
f222110409
Clone repositories in parallel ( #849 )
...
Fixes #826
2019-08-07 00:55:45 +02:00
Marco Castelluccio
1bc24bdd20
Store results in the DB as soon as they are ready
2019-08-05 13:14:05 +02:00
Marco Castelluccio
bff23bfbc3
Store GitRepository instance in thread-local storage
2019-08-05 13:13:30 +02:00
Lana Honcharuk
7a7d1186bd
Move zstandard compression and decompression into utils module ( #850 )
2019-08-04 15:29:06 +02:00
Marco Castelluccio
f045e235d4
Revert "Disable multithreaded regressor finder temporarily"
...
This reverts commit 9b11016b3e
.
2019-08-03 00:48:29 +02:00
Marco Castelluccio
9b11016b3e
Disable multithreaded regressor finder temporarily
...
Until we figure out why it's hanging on Taskcluster.
2019-08-02 17:03:02 +02:00
Marco Castelluccio
6228149774
Be more verbose when finding regressor
2019-08-02 15:27:49 +02:00
Marco Castelluccio
b4c4bce97f
Log mercurial revision instead of git one
2019-08-02 12:58:03 +02:00
Marco Castelluccio
94045bac86
Analyze more recent commits first
2019-08-01 19:44:45 +02:00
Marco Castelluccio
c9d4cbe595
Analyze up to 500 commits at a time to find regressors
2019-08-01 19:43:57 +02:00
calixteman
c6367f0809
Swap experience calculation and transform ( #820 )
...
This way we can stop parsing the modified files when getting the log.
Fixes #422 .
2019-08-01 17:06:50 +02:00
Marco Castelluccio
0b7de40a19
Only show stats about analyzed bug fixing commits
2019-07-30 22:31:14 +02:00
Marco Castelluccio
29ed2b4ed8
Run SZZ on up to 1000 commits at a time, to avoid the task running out of time
2019-07-29 18:22:11 +02:00
Marco Castelluccio
a0997cb17a
Find regressors in a timespan of 6 months for now
...
Depending on how long it takes, we can implement progressive
regressor finding or just do everything (2 years and 6 months)
in the same task.
2019-07-29 12:30:28 +02:00
Marco Castelluccio
0a5cf9d2f1
Skip already done commits only when there are some already done commits
2019-07-27 09:20:47 +02:00
Marco Castelluccio
d55ef055c6
Retrieve commits to ignore only until a given date
...
Needed because otherwise we could fail when trying to map
the mercurial hashes to git hashes.
2019-07-27 02:05:11 +02:00
Marco Castelluccio
73633e2e00
Make DB version handling saner to use
...
Fixing a problem where the retriever tasks would always write a version file
containing the version of the old DB.
2019-07-26 23:37:46 +02:00
Marco Castelluccio
60a979be9d
Store commits to ignore in a bugbug DB and generate them progressively
...
In the future, we will be able to get commits to ignore directly from the normal commits DB
generated by bugbug/repositor.py.
2019-07-26 18:14:57 +02:00
Marco Castelluccio
633a2c4d3c
Ingore commits that are not in the VCS map
...
This means they are older than "Free the lizard" (3b56a9af51519d2e77e05efa672a13e6be2e9ebc).
2019-07-26 11:57:26 +02:00
Marco Castelluccio
c42b1cc7bb
Don't skip commits with no mirror in the tokenized repo when analyzing the normal repo
2019-07-25 18:53:23 +02:00
Marco Castelluccio
99263864fd
Revert "Increase version number of the DBs, as we changed the format"
...
This reverts commit cb5c96fd48
.
2019-07-25 18:09:13 +02:00
Marco Castelluccio
cb5c96fd48
Increase version number of the DBs, as we changed the format
2019-07-25 16:08:29 +02:00
Marco Castelluccio
f0da3b5b21
Clone tokenized git repository too
2019-07-25 10:38:06 +02:00
Marco Castelluccio
de212602fc
Add a method to evaluate the SZZ results comparing them with the regressed-by information
...
Fixes #772
2019-07-25 01:49:20 +02:00
Marco Castelluccio
2dc1a17a75
Remove outdated comment
2019-07-24 22:16:22 +02:00
Marco Castelluccio
697a5c8189
Only store mercurial revisions
...
Users of the resulting DBs will take care of converting to git if they need
2019-07-24 22:15:04 +02:00
Marco Castelluccio
22d73e3637
Apply regressor finder also on the microannotated repository with comments removed
...
Fixes #627
2019-07-24 22:15:04 +02:00
Marco Castelluccio
839ebf8fcf
Make git repo URL a parameter, so we can find regressors using different git repositories
2019-07-24 21:01:53 +02:00
Marco Castelluccio
f972646819
Introduce a RegressorFinder class, and ignore mercurial<->git mapping errors
2019-07-24 21:01:53 +02:00
Marco Castelluccio
fbaef0661d
Store regressor finder results in bugbug DBs and make it run only on commits which haven't been analyzed yet
2019-07-23 02:14:22 +02:00
Marco Castelluccio
413da2b87e
More logging in regressor finder script
2019-07-22 23:20:01 +02:00
Marco Castelluccio
9fd44dd19f
Use os.cpu_count() instead of multiprocessing.cpu_count()
2019-07-22 23:20:01 +02:00
Marco Castelluccio
8151b85873
Add missing f in f-string
2019-07-22 15:54:57 +02:00
Marco
77ec8b529d
Add a WIP script to find bug-introducing commits ( #748 )
...
* Install depot_tools in the commit retrieval image
* Add a WIP script to find bug-introducing commits
* Add a task which runs the bug-introducing commits finder script
2019-07-22 14:41:34 +02:00