Граф коммитов

35 Коммитов

Автор SHA1 Сообщение Дата
abstrekt ecbf8c9ca2 Add main function to scripts/regressor_finder.py (#940)
scripts/regressor_finder.py was missed by 6e1479b

This finally fixes #831
2019-09-09 23:00:17 +02:00
Anurag Aggarwal f222110409 Clone repositories in parallel (#849)
Fixes #826
2019-08-07 00:55:45 +02:00
Marco Castelluccio 1bc24bdd20 Store results in the DB as soon as they are ready 2019-08-05 13:14:05 +02:00
Marco Castelluccio bff23bfbc3 Store GitRepository instance in thread-local storage 2019-08-05 13:13:30 +02:00
Lana Honcharuk 7a7d1186bd Move zstandard compression and decompression into utils module (#850) 2019-08-04 15:29:06 +02:00
Marco Castelluccio f045e235d4 Revert "Disable multithreaded regressor finder temporarily"
This reverts commit 9b11016b3e.
2019-08-03 00:48:29 +02:00
Marco Castelluccio 9b11016b3e Disable multithreaded regressor finder temporarily
Until we figure out why it's hanging on Taskcluster.
2019-08-02 17:03:02 +02:00
Marco Castelluccio 6228149774 Be more verbose when finding regressor 2019-08-02 15:27:49 +02:00
Marco Castelluccio b4c4bce97f Log mercurial revision instead of git one 2019-08-02 12:58:03 +02:00
Marco Castelluccio 94045bac86 Analyze more recent commits first 2019-08-01 19:44:45 +02:00
Marco Castelluccio c9d4cbe595 Analyze up to 500 commits at a time to find regressors 2019-08-01 19:43:57 +02:00
calixteman c6367f0809 Swap experience calculation and transform (#820)
This way we can stop parsing the modified files when getting the log.

Fixes #422.
2019-08-01 17:06:50 +02:00
Marco Castelluccio 0b7de40a19 Only show stats about analyzed bug fixing commits 2019-07-30 22:31:14 +02:00
Marco Castelluccio 29ed2b4ed8 Run SZZ on up to 1000 commits at a time, to avoid the task running out of time 2019-07-29 18:22:11 +02:00
Marco Castelluccio a0997cb17a Find regressors in a timespan of 6 months for now
Depending on how long it takes, we can implement progressive
regressor finding or just do everything (2 years and 6 months)
in the same task.
2019-07-29 12:30:28 +02:00
Marco Castelluccio 0a5cf9d2f1 Skip already done commits only when there are some already done commits 2019-07-27 09:20:47 +02:00
Marco Castelluccio d55ef055c6 Retrieve commits to ignore only until a given date
Needed because otherwise we could fail when trying to map
the mercurial hashes to git hashes.
2019-07-27 02:05:11 +02:00
Marco Castelluccio 73633e2e00 Make DB version handling saner to use
Fixing a problem where the retriever tasks would always write a version file
containing the version of the old DB.
2019-07-26 23:37:46 +02:00
Marco Castelluccio 60a979be9d Store commits to ignore in a bugbug DB and generate them progressively
In the future, we will be able to get commits to ignore directly from the normal commits DB
generated by bugbug/repositor.py.
2019-07-26 18:14:57 +02:00
Marco Castelluccio 633a2c4d3c Ingore commits that are not in the VCS map
This means they are older than "Free the lizard" (3b56a9af51519d2e77e05efa672a13e6be2e9ebc).
2019-07-26 11:57:26 +02:00
Marco Castelluccio c42b1cc7bb Don't skip commits with no mirror in the tokenized repo when analyzing the normal repo 2019-07-25 18:53:23 +02:00
Marco Castelluccio 99263864fd Revert "Increase version number of the DBs, as we changed the format"
This reverts commit cb5c96fd48.
2019-07-25 18:09:13 +02:00
Marco Castelluccio cb5c96fd48 Increase version number of the DBs, as we changed the format 2019-07-25 16:08:29 +02:00
Marco Castelluccio f0da3b5b21 Clone tokenized git repository too 2019-07-25 10:38:06 +02:00
Marco Castelluccio de212602fc Add a method to evaluate the SZZ results comparing them with the regressed-by information
Fixes #772
2019-07-25 01:49:20 +02:00
Marco Castelluccio 2dc1a17a75 Remove outdated comment 2019-07-24 22:16:22 +02:00
Marco Castelluccio 697a5c8189 Only store mercurial revisions
Users of the resulting DBs will take care of converting to git if they need
2019-07-24 22:15:04 +02:00
Marco Castelluccio 22d73e3637 Apply regressor finder also on the microannotated repository with comments removed
Fixes #627
2019-07-24 22:15:04 +02:00
Marco Castelluccio 839ebf8fcf Make git repo URL a parameter, so we can find regressors using different git repositories 2019-07-24 21:01:53 +02:00
Marco Castelluccio f972646819 Introduce a RegressorFinder class, and ignore mercurial<->git mapping errors 2019-07-24 21:01:53 +02:00
Marco Castelluccio fbaef0661d Store regressor finder results in bugbug DBs and make it run only on commits which haven't been analyzed yet 2019-07-23 02:14:22 +02:00
Marco Castelluccio 413da2b87e More logging in regressor finder script 2019-07-22 23:20:01 +02:00
Marco Castelluccio 9fd44dd19f Use os.cpu_count() instead of multiprocessing.cpu_count() 2019-07-22 23:20:01 +02:00
Marco Castelluccio 8151b85873 Add missing f in f-string 2019-07-22 15:54:57 +02:00
Marco 77ec8b529d
Add a WIP script to find bug-introducing commits (#748)
* Install depot_tools in the commit retrieval image

* Add a WIP script to find bug-introducing commits

* Add a task which runs the bug-introducing commits finder script
2019-07-22 14:41:34 +02:00