Marco Castelluccio
80e4baea23
Specify which version of Mercurial is supported, pointing to the Docker file
...
We will likely forget to update it if we specify the version number, so
it's better to point to the commit-retriever Docker file instead.
Refs #812
2019-07-31 16:11:19 +02:00
Marco Castelluccio
d51de39579
Specify where databases are automatically downloaded
...
Refs #812
2019-07-31 16:08:30 +02:00
Marco Castelluccio
7e5e4a8e02
Explain that some requirements might be in other requirement files
...
Refs #812
2019-07-31 16:07:51 +02:00
Marco Castelluccio
b2db1fd018
Perform the python->python2 modification in the git hyper-blame script directly as part of the patch
2019-07-31 11:40:37 +02:00
Marco Castelluccio
0b7de40a19
Only show stats about analyzed bug fixing commits
2019-07-30 22:31:14 +02:00
Marco Castelluccio
bee400866a
Log which transformation step we are in
2019-07-30 17:36:14 +02:00
Ayush Shridhar
c81accf493
Word2Vec with soft-cosine similarity ( #810 )
2019-07-30 15:59:50 +02:00
cklyyung
5b0b403875
Remove linting task comment about only running on modified files ( #791 )
...
Given that the time savings are negligible.
Closes #298.
2019-07-30 14:18:30 +02:00
Marco Castelluccio
63dddccb91
Version 0.0.87
2019-07-30 11:15:46 +02:00
dependabot-preview[bot]
d75aeeac4e
Bump microannotate from 0.0.10 to 0.0.11 ( #809 )
...
Bump microannotate from 0.0.10 to 0.0.11.
2019-07-30 11:14:25 +02:00
dependabot-preview[bot]
67a6840c8f
Bump scikit-learn from 0.21.2 to 0.21.3 ( #808 )
...
Bump [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 0.21.2 to 0.21.3.
- [Release notes](https://github.com/scikit-learn/scikit-learn/releases)
- [Commits](https://github.com/scikit-learn/scikit-learn/compare/0.21.2...0.21.3)
2019-07-30 10:00:53 +02:00
Marco Castelluccio
37c93121b0
Number of commits to transform should be an integer
2019-07-30 09:53:35 +02:00
Marco
d8155c0028
Use revision title when the commit message is missing from the diff commit ( #803 )
2019-07-30 09:51:05 +02:00
Marco Castelluccio
61b77e4085
Version 0.0.86
2019-07-29 18:30:44 +02:00
Marco Castelluccio
29ed2b4ed8
Run SZZ on up to 1000 commits at a time, to avoid the task running out of time
2019-07-29 18:22:11 +02:00
dependabot-preview[bot]
a8a80b48db
Bump coverage from 4.5.3 to 4.5.4 ( #807 )
...
Bump [coverage](https://github.com/nedbat/coveragepy) from 4.5.3 to 4.5.4.
- [Release notes](https://github.com/nedbat/coveragepy/releases)
- [Changelog](https://github.com/nedbat/coveragepy/blob/master/CHANGES.rst)
- [Commits](https://github.com/nedbat/coveragepy/compare/coverage-4.5.3...coverage-4.5.4)
2019-07-29 18:16:28 +02:00
Marco Castelluccio
8a7840aa0f
Push commits more quickly to the mirror repositories to avoid wasted work
2019-07-29 17:37:13 +02:00
Ayush Shridhar
7afccd64b9
Raise NotImplementedError in get_distance methods of Similarity models that don't implement it yet ( #805 )
2019-07-29 15:07:36 +02:00
Ayush Shridhar
88b754261e
Add an option to use nltk's tokenizer instead of plain split() ( #718 )
2019-07-29 14:13:13 +02:00
Marco Castelluccio
45960125c3
Version 0.0.85
2019-07-29 12:40:04 +02:00
Marco Castelluccio
a0997cb17a
Find regressors in a timespan of 6 months for now
...
Depending on how long it takes, we can implement progressive
regressor finding or just do everything (2 years and 6 months)
in the same task.
2019-07-29 12:30:28 +02:00
Ayush Shridhar
84687c8f4a
Add a method to calculate the distance between two bugs ( #798 )
2019-07-29 12:07:32 +02:00
Boris Feld
afd67402e2
Fix copy-paste typo with the new indexing schema ( #801 )
2019-07-28 20:38:05 +02:00
Marco Castelluccio
40d4296900
Version 0.0.84
2019-07-27 09:21:16 +02:00
Marco Castelluccio
0a5cf9d2f1
Skip already done commits only when there are some already done commits
2019-07-27 09:20:47 +02:00
Marco Castelluccio
edd3180e20
Version 0.0.83
2019-07-27 02:09:05 +02:00
Marco Castelluccio
d55ef055c6
Retrieve commits to ignore only until a given date
...
Needed because otherwise we could fail when trying to map
the mercurial hashes to git hashes.
2019-07-27 02:05:11 +02:00
Marco Castelluccio
4e39a619ed
Consider remote DB as old if the remote version file returns any HTTP error
2019-07-27 02:05:11 +02:00
Marco Castelluccio
3747171716
Consider remote DB old if there is no version file yet
2019-07-27 02:05:11 +02:00
Marco Castelluccio
785aeca537
Support getting revisions before a given revision
2019-07-27 02:05:11 +02:00
Marco Castelluccio
aebbdb9e43
Create DB's parent directory without checking if it exists first
2019-07-26 23:37:46 +02:00
Marco Castelluccio
059589d3b9
Switch back to version 1 for the commits DB
...
We weren't writing the version correctly and we were just rerunning
everything every time, so we can just use version 1 now.
The bug was fixed by 2050cc09d99f1488a57494125a16acbfab7e44c2.
2019-07-26 23:37:46 +02:00
Marco Castelluccio
73633e2e00
Make DB version handling saner to use
...
Fixing a problem where the retriever tasks would always write a version file
containing the version of the old DB.
2019-07-26 23:37:46 +02:00
dependabot-preview[bot]
c1129bfcea
Bump numpy from 1.16.4 to 1.17.0 ( #796 )
...
Bump [numpy](https://github.com/numpy/numpy) from 1.16.4 to 1.17.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/master/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.16.4...v1.17.0)
2019-07-26 22:23:37 +02:00
Marco Castelluccio
ecfbed86d2
Version 0.0.82
2019-07-26 18:31:17 +02:00
Boris Feld
a43ad03b2a
Add a new indexing schema for training tasks ( #795 )
...
In order to efficiently solve #614, we need a new indexing schema
so getting all metrics following a given date is easy.
2019-07-26 18:28:04 +02:00
Marco Castelluccio
60a979be9d
Store commits to ignore in a bugbug DB and generate them progressively
...
In the future, we will be able to get commits to ignore directly from the normal commits DB
generated by bugbug/repository.py.
2019-07-26 18:14:57 +02:00
Marco Castelluccio
4b6dc08b15
Use hglib context manager when possible
2019-07-26 18:13:54 +02:00
Boris Feld
b184105453
Clean cached predictions when a bug gets updated ( #786 )
...
Fixes #765
2019-07-26 16:47:54 +02:00
Marco Castelluccio
38db5e19dc
Fix regressor-finder artifact paths
2019-07-26 16:09:50 +02:00
Marco Castelluccio
15fe853f58
Remove unneeded parentheses
2019-07-26 15:23:14 +02:00
Marco Castelluccio
1511288156
Version 0.0.81
2019-07-26 14:36:29 +02:00
Marco Castelluccio
19e258c0d0
Rollback 'question' flags too when possible
2019-07-26 14:27:39 +02:00
Marco Castelluccio
cbdabff3ad
Add a few more known inconsistencies
2019-07-26 13:02:12 +02:00
Marco Castelluccio
21705f049d
Add other mappings between old and new keywords, platforms, and products
2019-07-26 13:01:38 +02:00
Marco Castelluccio
633a2c4d3c
Ignore commits that are not in the VCS map
...
This means they are older than "Free the lizard" (3b56a9af51519d2e77e05efa672a13e6be2e9ebc).
2019-07-26 11:57:26 +02:00
Marco Castelluccio
62990ec235
Version 0.0.80
2019-07-26 00:54:31 +02:00
Harshit chittora
76df9a734c
Fix labels in confusion matrix for multi-label models ( #787 )
2019-07-26 00:07:25 +02:00
Marco Castelluccio
964250c01c
Update the version of the commits DB, so it gets regenerated
...
After the fix from adb39df6bb.
Fixes #785
2019-07-25 23:57:57 +02:00
Marco Castelluccio
adb39df6bb
Switch to a version-control-tools version fixing an error with the 'bug' mercurial template
2019-07-25 23:56:38 +02:00