Commit graph

1442 commits

Author SHA1 Message Date
Marco Castelluccio 80e4baea23 Specify which version of Mercurial is supported, pointing to the Docker file
We will likely forget to update it if we specify the version number, so
it's better to point to the commit-retriever Docker file instead.

Refs #812
2019-07-31 16:11:19 +02:00
Marco Castelluccio d51de39579 Specify where databases are automatically downloaded
Refs #812
2019-07-31 16:08:30 +02:00
Marco Castelluccio 7e5e4a8e02 Explain that some requirements might be in other requirement files
Refs #812
2019-07-31 16:07:51 +02:00
Marco Castelluccio b2db1fd018 Perform the python->python2 modification in the git hyper-blame script directly as part of the patch 2019-07-31 11:40:37 +02:00
Marco Castelluccio 0b7de40a19 Only show stats about analyzed bug fixing commits 2019-07-30 22:31:14 +02:00
Marco Castelluccio bee400866a Log in which transformation step we are in 2019-07-30 17:36:14 +02:00
Ayush Shridhar c81accf493 Word2Vec with soft-cosine similarity (#810) 2019-07-30 15:59:50 +02:00
cklyyung 5b0b403875 Remove linting task comment about only running on modified files (#791)
Given that the time savings are negligible.

Closes #298.
2019-07-30 14:18:30 +02:00
Marco Castelluccio 63dddccb91 Version 0.0.87 2019-07-30 11:15:46 +02:00
dependabot-preview[bot] d75aeeac4e Bump microannotate from 0.0.10 to 0.0.11 (#809)
Bump microannotate from 0.0.10 to 0.0.11.
2019-07-30 11:14:25 +02:00
dependabot-preview[bot] 67a6840c8f Bump scikit-learn from 0.21.2 to 0.21.3 (#808)
Bump [scikit-learn](https://github.com/scikit-learn/scikit-learn) from 0.21.2 to 0.21.3.
- [Release notes](https://github.com/scikit-learn/scikit-learn/releases)
- [Commits](https://github.com/scikit-learn/scikit-learn/compare/0.21.2...0.21.3)
2019-07-30 10:00:53 +02:00
Marco Castelluccio 37c93121b0 Number of commits to transform should be an integer 2019-07-30 09:53:35 +02:00
Marco d8155c0028 Use revision title when the commit message is missing from the diff commit (#803) 2019-07-30 09:51:05 +02:00
Marco Castelluccio 61b77e4085 Version 0.0.86 2019-07-29 18:30:44 +02:00
Marco Castelluccio 29ed2b4ed8 Run SZZ on up to 1000 commits at a time, to avoid the task running out of time 2019-07-29 18:22:11 +02:00
dependabot-preview[bot] a8a80b48db Bump coverage from 4.5.3 to 4.5.4 (#807)
Bump [coverage](https://github.com/nedbat/coveragepy) from 4.5.3 to 4.5.4.
- [Release notes](https://github.com/nedbat/coveragepy/releases)
- [Changelog](https://github.com/nedbat/coveragepy/blob/master/CHANGES.rst)
- [Commits](https://github.com/nedbat/coveragepy/compare/coverage-4.5.3...coverage-4.5.4)
2019-07-29 18:16:28 +02:00
Marco Castelluccio 8a7840aa0f Push commits more quickly to the mirror repositories to avoid wasted work 2019-07-29 17:37:13 +02:00
Ayush Shridhar 7afccd64b9 Raise NotImplementedError in get_distance methods of Similarity models that don't implement it yet (#805) 2019-07-29 15:07:36 +02:00
Ayush Shridhar 88b754261e Add an option to use nltk's tokenizer instead of plain split() (#718) 2019-07-29 14:13:13 +02:00
Marco Castelluccio 45960125c3 Version 0.0.85 2019-07-29 12:40:04 +02:00
Marco Castelluccio a0997cb17a Find regressors in a timespan of 6 months for now
Depending on how long it takes, we can implement progressive
regressor finding or just do everything (2 years and 6 months)
in the same task.
2019-07-29 12:30:28 +02:00
Ayush Shridhar 84687c8f4a Add a method to calculate the distance between two bugs (#798) 2019-07-29 12:07:32 +02:00
Boris Feld afd67402e2 Fix copy-paste typo with the new indexing schema (#801) 2019-07-28 20:38:05 +02:00
Marco Castelluccio 40d4296900 Version 0.0.84 2019-07-27 09:21:16 +02:00
Marco Castelluccio 0a5cf9d2f1 Skip already done commits only when there are some already done commits 2019-07-27 09:20:47 +02:00
Marco Castelluccio edd3180e20 Version 0.0.83 2019-07-27 02:09:05 +02:00
Marco Castelluccio d55ef055c6 Retrieve commits to ignore only until a given date
Needed because otherwise we could fail when trying to map
the mercurial hashes to git hashes.
2019-07-27 02:05:11 +02:00
Marco Castelluccio 4e39a619ed Consider remote DB as old if the remote version file returns any HTTP error 2019-07-27 02:05:11 +02:00
Marco Castelluccio 3747171716 Consider remote DB old if there is no version file yet 2019-07-27 02:05:11 +02:00
Marco Castelluccio 785aeca537 Support getting revisions before a given revision 2019-07-27 02:05:11 +02:00
Marco Castelluccio aebbdb9e43 Create DB's parent directory without checking if it exists first 2019-07-26 23:37:46 +02:00
Marco Castelluccio 059589d3b9 Switch back to version 1 for the commits DB
We weren't writing the version correctly and we were just rerunning
everything every time, so we can just use version 1 now.
The bug was fixed by 2050cc09d99f1488a57494125a16acbfab7e44c2.
2019-07-26 23:37:46 +02:00
Marco Castelluccio 73633e2e00 Make DB version handling saner to use
Fixing a problem where the retriever tasks would always write a version file
containing the version of the old DB.
2019-07-26 23:37:46 +02:00
dependabot-preview[bot] c1129bfcea Bump numpy from 1.16.4 to 1.17.0 (#796)
Bump [numpy](https://github.com/numpy/numpy) from 1.16.4 to 1.17.0.
- [Release notes](https://github.com/numpy/numpy/releases)
- [Changelog](https://github.com/numpy/numpy/blob/master/doc/HOWTO_RELEASE.rst.txt)
- [Commits](https://github.com/numpy/numpy/compare/v1.16.4...v1.17.0)
2019-07-26 22:23:37 +02:00
Marco Castelluccio ecfbed86d2 Version 0.0.82 2019-07-26 18:31:17 +02:00
Boris Feld a43ad03b2a Add a new indexing schema for training tasks (#795)
In order to efficiently solve #614, we need a new indexing schema
so getting all metrics following a given date is easy.
2019-07-26 18:28:04 +02:00
Marco Castelluccio 60a979be9d Store commits to ignore in a bugbug DB and generate them progressively
In the future, we will be able to get commits to ignore directly from the normal commits DB
generated by bugbug/repository.py.
2019-07-26 18:14:57 +02:00
Marco Castelluccio 4b6dc08b15 Use hglib context manager when possible 2019-07-26 18:13:54 +02:00
Boris Feld b184105453 Clean cached predictions when a bug gets updated (#786)
Fixes #765
2019-07-26 16:47:54 +02:00
Marco Castelluccio 38db5e19dc Fix regressor-finder artifact paths 2019-07-26 16:09:50 +02:00
Marco Castelluccio 15fe853f58 Remove unneeded parentheses 2019-07-26 15:23:14 +02:00
Marco Castelluccio 1511288156 Version 0.0.81 2019-07-26 14:36:29 +02:00
Marco Castelluccio 19e258c0d0 Rollback 'question' flags too when possible 2019-07-26 14:27:39 +02:00
Marco Castelluccio cbdabff3ad Add a few more known inconsistencies 2019-07-26 13:02:12 +02:00
Marco Castelluccio 21705f049d Add other mappings between old and new keywords and platforms and products 2019-07-26 13:01:38 +02:00
Marco Castelluccio 633a2c4d3c Ignore commits that are not in the VCS map
This means they are older than "Free the lizard" (3b56a9af51519d2e77e05efa672a13e6be2e9ebc).
2019-07-26 11:57:26 +02:00
Marco Castelluccio 62990ec235 Version 0.0.80 2019-07-26 00:54:31 +02:00
Harshit chittora 76df9a734c Fix labels in confusion matrix for multi-label models (#787) 2019-07-26 00:07:25 +02:00
Marco Castelluccio 964250c01c Update the version of the commits DB, so it gets regenerated
After the fix from adb39df6bb.

Fixes #785
2019-07-25 23:57:57 +02:00
Marco Castelluccio adb39df6bb Switch to a version-control-tools version fixing an error with the 'bug' mercurial template 2019-07-25 23:56:38 +02:00