Platform for Machine Learning projects on Software Engineering
Marco Castelluccio b19eb5adde Load 'en_core_web_sm' so we don't need a symlink in the Spacy installation directory.
Fixes #48
2018-12-21 14:45:05 +01:00
bugbug Load 'en_core_web_sm' so we don't need a symlink in the Spacy installation directory. 2018-12-21 14:45:05 +01:00
tests Replace code reference URLs with a CODE_REFERENCE_URL token (#46) 2018-12-19 15:24:10 +01:00
.gitignore Ignore *.model files 2018-11-22 00:28:13 +01:00
.isort.cfg Enable several flake8 checkers 2018-09-21 16:45:04 +02:00
.travis.yml Download spaCy en model 2018-12-13 23:12:29 +01:00
LICENSE First commit 2018-03-11 20:12:35 +00:00
MANIFEST.in Include labels in package 2018-11-24 00:32:41 +01:00
README.md Update documentation to match what the project became 2018-12-05 10:25:13 -05:00
VERSION Add setup.py and test building bugbug package (#1) 2018-11-19 22:53:17 +01:00
requirements.txt Update scikit-learn from 0.20.1 to 0.20.2 (#47) 2018-12-19 14:18:52 +01:00
run.py Consume iterators returned from bugzilla.download_bugs, so that bugs are actually downloaded 2018-12-17 23:37:51 +01:00
setup.cfg Enable several flake8 checkers 2018-09-21 16:45:04 +02:00
setup.py Load URL requirements too 2018-12-14 00:01:27 +01:00
test-requirements.txt Update pytest from 4.0.1 to 4.0.2 (#34) 2018-12-15 01:41:34 +01:00

README.md

bugbug

Classifiers

  • bug vs feature - Bugs on Bugzilla aren't always bugs. Sometimes they are feature requests, refactorings, and so on. The aim of this classifier is to distinguish between bugs that are actually bugs and bugs that aren't. The dataset currently contains 2110 bugs, and the accuracy of the current classifier is ~93% (precision ~95%, recall ~94%).

  • regression vs non-regression - Bugzilla has a regression keyword to identify bugs that are regressions. Unfortunately, it isn't used consistently. The aim of this classifier is to detect bugs that are regressions.

  • tracking - The aim of this classifier is to detect bugs to track.
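
For readers unfamiliar with the accuracy, precision and recall figures quoted above, here is a minimal sketch of how such metrics are computed for a binary classifier. It is not taken from the bugbug code base: the function name classification_metrics and the toy labels are assumptions made up for illustration, with 1 standing for "actual bug" and 0 for "not a bug".

```python
def classification_metrics(y_true, y_pred):
    """Return (accuracy, precision, recall) for binary labels.

    Hypothetical helper for illustration only; 1 = actual bug, 0 = not a bug.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall


# Toy labels, invented for this example (not real Bugzilla data).
y_true = [1, 1, 1, 0, 0, 1, 0, 1]
y_pred = [1, 1, 0, 0, 1, 1, 0, 1]

accuracy, precision, recall = classification_metrics(y_true, y_pred)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
# prints: accuracy=0.75 precision=0.80 recall=0.80
```

Precision answers "of the reports predicted to be bugs, how many really were?", while recall answers "of the real bugs, how many were found?".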

Setup

Run pip install -r requirements.txt and pip install -r test-requirements.txt to install the runtime and test dependencies.

If you update the bugs database, run xz -v9 -k data/bugs.json. If you update the commits database, run xz -v9 -k data/commits.json.
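
If the xz command-line tool is not available, roughly the same result can be sketched with Python's standard lzma module. The helper name compress_keep_original is an assumption for illustration and is not part of bugbug; it mirrors xz -9 -k by using preset 9 and leaving the original file in place.

```python
import lzma
import shutil


def compress_keep_original(path, preset=9):
    """Write an .xz copy of `path` next to it and keep the original file.

    Hypothetical helper, approximating `xz -9 -k <path>`.
    """
    xz_path = path + ".xz"
    # lzma.open defaults to the .xz container format; preset=9 is the
    # highest standard compression level.
    with open(path, "rb") as src, lzma.open(xz_path, "wb", preset=preset) as dst:
        shutil.copyfileobj(src, dst)
    return xz_path
```

Calling compress_keep_original("data/bugs.json") would then write data/bugs.json.xz alongside the unmodified data/bugs.json. Unlike xz, this sketch silently overwrites an existing .xz file and prints no progress output.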