ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.
Kabir Khan 7a13a53dee fix typo 2020-07-26 01:17:21 -07:00
docs Adding for-the-badge style badges using shields.io and util for loading/saving to/from prodigy 2020-07-26 00:45:12 -07:00
examples type corrections in recon.v1.fix_annotations 2020-06-25 13:25:41 -07:00
recon fix typo 2020-07-26 01:17:21 -07:00
scripts Fixing imports 2020-03-20 15:15:32 -07:00
tests add operation tests 2020-07-17 14:31:35 -07:00
.coveragerc Increasing test coverage. Fixing bugs and inconsistencies 2020-03-20 14:49:55 -07:00
.env Initial Commit to Microsoft Repo 2020-02-20 11:55:50 -08:00
.gitignore Adding caching to preprocessing and adding docs 2020-04-19 14:21:29 -07:00
CODE_OF_CONDUCT.md Initial CODE_OF_CONDUCT.md commit 2020-02-20 11:52:12 -08:00
LICENSE Initial LICENSE commit 2020-02-20 11:52:13 -08:00
README.md Adding for-the-badge style badges using shields.io and util for loading/saving to/from prodigy 2020-07-26 00:45:12 -07:00
SECURITY.md Initial SECURITY.md commit 2020-02-20 11:52:14 -08:00
azure-pipelines.yml Rename module to recon from reconner. Fix types. 2020-02-27 15:22:14 -08:00
mkdocs.yml adding corpus tutorial docs 2020-05-19 19:33:04 -07:00
mypy.ini Initial Commit to Microsoft Repo 2020-02-20 11:55:50 -08:00
pyproject.toml type corrections in recon.v1.fix_annotations 2020-06-25 13:25:41 -07:00
streamlit_recon.py skip these dumb files in isort 2020-03-20 17:42:48 -07:00

README.md

Recon

Recon NER: Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.



Documentation: https://microsoft.github.io/reconner

Source Code: https://github.com/microsoft/reconner


ReconNER is a library to help you fix your annotated NER data and identify examples that are hardest for your model to predict so you can strategically prioritize the examples you annotate.

The key features are:

  • Data Validation and Cleanup: Easily validate the format of your NER data. Filter overlapping entity annotations and fix missing properties.
  • Model Insights: Analyze how well your model performs on your dataset. Identify the top errors your model is making so you can prioritize data collection and correction strategically.
  • Dataset Management: ReconNER provides a Dataset class to manage the train/dev/test split of your data and apply the same functions across all splits, as well as to a concatenation of all examples. Operate in place to transform your data consistently.
  • Serializable Dataset: Serialize your data from the ReconNER type system to JSON and deserialize it back.
  • Type Hints: Comprehensive typing system based on Python 3.6+ type hints.
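
To make two of the features above concrete, here is a minimal plain-Python sketch of filtering overlapping entity annotations and applying the same function across train/dev/test splits plus a concatenated view. It does not use the ReconNER API: the example format and the helper names `filter_overlaps` and `apply_to_splits` are assumptions made for illustration, and keeping the longest span is only one possible way to resolve overlaps.

```python
from typing import Any, Callable, Dict, List

# Hypothetical example format (an assumption, not ReconNER's type system):
# {"text": str, "spans": [{"start": int, "end": int, "label": str}, ...]}
Example = Dict[str, Any]


def filter_overlaps(example: Example) -> Example:
    """Keep the longest span wherever two annotations overlap."""
    # Longest spans first; ties go to the span that starts earlier.
    candidates = sorted(
        example["spans"],
        key=lambda s: (s["end"] - s["start"], -s["start"]),
        reverse=True,
    )
    kept: List[Dict[str, Any]] = []
    for span in candidates:
        # Half-open [start, end) spans are disjoint when one ends
        # before (or exactly where) the other starts.
        if all(span["end"] <= k["start"] or span["start"] >= k["end"] for k in kept):
            kept.append(span)
    return {"text": example["text"], "spans": sorted(kept, key=lambda s: s["start"])}


def apply_to_splits(
    splits: Dict[str, List[Example]], fn: Callable[[Example], Example]
) -> Dict[str, List[Example]]:
    """Apply fn to every example in every split and add a concatenated 'all' view."""
    result = {name: [fn(ex) for ex in examples] for name, examples in splits.items()}
    result["all"] = [ex for name in splits for ex in result[name]]
    return result


splits = {
    "train": [
        {
            "text": "Kabir works at Microsoft Research.",
            "spans": [
                {"start": 0, "end": 5, "label": "PERSON"},
                {"start": 15, "end": 24, "label": "ORG"},  # "Microsoft"
                {"start": 15, "end": 33, "label": "ORG"},  # "Microsoft Research"
            ],
        }
    ],
    "dev": [],
    "test": [],
}

cleaned = apply_to_splits(splits, filter_overlaps)
```

After this runs, the shorter "Microsoft" span is dropped in favor of "Microsoft Research", and `cleaned["all"]` holds the cleaned examples from every split in one list.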

Requirements

Python 3.6+

ReconNER is built on a few comprehensive, high-performing packages.

Installation

$ pip install reconner
---> 100%
Successfully installed reconner

License

This project is licensed under the terms of the MIT license.