Towards Improving Neural Named Entity Recognition with Gazetteers
This repository contains the official open-source implementation of the soft-dictionary-enhanced NER model from our paper:
Towards Improving Neural Named Entity Recognition with Gazetteers (ACL 2019).
Tianyu Liu, Jin-Ge Yao, and Chin-Yew Lin
If you find this repo helpful, please cite the paper:
@inproceedings{liu-etal-2019-towards,
title = {Towards Improving Neural Named Entity Recognition with Gazetteers},
author = {Tianyu Liu and Jin-Ge Yao and Chin-Yew Lin},
booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
year = 2019,
address = {Florence, Italy},
publisher = {Association for Computational Linguistics},
url = {https://aclanthology.org/P19-1524},
doi = {10.18653/v1/P19-1524},
pages = {5301--5307},
}
Overall architecture
Installation
First, clone the repository and enter the project directory:
git clone https://github.com/microsoft/vert-papers.git && cd vert-papers/papers/SubTagger
- Create a virtual environment with Conda
conda create -n softdict --file requirements_conda.txt -c conda-forge/label/broken -c conda-forge
- Activate the new environment
conda activate softdict
- Install the pip requirements
pip install -r requirements_pip.txt
- Prepare the configurations
sed -i 's@INSTALLATION_DIR@'"$PWD"'@' configs/*.config
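The `sed -i` form above is GNU syntax; BSD/macOS `sed` expects an explicit backup suffix after `-i`, so the command may fail there. As a portable alternative, here is a minimal Python sketch of the same substitution. It assumes the configs only need a plain text replacement of the `INSTALLATION_DIR` placeholder; the function name `fill_installation_dir` is hypothetical and not part of this repository.

```python
from pathlib import Path


def fill_installation_dir(config_dir, install_dir, placeholder="INSTALLATION_DIR"):
    """Replace `placeholder` with `install_dir` in every *.config file in `config_dir`.

    Hypothetical helper (not part of this repository). Returns the sorted
    names of the files that were actually modified.
    """
    changed = []
    for path in sorted(Path(config_dir).glob("*.config")):
        text = path.read_text(encoding="utf-8")
        if placeholder in text:
            # Plain string replacement, equivalent to the sed 's@...@...@' call
            # for configs that contain the placeholder once per line.
            path.write_text(text.replace(placeholder, str(install_dir)), encoding="utf-8")
            changed.append(path.name)
    return changed
```

Calling `fill_installation_dir("configs", Path.cwd())` from the SubTagger directory should have the same effect as the `sed` command. Running it a second time changes nothing, because the placeholder is already gone.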
Training
allennlp train configs/HSCRF_softDictionary.conll2003.config -s dump_directory/ --include-package models
Evaluating
allennlp evaluate dump_directory/model.tar.gz https://www.jeffliu.page/files/DATA/conll2003/test.txt --include-package models
The Gazetteer
The gazetteers and the pretrained subtagger module can be found here