A column oriented dataset that can be used for named-entity recognition.
Assuming you already have python 2.7
, pip 9
, java 11
,
Download jena 3.9 and update classpath:
export CLASSPATH=${CLASSPATH}:YOUR-JENA-DIR-PATH/lib/*
Create a new virtualenv:
install Cython
python -m pip install --upgrade cython
Install dependencies with pip:
pip install -r requirements.txt
python ner_dataset.py