A column-oriented dataset that can be used for named-entity recognition.
Assuming you already have Python 2.7, pip 9, and Java 11, download Jena 3.9 and update the classpath:
    export CLASSPATH=${CLASSPATH}:YOUR-JENA-DIR-PATH/lib/*
Create a new virtualenv.
Install Cython:
    python -m pip install --upgrade cython
Install dependencies with pip:
    pip install -r requirements.txt
Run the script:
    python ner_dataset.py
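The exact layout of the generated file is not described here. As a rough sketch, assuming a CoNLL-style layout (one token per line, the entity tag in the last column, a blank line between sentences), a column-oriented NER file could be read like this. The function name `read_column_dataset`, the sample rows, and the tag names are hypothetical and are not taken from `ner_dataset.py`.

```python
from __future__ import print_function  # keeps the sketch valid on Python 2.7 too


def read_column_dataset(lines, sep=None):
    """Group (token, tag) rows into sentences.

    Assumed layout (not confirmed by this README): one token per line,
    the tag in the last column, and a blank line between sentences.
    """
    sentences, current = [], []
    for raw in lines:
        line = raw.strip()
        if not line:
            # A blank line closes the sentence collected so far.
            if current:
                sentences.append(current)
                current = []
            continue
        columns = line.split(sep)  # sep=None splits on any whitespace
        current.append((columns[0], columns[-1]))
    if current:
        # The file may end without a trailing blank line.
        sentences.append(current)
    return sentences


# Hypothetical sample rows, for illustration only.
sample = """\
Paris B-LOC
is O
lovely O

Marie B-PER
Curie I-PER
"""

sentences = read_column_dataset(sample.splitlines())
for sentence in sentences:
    print(sentence)
```

If the real file uses tabs or has extra columns between the token and the tag, passing `sep="\t"` should still work, since only the first and last columns are used.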