GitHub - Minou21/horus-ner: HORUS: A framework to boost NLP tasks

Boosting NER

HORUS is meta and multi-level framework designed to provide a set of features at word-level to boost natural language frameworks. It's architecure is based on image processing and text classification clustering algorithms and shows to be helpful especially to noisy data, such as microblogs.

We are currently investigating Named Entity Recognition (NER) as use case. This version supports the identification of classical named-entity types (LOC, PER, ORG).

Easy Setup (Python 2.7 based)

Setup the python environment

setup Anaconda
conda env create -f horus.v015.yml (tested for OSX) (*)
source activate horus_27_cv310

Setup the framework environment

setup SQLite database and run our script to create the schema
get your Microsoft Bing API Key and Microsoft Translator API Key to query the Web.
configure the parameters at the .ini file (copy horus_dist.ini to ~/horus.ini)

(*) - setup openCV 3.1: OSx users can benefit from anaconda, which provides a running version of OpenCV 3.1.0 (see more at #issue 6)

Usage

python main.py --input_text="whitney houston has been honored in nyc" --ds_format=0 --output_format="csv"

python main.py --input_file="sentences.txt" --ds_format=0

python main.py --input_file="ritter_ner.tsv" --ds_format=1 --output_file="metadata" --output_format="json"

to process an input file (e.g., conll) and generate the horus feature file format

examples/process_input_file.py

to convert a horus input file format to conll

horus/core/data_conversion.py

to run some benchmarks

experiments/benchmarking.py

Version

0.1.0 initial version
0.1.1 adding text classification
0.1.2 adding map detection
0.1.5 paper version

Name		Name	Last commit message	Last commit date
Latest commit History 332 Commits
data		data
dist		dist
experiments		experiments
horus		horus
output		output
webservice		webservice
.gitattributes		.gitattributes
.gitignore		.gitignore
CNAME		CNAME
LICENSE		LICENSE
MANIFEST		MANIFEST
README.BACKLOG		README.BACKLOG
README.md		README.md
README.rst		README.rst
_config.yml		_config.yml
horus.v015.yml		horus.v015.yml
horus0.1.5.db.sql		horus0.1.5.db.sql
horus_output.html		horus_output.html
metadata.py		metadata.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Boosting NER

Easy Setup (Python 2.7 based)

Usage

Version

About

Releases

Packages

Languages

License

Minou21/horus-ner

Folders and files

Latest commit

History

Repository files navigation

Boosting NER

Easy Setup (Python 2.7 based)

Usage

Version

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages