InformationExtraction

Documentation for the project is available in the project wiki.

Finish the build process.
Copy the ie-dist folder to your Spark cluster.
Run RunSparkBatchDriver.sh to start batch processing, where you can input sentences or (HDFS) file paths.
To run Relation Evaluation, refer to RunRelationEvaluation.sh; you may need to change the file locations in it to match your cluster settings. Before running the evaluation, copy the data folder to your cluster and upload it to HDFS (this only needs to be done once).
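
The steps above can be sketched as a small shell script. This is only an illustration under several assumptions that the project does not confirm: that the build is a standard Maven build (suggested by pom.xml), that ie-dist and data are in the current directory, and that the cluster is reachable over SSH. CLUSTER_HOST, HDFS_DATA_DIR and DRY_RUN are hypothetical names introduced here. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of the deployment steps. Host, user, and paths are placeholders,
# not values taken from the project.
set -eu

CLUSTER_HOST="${CLUSTER_HOST:-user@spark-master}"   # hypothetical SSH login
HDFS_DATA_DIR="${HDFS_DATA_DIR:-/user/ie/data}"     # hypothetical HDFS target
DRY_RUN="${DRY_RUN:-1}"                             # 1 = print only, 0 = execute

run() {
    # Print the command; execute it only when DRY_RUN=0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

deploy() {
    run mvn clean package                    # assumed Maven build
    run scp -r ie-dist "${CLUSTER_HOST}:"    # copy the distribution to the cluster
    run scp -r data "${CLUSTER_HOST}:"       # copy the evaluation data (one-time)
    run ssh "$CLUSTER_HOST" hdfs dfs -put data "$HDFS_DATA_DIR"  # one-time HDFS upload
}

deploy
# Afterwards, start RunSparkBatchDriver.sh or RunRelationEvaluation.sh on the cluster.
```

Run it once as is to review the printed commands, then set DRY_RUN=0 together with your own CLUSTER_HOST and HDFS_DATA_DIR to execute them.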

Refer to config.properties to change the configuration, such as pipeline components, NER models, dictionaries, and regex rules.
Customized training for the NER and Relation Extractor is supported by the com.intel.ie.training package.

