REEL: Relation Extraction for Entity Linking

Model for biomedical Named Entity Linking improved by Relation Extraction

Reference

Ruas, P., Lamurias, A. & Couto, F.M. Linking chemical and disease entities to ontologies by integrating PageRank with extracted relations from literature. J Cheminform 12, 57 (2020). https://doi.org/10.1186/s13321-020-00461-4

1. Setup

1.1. Docker

Build the Docker image from the Dockerfile:

docker build . --tag reel_image

Then run a docker container:

docker run -v $(pwd):/reel/ --name reel -it reel_image bash

1.2. Data

To download all the ontology and corpora files:

chmod +x get_data.sh
./get_data.sh

2. Usage

2.1. Apply the REEL model on custom input

If you have the ouput of a NER tool first store it in a json file with the same format as 'sample_input.json':

{
 "doc_1": ["hypertension", "diabetes mellitus", "diazepam", "GABA"],
 "doc_2": ["myocarditis", "heart failure", "acetaminophen"],
 "doc_3": ["hepatitis", "caffeine", "adrenaline"]
}

Then apply the REEL model to link the inputed entities to ChEBI concepts:

python run.py --run_label sample_run --input_file sample_input.json -target_kb chebi -model ppr_ic --link_mode corpus_kb_link

The output will be in the file 'sample_run_results.json':

{
"doc_1":{"diazepam":"CHEBI:49575", "gaba":"CHEBI:35621"},
"doc_2":{"acetaminophen":"CHEBI:22160"},
"doc_3":{"adrenaline":"CHEBI:33568", "caffeine":"CHEBI:27732"}
}

There are 3 target knowledge bases available: 'chebi', 'medic' and 'ctd-chem'.

To see more info about the input arguments:

python run.py -h

If instead you want to apply the baseline model run on the same input file:

python run.py --input_file sample_entities.json --target_kb chebi -model baseline

2.2. Apply the REEL model on evaluation dataset

To evaluate the REEL model on the dataset CRAFT-ChEBI run:

python run.py --dataset craft_chebi -model ppr_ic --link_mode corpus_link -target_kb chebi

The results are outputted to a file located in 'results/craft_chebi/ppr_ic/corpus_link' and printed in the terminal:

Total unique entities: 1679
Entities w/o solution (FN): 299
Wrong disambiguations (FP): 119
Correct disambiguations (TP): 1261
Precision: 0.913768115942029
Recall: 0.8083333333333333
Micro F1-score: 0.8578231292517008

Available datasets:

'craft_chebi' (target_kb = chebi)
'bc5cdr_medic_all' (target_kb = medic)
'bc5cdr_medic_train' (target_kb = medic)
'bc5cdr_medic_dev' (target_kb = medic)
'bc5cdr_medic_test' (target_kb = medic)
'bc5cdr_chemicals_all' (target_kb = ctd_chemicals)
'bc5cdr_chemicals_train' (target_kb = ctd_chemicals)
'bc5cdr_chemicals_dev' (target_kb = ctd_chemicals)
'bc5cdr_chemicals_test' (target_kb = ctd_chemicals)

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
candidates		candidates
results		results
src		src
temp		temp
Chemical_relations.json		Chemical_relations.json
Disease_relations.json		Disease_relations.json
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
chebi_relations.json		chebi_relations.json
full_model_temp.chebicraftresults.txt		full_model_temp.chebicraftresults.txt
get_data.sh		get_data.sh
ppr_for_ned_all.class		ppr_for_ned_all.class
requirements.txt		requirements.txt
run.py		run.py
sample_input.json		sample_input.json
sample_run_results.json		sample_run_results.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REEL: Relation Extraction for Entity Linking

Reference

Table of contents:

1. Setup

1.1. Docker

1.2. Data

2. Usage

2.1. Apply the REEL model on custom input

2.2. Apply the REEL model on evaluation dataset

About

Releases

Packages

Languages

License

lasigeBioTM/REEL

Folders and files

Latest commit

History

Repository files navigation

REEL: Relation Extraction for Entity Linking

Reference

Table of contents:

1. Setup

1.1. Docker

1.2. Data

2. Usage

2.1. Apply the REEL model on custom input

2.2. Apply the REEL model on evaluation dataset

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages