Covid-19 Semantic Search Engine Website

This Project is part of Society Impact Project - Electronics and Electrical Communication Department - Faculty of Engineering - Cairo University - Dr. Hassan Mostafa

Developed a website like a search engine for the medical community to help them in their fight against COVID-19 Under Supervision of Dr. Hassan Mostafa

This Repo contains a website for a COVID-19 Search Engine that can be used by Medical community to search for topics in covid-19 published papers, it's based on LDA and trained on +40K papers

Dataset - COVID-19 Open Research Dataset (CORD-19):

COVID-19 Open Research Dataset (CORD-19) is a free resource of scholarly articles, aggregated by a coalition of leading research groups, about COVID-19 and the coronavirus family of viruses. The dataset can be found on Semantic Scholar and there is a research challenge on Kaggle.

This project builds an index over the CORD-19 dataset to assist with analysis and data discovery. A series of tasks were explored to identify relevant articles and help find answers to key scientific questions on a number of COVID-19 research topics.

Tasks

The following files show the top query results for each task provided in the CORD-19 Research Challenge using this model. A highlights section is also shown for each task, which highlights the most relevant sentences from the query results.

Installation

You can use Git to clone the repository from GitHub and install it.

Python 3.5+ is supported

Building a model

Download all the files in the Download CORD-19 section on Semantic Scholar. Go the directory with the files and run the following commands.

cd <download_path>

For each tar.gz file run the following mkdir && tar -C -xvzf <file.tar.gz>

Once completed, there should be a file name metadata.csv and subdirectories for each data subset with all json articles.

To build the model locally:

# run loader.py to prepare the dataset
python -m loader.py 

# Build model files 
python -m model.py

The model will be stored in the same directory

Tech Overview

The model is a built on LDA and using CountVectorizer

Please find attached the video with a demo for the website working at this link

Under Supervision of:

Dr Hassan Mostafa

Submitted by:

Abdallah Ahmed
Abdelrahman Ahmed
Mohamed Sabry
Mohamed Abd Elhalim
Youssef Mostafa

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
static		static
templates		templates
README.md		README.md
app.py		app.py
loader.py		loader.py
main.py		main.py
model.py		model.py
runServer.py		runServer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Covid-19 Semantic Search Engine Website

Developed a website like a search engine for the medical community to help them in their fight against COVID-19 Under Supervision of Dr. Hassan Mostafa

Dataset - COVID-19 Open Research Dataset (CORD-19):

Tasks

Installation

Building a model

Tech Overview

Please find attached the video with a demo for the website working at this link

Under Supervision of:

Submitted by:

About

Releases

Packages

Languages

Youssef-Matloob/Covid-19-Semantic-Search-Engine-Website

Folders and files

Latest commit

History

Repository files navigation

Covid-19 Semantic Search Engine Website

Developed a website like a search engine for the medical community to help them in their fight against COVID-19 Under Supervision of Dr. Hassan Mostafa

Dataset - COVID-19 Open Research Dataset (CORD-19):

Tasks

Installation

Building a model

Tech Overview

Please find attached the video with a demo for the website working at this link

Under Supervision of:

Submitted by:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages