mvdheram/IRMiniProject


IRMiniProject

Builds a search engine over Wikipedia and tests the quality of the search against the QALD 7 train multilingual dataset.

QALD 7 train multilingual dataset

https://github.com/ag-sc/QALD/blob/master/7/data/qald-7-train-multilingual.json
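
A minimal sketch for pulling the English questions out of the dataset. It assumes the file follows the usual QALD JSON layout (a top-level `questions` list whose entries each carry a `question` list of objects with `language` and `string` fields); check this against the downloaded file. The function name `english_questions` and the sample record are hypothetical.

```python
import json


def english_questions(qald: dict, language: str = "en") -> list:
    """Return (id, question text) pairs for one language.

    Assumed layout: {"questions": [{"id": ..., "question":
    [{"language": "en", "string": "..."}]}]}.
    """
    pairs = []
    for entry in qald.get("questions", []):
        for variant in entry.get("question", []):
            if variant.get("language") == language and "string" in variant:
                pairs.append((str(entry.get("id")), variant["string"]))
                break  # keep one question text per entry
    return pairs


# Made-up sample in the assumed layout (not taken from the real file).
sample = json.loads("""
{"questions": [
  {"id": "1", "question": [
    {"language": "de", "string": "Beispiel?"},
    {"language": "en", "string": "Example question?"}]}
]}
""")
print(english_questions(sample))
```

Each returned question string can then be sent to the search engine as a query.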

Wikipedia dump file for indexing

http://hobbitdata.informatik.uni-leipzig.de/teaching/corpora/enwiki-20171103-pages.tsv

Helper library to measure the quality of the search (F-measure)

https://github.com/dice-group/NLIWOD/tree/master/qa.commons
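
For reference, the F-measure for a single question combines the precision and recall of the retrieved documents against the gold answers. The sketch below is a plain-Python illustration of that formula, not the qa.commons API. The function name `f_measure` and the convention of returning 0.0 for empty sets are assumptions.

```python
def f_measure(retrieved: set, relevant: set, beta: float = 1.0) -> tuple:
    """Return (precision, recall, F_beta) for one query."""
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    if precision == 0.0 and recall == 0.0:
        return precision, recall, 0.0
    b2 = beta * beta
    # F_beta = (1 + beta^2) * P * R / (beta^2 * P + R)
    f = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f


# Hypothetical example: 4 results returned, 2 of the 3 gold answers found.
p, r, f1 = f_measure({"A", "B", "C", "D"}, {"A", "B", "E"})
print(p, r, f1)
```

In this made-up example precision is 2/4, recall is 2/3, and F1 is 4/7.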

Indexing Engine

Elasticsearch (version 7.3.3)
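
A rough sketch of how lines from the dump could be turned into bulk-index actions. It assumes each line holds two tab-separated columns (a title, then the article text), which may not match the actual file. The index name `wikipedia`, the field names, and the function name `bulk_actions` are hypothetical. The dicts use the `_index` / `_source` shape accepted by `elasticsearch.helpers.bulk` in the official Python client.

```python
from typing import Iterable, Iterator


def bulk_actions(lines: Iterable[str], index: str = "wikipedia") -> Iterator[dict]:
    """Yield one bulk-index action per well-formed TSV line.

    Assumed line format: "<title>\t<article text>".
    """
    for line in lines:
        parts = line.rstrip("\n").split("\t", 1)
        if len(parts) < 2 or not parts[0]:
            continue  # skip malformed or empty lines
        title, text = parts
        yield {"_index": index, "_source": {"title": title, "text": text}}


# Made-up lines in the assumed format (not taken from the real dump).
demo = ["Berlin\tBerlin is the capital of Germany.\n", "broken line\n"]
for action in bulk_actions(demo):
    print(action)
```

With a running cluster and the `elasticsearch` package installed, the generator could be passed to `elasticsearch.helpers.bulk(client, bulk_actions(open(path, encoding="utf-8")))`, where `client` is an `Elasticsearch` instance and `path` points at the downloaded TSV file.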
