Information Retrieval Project

About

This repository contains the code of our project for the NWI-I00041 Information Retrieval course at Radboud University (2019-2020).

Abstract:
In this paper, we will investigate whether Wikipedia-based query expansion can be used to improve information retrieval systems. Our approach uses the Wikipedia and DBpedia knowledge bases to generate possible expansion candidates. These expansion candidates are then ranked using Explicit Semantic Analysis (ESA), and the top candidates are selected to expand the query. We conducted several experiments to compare different methods of querying Wikipedia, different numbers of expansion candidates, and different filtering methods. Unfortunately, the results showed that none of our implementations outperformed the baseline information retrieval system.

Structure

The code has been organised as follows:

IR Query Expansion is a Java (NetBeans) project that contains the code for query expansion. Further details about the requirements can be found in the README.txt in the project.
IR Wikipedia Index is a Java (NetBeans) project that contains the code for retrieving query-expansion-candidates from Wikipedia. This project was not integrated with IR Query Expansion, because the projects require different Lucene versions which causes conflicts. Further details about the requirements can be found in the README.txt in the project.
experiment is a directory that contains the expanded queries we used, the evaluation metrics we obtained, and the plots we generated. Furthermore, it contains two Python files evaluate_experiment.py and generate_plots.py. The former runs our experiments to obtain the evaluation metrics and the latter generates the interpolated precision-recall plots from the obtained evaluation metrics.

Contributors

Freek van den Bergh
Max Driessen
Marlous Nijman

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
IR Query Expansion		IR Query Expansion
IR Wikipedia Index		IR Wikipedia Index
experiment		experiment
.gitignore		.gitignore
README.md		README.md
anserini-0.6.0-fatjar.jar		anserini-0.6.0-fatjar.jar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Information Retrieval Project

About

Structure

Contributors

About

Releases

Packages

Languages

fbergh/IR_Project

Folders and files

Latest commit

History

Repository files navigation

Information Retrieval Project

About

Structure

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages