This repository contains the code for generating SPARQL queries from natural-language questions using a chain-of-thought approach.

To run the system locally, follow one of the setup methods below, depending on your environment:
If you are using Conda, create and activate the environment with the following commands:

```
conda env create -f environment.yml
conda activate sparqlgen
```
If you are not using Conda, install the dependencies with pip instead:

```
pip install -r requirements.txt
```
We have provided the sentence embeddings and other relevant information, but users can create their own embeddings using the notebook in the `temp` folder. To obtain the context examples (embeddings and other information), download the necessary files into the `temp` directory using the following command:

```
wget -P temp \
  'https://anon.to/?https://files.dice-research.org/datasets/COT-SPARQLGEN/dbpedia_examples.parquet' \
  'https://anon.to/?https://files.dice-research.org/datasets/COT-SPARQLGEN/embeddings_dbpedia.pkl' \
  'https://anon.to/?https://files.dice-research.org/datasets/COT-SPARQLGEN/embeddings_wikidata.pkl' \
  'https://anon.to/?https://files.dice-research.org/datasets/COT-SPARQLGEN/wikidata_examples.parquet'
```
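Once downloaded, the files can be sanity-checked with pandas and pickle. The snippet below is only a sketch: the parquet schema and the pickle contents are assumptions inferred from the file names.

```python
import pickle

import pandas as pd

# Few-shot example pool for DBpedia (schema assumed from the file name).
examples = pd.read_parquet("temp/dbpedia_examples.parquet")
print(examples.shape, list(examples.columns))

# Precomputed sentence embeddings (assumed to be a pickled array or list).
with open("temp/embeddings_dbpedia.pkl", "rb") as f:
    embeddings = pickle.load(f)
print(type(embeddings), len(embeddings))
```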
You are now ready to run the system:

```
python main.py --model_path <model_path> --kb <knowledge_base> --question <question>
```
For example:

```
python main.py --model_path TheBloke/CodeLlama-34B-Instruct-GPTQ --kb dbpedia --question 'what is the capital of Germany'
```
The `--kb` argument accepts either `dbpedia` or `wikidata`. You can also change the model to any model that runs on your system.
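Internally, the stored embeddings are used to pick few-shot context examples for the prompt. The following sketch illustrates such a nearest-neighbour lookup with cosine similarity; it is not the repository's exact retrieval code, and the DataFrame layout (including a `question` column) is an assumption.

```python
import numpy as np
import pandas as pd
from sentence_transformers import SentenceTransformer

def top_k_examples(question: str, embeddings: np.ndarray,
                   examples: pd.DataFrame, k: int = 5) -> pd.DataFrame:
    """Return the k stored examples whose embeddings are closest to the question's."""
    encoder = SentenceTransformer("all-MiniLM-L6-v2")  # same model used for the stored embeddings
    q = encoder.encode([question])[0]
    # Cosine similarity between the query vector and every stored embedding.
    sims = embeddings @ q / (np.linalg.norm(embeddings, axis=1) * np.linalg.norm(q) + 1e-12)
    return examples.iloc[np.argsort(-sims)[:k]]
```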
We used the following datasets during our experiments:

| Wikidata | DBpedia |
|---|---|
| QALD-10 | QALD-9 |
| LC-QuAD 2.0 | VQuAnDa |
The datasets are available for download in the `dataset` folder of our repository.
We have provided a link to the embeddings and relevant data. However, if you wish to create your own embeddings, the code is available in the `temp` folder as `embeddings.ipynb`. We utilized `all-MiniLM-L6-v2` for sentence encoding, but users may change it according to their requirements.
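For reference, the encoding step boils down to something like the sketch below; the `question` column name and the output path are assumptions, and `embeddings.ipynb` remains the authoritative version.

```python
import pickle

import pandas as pd
from sentence_transformers import SentenceTransformer

# Encode every question in the example pool with all-MiniLM-L6-v2.
examples = pd.read_parquet("temp/dbpedia_examples.parquet")
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(examples["question"].tolist(), show_progress_bar=True)

# Persist the embeddings next to the example pool.
with open("temp/embeddings_dbpedia.pkl", "wb") as f:
    pickle.dump(embeddings, f)
```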
`contexta.py` provides all the details about the models and tools used for entity linking on DBpedia and Wikidata. This step is also optional, and users may substitute the tools of their preference.
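As one self-contained illustration of entity linking (not necessarily the tool configured in `contexta.py`), DBpedia Spotlight offers a public annotation endpoint:

```python
import requests

def link_entities(text: str, confidence: float = 0.5) -> list[str]:
    """Annotate `text` with DBpedia entities via the public Spotlight endpoint."""
    resp = requests.get(
        "https://api.dbpedia-spotlight.org/en/annotate",
        params={"text": text, "confidence": confidence},
        headers={"Accept": "application/json"},
        timeout=30,
    )
    resp.raise_for_status()
    return [r["@URI"] for r in resp.json().get("Resources", [])]

# Returns URIs such as http://dbpedia.org/resource/Germany
print(link_entities("What is the capital of Germany?"))
```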