Skip to content

Mining an Augmented Graph using INK, starting from a CSV

License

Notifications You must be signed in to change notification settings

KNowledgeOnWebScale/Magic

Repository files navigation

Magic: Mining an Augmented Graph using INK, starting from a CSV

Magic is a tool to transform CSV files into semanticly annotated, structured data while also being able to further augment the dataset based on these semantic annotations.

Magic is higly dependent upon INK, an approach which can generate intepretable embeddings.
More information about INK can be found at: https://github.com/IBCNServices/INK

Magic also uses a HDT-based triple store to extract semantic information.
More information about HDT can be found at: https://www.rdfhdt.org

Three main files are provideed in this repository:

  • MAGIC.py: defines the general Magic class to perform semantic annotations given a csv file.
  • MAIN_MAGIC.py: shows an example how magic can be used to derive semantic annotations from within the Wikidata knowledge graph
  • MAIN_MAGIC_DB.py: shows an example how magic can be used to derive semantic annotations from within the DBpedia knowledge graph

To search for possible candidates matches, magic uses either:

Demo

To run demo application, pip install all packages inside the requirements.txt file
You also need the dbpedia-2016-10 hdt files to build the candidates neighbourhood. You can download it either from the HDT website or from https://www.kaggle.com/bsteenwi/dbpedia.
Place the hdt and index file in the same folder as the StreamlitApp.py file.

Next, execute the following command inside a terminal window:

streamlit run StreamlitApp.py

A video of this demo application is also made available here

How to cite:

Comming soon, Magic is being used at the ISWC2021 Tabular Data to Knowledge Graph Matching" competition.
A paper describing the full system will be made available.

About

Mining an Augmented Graph using INK, starting from a CSV

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages