
GSOC2013_Progress_Hady Elsahar

hady elsahar edited this page Aug 8, 2013 · 36 revisions

Integrating Wikidata into DBpedia

Proposal

Full project proposal

Students

Mentors

  • Sebastian Hellmann
  • Dimitris Kontokostas

Project Progress:

Week 1:

  • Made a public clone of the extraction framework
  • Prepared the development environment
  • Compiled the extraction framework
  • Got to know the main class structure of the DBpedia extraction framework

Readings

Important discussions:


Week 2 [17-6-2013]:

  • Explored the PubSubHubbub protocol
  • Installed a local hub and subscribed to an RSS feed

Overview of the PubSubHubbub protocol
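
The subscription step can be illustrated with a minimal sketch. It assumes the standard PubSubHubbub flow: the subscriber POSTs a form-encoded request to the hub, and the hub then verifies the request with a GET to the callback, which must echo back `hub.challenge`. The function names, the callback URL, and the feed URL are hypothetical placeholders, not taken from the project. Python is used here only for brevity, and nothing is sent over the network.

```python
from urllib.parse import urlencode, parse_qs


def build_subscription_body(callback_url, topic_url, lease_seconds=None):
    """Build the form-encoded body a subscriber POSTs to the hub."""
    params = {
        "hub.mode": "subscribe",
        "hub.topic": topic_url,        # the feed we want updates for
        "hub.callback": callback_url,  # where the hub will deliver updates
    }
    if lease_seconds is not None:
        params["hub.lease_seconds"] = str(lease_seconds)
    return urlencode(params)


def answer_verification(query_string, expected_topic):
    """Answer the hub's verification GET sent to the callback URL.

    Returns a (status, body) pair: the challenge is echoed only when the
    request matches a subscription we actually asked for.
    """
    query = parse_qs(query_string)
    mode = query.get("hub.mode", [""])[0]
    topic = query.get("hub.topic", [""])[0]
    challenge = query.get("hub.challenge", [""])[0]
    if mode == "subscribe" and topic == expected_topic and challenge:
        return 200, challenge
    return 404, ""


if __name__ == "__main__":
    # Hypothetical local callback and feed URLs.
    print(build_subscription_body(
        "http://localhost:8080/callback", "http://example.org/feed.rss", 3600))
```

Checking the topic before echoing the challenge keeps a third party from subscribing the callback to arbitrary feeds.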

readings

important discussions :


Week 3 [24-6-2013]:

  • Create an RDF dump out of 1-2K Wikidata entities
  • Work on the language links from the API:
    1. Process the Wikidata info and generate the master IL (interlanguage) links file.
    2. Produce language-specific same_as files from the master IL links file.
  • Create a few mappings in the mappings wiki (as owl:equivalentProperty), starting with the properties most common in the dumps

Important discussions:
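
As a rough illustration of the first language-links step, the sketch below turns one entity's site links into `owl:sameAs` lines for a master file. It is not the project's code: the function names, the `wikidata.dbpedia.org` subject URI, and the `<lang>.dbpedia.org` resource scheme are assumptions made for the example, and the title encoding is deliberately simplified.

```python
from urllib.parse import quote

SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def resource_uri(wiki_code, title):
    """Map a site link such as ("enwiki", "Douglas Adams") to a resource URI.

    Assumption: every language edition lives under <lang>.dbpedia.org.
    """
    lang = wiki_code[:-len("wiki")]  # "enwiki" -> "en"
    name = quote(title.replace(" ", "_"), safe="_(),'")
    return "http://%s.dbpedia.org/resource/%s" % (lang, name)


def master_triples(entity_id, sitelinks):
    """Yield one N-Triples line per site link of a Wikidata entity."""
    # Hypothetical subject namespace for Wikidata entities.
    subject = "http://wikidata.dbpedia.org/resource/" + entity_id
    for wiki_code in sorted(sitelinks):
        if not wiki_code.endswith("wiki"):
            continue  # skip sister projects such as "enwikivoyage"
        target = resource_uri(wiki_code, sitelinks[wiki_code])
        yield "<%s> <%s> <%s> ." % (subject, SAME_AS, target)


if __name__ == "__main__":
    links = {"enwiki": "Douglas Adams", "dewiki": "Douglas Adams"}
    for line in master_triples("Q42", links):
        print(line)
```

Because all lines for one entity are emitted together, the master file naturally groups links by entity, which a later per-language split can rely on.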


Language Links Extraction [1-7-2013] -> [1-8-2013]:

  • Step 1: Creating the master LLinks file (replacing the old bash commands with Scala code)
  • Step 2: Creating the language-specific LLinks extraction in folders. After several code iterations we agreed that we can rely on the links coming in blocks. Implemented Algorithm
  • Updating the code to use existing extraction framework utilities instead of rewriting them
  • Code reviews 1, 2, 3
  • More code reviews, some code conflicts

Important links/discussions:
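
The page records only that step 2 relies on links arriving in blocks. The sketch below shows one way such a block-based split could work, and it is not the implemented algorithm: it assumes that all master-file lines for one entity are adjacent, and that objects follow a hypothetical `http://<lang>.dbpedia.org/resource/...` scheme. Output goes into in-memory lists rather than per-language folders to keep the example self-contained.

```python
import re
from collections import defaultdict
from itertools import groupby

SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
TRIPLE = re.compile(r"<([^>]+)> <([^>]+)> <([^>]+)> \.")
LANG = re.compile(r"^http://([a-z\-]+)\.dbpedia\.org/resource/")


def split_by_language(master_lines):
    """Turn a block-ordered master file into per-language sameAs lines.

    Only one block (all links of a single entity) is held in memory at a
    time, which is what makes the block assumption useful.
    """
    matches = (TRIPLE.match(line.strip()) for line in master_lines)
    triples = (m.groups() for m in matches if m)
    output = defaultdict(list)
    # groupby only merges adjacent lines, so it depends on block ordering.
    for _subject, block in groupby(triples, key=lambda t: t[0]):
        targets = [obj for _s, _p, obj in block]
        for source in targets:
            lang = LANG.match(source)
            if not lang:
                continue
            for other in targets:
                if other != source:
                    output[lang.group(1)].append(
                        "<%s> <%s> <%s> ." % (source, SAME_AS, other))
    return dict(output)


if __name__ == "__main__":
    subject = "<http://wikidata.dbpedia.org/resource/Q1>"
    master = [
        "%s <%s> <http://en.dbpedia.org/resource/A> ." % (subject, SAME_AS),
        "%s <%s> <http://de.dbpedia.org/resource/A> ." % (subject, SAME_AS),
    ]
    for lang, lines in sorted(split_by_language(master).items()):
        print(lang, lines)
```

If the input were not ordered in blocks, the same entity would show up as several separate groups and some cross-language links would be missed, so the ordering guarantee is the key precondition.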


--- off to Leipzig [2-8-2013] -> [6-8-2013]


Week 8 [5-8-2013] -> [11-8-2013]:

Important discussions/links: