-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author disambiguation #77
Comments
The ORCID for all the zbmath authors in https://zenodo.org/records/7378860 have been inserted. Current statistics in the KG:
Next step: get Wikidata QID for as many humans as possible:
|
Given the zbMath ID I have matched them to items available in Wikidata. Only ~5% of the zbMath authors exist in Wikidata (with the zbmath identifier). For those where an ORCID was present, it has also been imported. Current statistics:
|
I've imported further Wikidata QIDs given the current ORCID in the KG. Current statistics:
|
Wikidata has author items that contain two zbMath IDs. For most of the cases this is wrong, which leads to our knowledge graph having the same Wikidata QID for two different zbmath authors. |
Issue description:
The current importers (CRAN, zbMath, polyDB) create entities for authors using ORCID ID, zbMath ID or no identifier.
For the cases in which an identifier exists, authors might have been created more than once by different importers.
Duplicate authors should be identified, merged and completed with information from Wikidata.
The dataset mentioned here (MaRDI4NFDI/portal-compose#344) can be useful for the task.
TODOS:
Acceptance-Criteria
Checklist for this issue:
The text was updated successfully, but these errors were encountered: