Errors identified after merge operation of bibliographic resources #37

Open
arcangelo7 opened this issue Dec 7, 2024 · 1 comment
Labels: bug (Something isn't working)

@arcangelo7
Collaborator

After the merge operation on bibliographic resources, the automatic checks detected several issues. They are summarized below:

Pre-existing Issues

The following entities already exhibited issues prior to the merge operation and should be addressed separately:

  • https://w3id.org/oc/meta/br/062302874558: Multiple partOf relations.
  • https://w3id.org/oc/meta/br/061301736112: Multiple partOf relations.
  • https://w3id.org/oc/meta/br/061303427175: Multiple partOf relations.
  • https://w3id.org/oc/meta/br/061201702562: Multiple partOf relations.
  • https://w3id.org/oc/meta/br/06102356785: Multiple partOf relations.
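
A check for this kind of pre-existing problem can be sketched as follows. This is only an illustration, not the project's actual check: it assumes the triples are available as plain `(subject, predicate, object)` tuples, the predicate IRI used for the partOf relation is an assumption, and the function name `find_multiple_part_of` is hypothetical.

```python
from collections import defaultdict

# Assumed predicate IRI for the partOf relation (not stated in this issue).
PART_OF = "http://purl.org/vocab/frbr/core#partOf"


def find_multiple_part_of(triples):
    """Map each subject with more than one partOf target to its sorted targets."""
    containers = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == PART_OF:
            containers[subject].add(obj)
    # Keep only the subjects that point to two or more distinct containers.
    return {s: sorted(objs) for s, objs in containers.items() if len(objs) > 1}
```

Running this before and after a merge and comparing the two reports would separate pre-existing cases from those introduced by the merge.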

Errors Introduced by Merge

  1. Invalid Journal Issue Assignment:

    • Entity https://w3id.org/oc/meta/br/062103749535 now has a journal issue property, which is incorrect since it is a journal article. This error occurred due to merging with:
      • https://w3id.org/oc/meta/br/062503506057 (journal issue)
      • https://w3id.org/oc/meta/br/06601359707 (journal issue)
  2. Missing Identifiers:

    • Entities https://w3id.org/oc/meta/br/06501844005, https://w3id.org/oc/meta/br/062603894119, and https://w3id.org/oc/meta/br/061903886397 lack the property datacite:hasIdentifier. This issue could not be reproduced through synthetic tests on the same data.
  3. Missing Type:

    • Entity https://w3id.org/oc/meta/br/062103749314 was expected to be merged and deleted, but it persists without a type. The issue seems to be caused by a failed SPARQL update query, which removed only the type property while leaving other properties intact.
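
The missing-identifier and missing-type cases could be caught by a post-merge check along these lines. It is a minimal sketch under stated assumptions: triples are plain `(subject, predicate, object)` tuples, `datacite:hasIdentifier` is assumed to expand to the IRI shown, and `find_missing_properties` is a hypothetical helper, not part of the project's code.

```python
from collections import defaultdict

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Assumed expansion of datacite:hasIdentifier.
HAS_IDENTIFIER = "http://purl.org/spar/datacite/hasIdentifier"


def find_missing_properties(triples, entities, required=(RDF_TYPE, HAS_IDENTIFIER)):
    """Map each listed entity to the required predicates it does not have."""
    predicates = defaultdict(set)
    for subject, predicate, _ in triples:
        predicates[subject].add(predicate)

    report = {}
    for entity in entities:
        found = predicates.get(entity, set())
        missing = [p for p in required if p not in found]
        if missing:
            report[entity] = missing
    return report
```

Passing the list of entities explicitly means an entity with no remaining triples is reported as missing every required property rather than being silently skipped.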

Suggested Fixes

  • Merge Logic: Update the merge operation to enforce entity constraints when transferring properties, so that properties invalid for the surviving entity's type (for example, journal issue properties on a journal article) are not assigned.
  • Missing Identifiers: Investigate the cause of the missing identifiers, as the root of this issue remains unexplained.
  • Missing Type: Manually delete the remaining entity https://w3id.org/oc/meta/br/062103749314 to clean up the dataset.
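
One possible form of the merge-logic constraint is a type compatibility check that runs before any property is transferred. The sketch below is an assumption about how such a guard might look, not the project's implementation: the type IRIs, the rule that specific types must match exactly, and the names `IncompatibleTypesError` and `check_merge_allowed` are all hypothetical.

```python
# Assumed type IRIs; the generic type is ignored when comparing entities.
GENERIC_TYPE = "http://purl.org/spar/fabio/Expression"
JOURNAL_ARTICLE = "http://purl.org/spar/fabio/JournalArticle"
JOURNAL_ISSUE = "http://purl.org/spar/fabio/JournalIssue"


class IncompatibleTypesError(ValueError):
    """Raised when two entities should not be merged because their types differ."""


def check_merge_allowed(surviving_types, merged_types):
    """Raise if both entities carry specific types and those types differ."""
    surviving = set(surviving_types) - {GENERIC_TYPE}
    merged = set(merged_types) - {GENERIC_TYPE}
    # Assumed policy: an entity with no specific type is compatible with anything.
    if surviving and merged and surviving != merged:
        raise IncompatibleTypesError(
            f"cannot merge {sorted(merged)} into {sorted(surviving)}"
        )
```

With a guard like this, merging a journal issue into a journal article would be rejected up front instead of producing an entity with mixed properties.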

Additional Notes

Issues unrelated to the merge (e.g., multiple partOf relations) should be logged and addressed independently as part of broader dataset cleanup efforts.

arcangelo7 self-assigned this on Dec 7, 2024
arcangelo7 added the bug (Something isn't working) label on Dec 7, 2024
@arcangelo7
Collaborator Author

The following issues identified after the merge operation have been resolved manually:

  1. Missing Identifiers:

    • Entities that lacked the datacite:hasIdentifier property have been updated manually.
  2. Missing Type:

    • The entity https://w3id.org/oc/meta/br/062103749314, which persisted without a type after the merge, has been manually deleted to clean up the dataset.
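
The comment does not say how the manual deletion was carried out. As one hedged illustration, a SPARQL 1.1 Update request that removes every triple mentioning an entity, rather than a single property, could be built like this. The helper name `build_delete_entity_update` is hypothetical, and whether incoming triples should also be removed is an assumption.

```python
# Characters that may not appear inside a SPARQL IRI reference (<...>).
_FORBIDDEN = set('<>"{}|^`\\')


def build_delete_entity_update(entity_iri):
    """Return a SPARQL Update string deleting all triples that mention the entity."""
    if not entity_iri or any(ch in _FORBIDDEN or ord(ch) <= 0x20 for ch in entity_iri):
        raise ValueError(f"not a safe IRI: {entity_iri!r}")
    return (
        # Triples where the entity is the subject (type, identifiers, and so on).
        f"DELETE WHERE {{ <{entity_iri}> ?p ?o }} ;\n"
        # Triples where the entity is the object (assumed to be unwanted as well).
        f"DELETE WHERE {{ ?s ?p <{entity_iri}> }}"
    )
```

Sending both operations in a single request avoids the situation described above, where only the type was removed and the other properties were left behind, provided the triplestore applies the request as a whole.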
