Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review altlabels and hidden labels #555

Open
alliyya opened this issue May 5, 2022 · 2 comments
Open

Review altlabels and hidden labels #555

alliyya opened this issue May 5, 2022 · 2 comments
Labels
priority:routine project:CWRC Ontology Regarding the CWRC Ontology project:Genre Ontology Regarding the Genre Ontology status:help wanted type:extraction Related to the extraction process type:idea Idea that should be discussed type:task

Comments

@alliyya
Copy link
Member

alliyya commented May 5, 2022

These labels are used in the mapping for conversion so it would be good to assess whether or not they are appropriate matches or if a new term should be created.

This is a tedious task of going through each term and their altlabels and also looking through the ontology for better matches. But it will result in more accurate data extraction.

Example:
Advertising has some dubious alternative labels as publicity != advertising necessarily unless there's missing context on my end.

image

@alliyya alliyya added status:help wanted priority:routine project:CWRC Ontology Regarding the CWRC Ontology project:Genre Ontology Regarding the Genre Ontology type:extraction Related to the extraction process type:task type:idea Idea that should be discussed labels May 5, 2022
@alliyya
Copy link
Member Author

alliyya commented Oct 12, 2022

Also would be good to review hidden labels, as the data could be enhanced by adding an appropriate reg attribute instead of adding weird hidden label that doesn't add any meaning to the entity.

ex. cwrc:nursing
We have the following unsightly hidden labels:

fn helped found the east london nursing society.

in 1877 she nursed george odger, the first parliamentary labour candidate, during his fatal illness.

as hidden labels as a result of the following tags,

<SIGNIFICANTACTIVITY> <NAME STANDARD="Nightingale, Florence" REF="https://commons.cwrc.ca/orlando:55ac16d5-7dd1-40d4-95f5-2324fe707e70">FN</NAME> helped found the <ORGNAME STANDARD="East London Nursing Society" REF="https://commons.cwrc.ca/orlando:89689b91-a010-4e75-bcad-772eca6902ec">East London Nursing Society</ORGNAME>. </SIGNIFICANTACTIVITY>

<SIGNIFICANTACTIVITY>In 1877 she <SIGNIFICANTACTIVITY REG="nursing">nurse</SIGNIFICANTACTIVITY>d <NAME STANDARD="Odger, George" REF="https://commons.cwrc.ca/orlando:b428dd9d-c47a-41cd-a7de-e7d79b2c898f">George Odger</NAME>, the first parliamentary labour candidate, during his fatal illness.<BIBCITS> <BIBCIT PLACEHOLDER="DNB" DBREF="1759" REF="https://commons.cwrc.ca/orlando:b9ec4e5c-1b88-4be0-83b2-e9dddcb6b851"/> </BIBCITS> </SIGNIFICANTACTIVITY>

These tags both likely could be enhanced instead of being a hidden label for cwrc:nursing.

Work would be needed to go through hidden labels, make a judgement call and then investigate the context and then make additions to the Orlando Clean up spreadsheet as needed

@SusanBrown
Copy link
Contributor

Good idea. Aliza may be able to help with this kind of thing too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority:routine project:CWRC Ontology Regarding the CWRC Ontology project:Genre Ontology Regarding the Genre Ontology status:help wanted type:extraction Related to the extraction process type:idea Idea that should be discussed type:task
Projects
None yet
Development

No branches or pull requests

2 participants