Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add identifiers on wikidata for authors ( P6745) #552

Open
alliyya opened this issue Mar 15, 2022 · 3 comments
Open

Add identifiers on wikidata for authors ( P6745) #552

alliyya opened this issue Mar 15, 2022 · 3 comments
Labels
priority:low type:enhancement Enhancement Request type:extraction Related to the extraction process type:task

Comments

@alliyya
Copy link
Member

alliyya commented Mar 15, 2022

I thought these were mostly done already by external contributors and past summer students but there are still some big missing ones.

Query to see if there's a wikidata item with the Orlando author ID (P6745)

These can more easily be matched through Mix & Match:
Wikidata Mix & Match

Here's the list of entries missing a wikidata identifier in case Mix & Match isn't quite up to date:

  • angema: Maya Angelou
  • arenha: Hannah Arendt
  • barrma: Maria Barrell
  • blacan: Antoinette Brown Blackwell
  • butls2: Sarah Butler
  • carsan: Anne Carson
  • corpha: Harriet Corp
  • dacian: Anne Dacier
  • deevt: Teresa Deevye
  • ecclch: Charlotte O'Conor Eccles
  • equiol: Olaudah Equiano
  • gallma: Mavis Gallant
  • guesch: Charlotte Guest
  • huxlal: Aldous Huxley
  • kembad: Adelaide Kemble
  • kingan: Anna Kingsford
  • lestel: Elizabeth B. Lester
  • levyde: Deborah Levy
  • lingsh: Ling Shuhua
  • mackan: Anna Maria Mackenzie
  • makiba: Bathsua Makin
  • malelu: Lucas Malet
  • man_ju: Judith Man
  • manthi: Hilary Mantel
  • mastma: Mary Masters
  • mayofl: Flora Macdonald Mayor
  • mccuca: Carson McCullers
  • mcgume: Medbh McGuckian
  • mcwica: Candia McWilliam
  • middje: Jean Middlemass
  • mildgr: Grac Lady Mildmay
  • milnch: Christian Milne
  • minima: Margaret Minifie
  • moffgw: Gwen Moffat
  • mollma: Mary Mollineux
  • montc2: Charlotte Montefiore
  • mordel: Elinor Mordaunt
  • morema: Mary More
  • mortpe: Penelope Mortimer
  • moulma: Martha Moulsworth
  • muirwi: Willa Muir
  • nootch: Charlotte Nooth
  • nottka: Kathleen Nott
  • oaklan: Ann Oakley
  • ogilel: Eliza Ogilvy
  • oglean: Anne Ogle
  • okeead: Adelaide O'Keeffe
  • omanca: Carola Oman
  • oneifr: Frances O'Neill
  • opieam: Amelia Opie
  • oxlima: Mary Oxlie
  • oyeyhe: Helen Oyeyem
  • paderu: Ruth Padel
  • pagelo: Louise Page
  • palmma: Mary Palmer
  • pankch: Christabel Pankhurst
  • pankem: Emmeline Pankhurst
  • panksy: Sylvia Pankhurst
  • pantmo: Mollie Panter-Downes
  • pardju: Julia Pardoe
  • parkbe: Bessie Rayner Parkes
  • parkel: Elizabeth Mary Parker
  • parkem: Emma Parker
  • parkma: Mary Ann Parker
  • parsel: Eliza Parsons
  • pastge: George Paston
  • pearfr: Frances Mary Peard
  • pearsu: Sarah Pearson
  • peckwi: Winifred Peck
  • peisma: Mary Peisley
  • pethem: Emmeline Pethick-Lawrence
  • pfeiem: Emily Jane Pfeiffer
  • pilkla: Laetitia Pilkington
  • pinnw: Winsome Pinnocki
  • pittru: Ruth Pitter
  • platsy: Sylvia Plath
  • plumce: C. E. Plumptre
  • reynfr: Frances Reynolds
  • riddma: Maria Riddell
  • robiis: Isabella Hamilton Robinson
  • rossca: Catharine Colace Ross
  • russje: Jessie Russell
  • rymeja: James Malcolm Rymer
  • savama: Mary Savage
  • schugl: Gladys Henrietta Schütze
  • seniol: Olive Senior
  • shamka: Kamila Shamsie
  • sharja: Jane Sharp
  • shawhe: Hester Shaw
  • shorar: Arabella Shore
  • shutpe: Penelope Shuttle
  • sincca: Catherine Sinclair
  • sleael: Eleanor Sleath
  • slovgi: Gillian Slovo
  • smedco: Constance Smedley
  • smital: Ali Smith
  • smitdo: Dodie Smith
  • smitlu: Lucy Toulmin Smith
  • smytha: Harriet Smythies
  • smytsu: Susan Smythies
  • soutjo: Joanna Southcott
  • sowegi: Githa Sowerby
  • sparmu: Muriel Spark
  • spegra: Rachel Speght
  • spenel: Elizabeth Isabella Spence
  • spenem: Emily Spender
  • squija: Jane Squire
  • starma: Mariana Starke
  • steean: Anna Steele
  • stirel: Elizabeth Stirredge
  • stjoch: Christopher St John
  • stocma: Mary Stockdale
  • stonel: Elizabeth Stone
  • stotma: Mary Stott
  • straju: Julia Strachey
  • strara: Ray Strachey
  • streju: Julia Stretton
  • streno: Noel Streatfeild
  • striag: Agnes Strickland
  • striel: Elizabeth Strickland
  • struja: Jan Struther
  • sultma: Maud Sulter
  • sumble: Leah Sumbel
  • sutcal: Alice Sutcliffe
  • swana2: Annie S. Swan
  • swanan: Anna Swanwick
  • sykehe: Henrietta Sykes
  • tatlel: Eleanor Tatlock
  • taylha: Harriet Taylor
  • teftel: Elizabeth Teft
  • temped: Edith Templeton
  • tennem: Emma Tennant
  • thican: Ann Thicknesse
  • thiran: Angela Thirkell
  • thomdy: Dylan Thomas
  • thomfl: Flora Thompson
  • thoral: Alice Thornton
  • tinsan: Annie Tinsley
  • tippel: Elizabeth Tipper
  • tollel: Elizabeth Tollet
  • tomlel: Elizabeth Sophia Tomlins
  • townsu: Sue Townsend
  • trapan: Anna Trapnel
  • travre: Rebecca Travers
  • treeir: Iris Tree
  • treevi: Viola Tree
  • trefvi: Violet Trefusis
  • tremro: Rose Tremain
  • veitso: Sophie Veitch
  • wallan: Ann Wall
  • wastel: Elisabeth Wast

There's also one identifier that seems to possibly have 2 corresponding wikidata entries:

  • joscel: Elizabeth Joscelin

The property for Orlando author ID (P6745) is also a bit dated since the new interface is out, particularly the formatter URL and URL match pattern properties. Whether or not we want to update that ourselves or wait for another wikidata contributor to fix it might be another issue.

@alliyya alliyya added type:enhancement Enhancement Request priority:low type:extraction Related to the extraction process type:task labels Mar 15, 2022
@SusanBrown
Copy link
Contributor

I thought these were all done too!

The additions could be a good training assignment for one or more students.

Is there a programmatic way to update the properties? Does the divergence break the links from MixNMatch?

@alliyya
Copy link
Member Author

alliyya commented Mar 15, 2022

The Orlando author ID property links out to the older URL, and mix and match uses that https://orlando.cambridge.org/protected/svPeople?formname=r&subform=1&person_id=angema&adt_start_year=0612&crumbtrail=on&dt_end_cal=AD&dt_end_day=05&dt_end_month=11&dt_end_year=2018&dt_start_cal=BC&dts_historical=0612--+BC%3A2018-10-19&dts_lives=0612--+BC%
image

Only the property page would need to be updated in those 2 fields and then it would be updated every else.

@alliyya
Copy link
Member Author

alliyya commented Mar 15, 2022

Looking more at the mix and match results because the numbers aren't adding up. There's >100 writers that I'm not seeing identifiers for in wikidata, but mix/match says there's only 26 that haven't been reconciled? Maybe some weird syncing thing on their end?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority:low type:enhancement Enhancement Request type:extraction Related to the extraction process type:task
Projects
None yet
Development

No branches or pull requests

2 participants