Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finding all "Early Print"-IDs for Shakespeare plays #1

Open
peertrilcke opened this issue Apr 8, 2022 · 6 comments
Open

Finding all "Early Print"-IDs for Shakespeare plays #1

peertrilcke opened this issue Apr 8, 2022 · 6 comments
Assignees

Comments

@peertrilcke
Copy link
Member

No description provided.

@lucagiovannini7
Copy link
Member

Bestand: 27 Dramen (4 Suchergebnisse waren Gedichte)

earlyprint_shakespeare.csv

@lehkost lehkost reopened this Apr 13, 2022
@lehkost
Copy link
Member

lehkost commented Apr 13, 2022

What's with the rest of the 37 canonic Shakespeare plays? Are they just not there? If so, why are 10 missing?

@lucagiovannini7
Copy link
Member

lucagiovannini7 commented Apr 16, 2022

An Early Print query with "author" = "Shakespeare, William" returns 31 hits (27 plays, including the First Folio of 1623 [A11954], which in turns contains 36 plays). The metadata list provided by Ingo has 36 hits for Shakespeare or "W.S.", but some of them refer to the Shakespearean apocrypha (e.g. The Puritan [A11264]). A follow-up question would be: are we planning to eventually merge ShakeDraCor into EngDraCor, or keep it as a separate corpus? In the second option, Shakespearean texts might be excluded from EngDraCor.

@lehkost
Copy link
Member

lehkost commented Apr 16, 2022

Oh, interesting, thanks! Would be good to know which plays of the Shakespeare canon are missing (of if they just have wrong author information).

As for the other question, I'd say we keep ShakeDraCor separate as it's a whole other edition with other encoding. In parallel, we established GerShDraCor for a parallel corpus of the German Shakespeare translations by Schlegel/Tieck. It's still in beta, but not much left to do to release it officially, see here.

@lucagiovannini7
Copy link
Member

In principle, all plays in ShakeDraCor (with the exception of Pericles) are also included in EarlyPrint via the First Folio (https://texts.earlyprint.org/works/A11954.xml).
As "single-file" plays, the following are missing: The Tempest, Two Gentlemen of Verona, Measure for Measure, The Comedy of Errors, A Midsummer Night's Dream, As You Like It, All's Well That End Well, Twelfth Night, The Winter's Tale, King John, Henry VI/Part 1, Henry VIII, Coriolanus, Timon of Athens, Macbeth, Antony and Cleopatra, Cymbeline.

@lehkost
Copy link
Member

lehkost commented Apr 19, 2022

Thanks for looking up the details! We had the same problem with GerDraCor when we tried to understand how to cut out plays from TextGrid Repository, see here. Files containing multiple plays were identified by the <teiCorpus> element and we divided them into single files (one per play). If we do the same for EngDraCor, we would have to decide which version of a Shakespeare play we want to have in our corpus (the standalone versions, or the Folio versions, assuming that they are different?) – we can also have different editions/versions of a play in the corpus, as is the case in ItaDraCor and FreDraCor.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants