-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AutoMeta tool for reference #1
Comments
The group behind AutoMeta has published some papers about theses and dissertations. Here is a presentation about ETDSuite/ETDMiner (a library including AutoMeta), which aims to segment and parse theses and dissertations. The segmentation employs a multimodal model to classify pages into 13 categories. |
The poster paper "A Heuristic Baseline Method for Metadata Extraction from Scanned Electronic Theses and Dissertations" is most relevant for just metadata extraction, although from 2020:
|
This more recent paper by (mostly) the same authors goes into more detail: |
Found this AutoMeta tool for metadata extraction:
We could check the quality of its results compare to Meteor and LLM based methods.
The text was updated successfully, but these errors were encountered: