How to improve detection of sections? #65

ldt · 2023-12-26T16:14:35Z

Hi,

Congrats for your great work and beautiful API!

I'm especially interested in using it to create a hierarchical document based on the original PDF.
My issue is that some sections are not correctly identified.

For example in your papermage.pdf file, the 2nd section is mixed with the 2.1 section:

And the title of the 3.3 section is partially identified:

I have similar issues on some of my documents.

I would like to know how it could be improved. Could it be more trained if there was a training set of documents with the correct sections that were pre-identified?

Let me know how I could help, the topic is really interesting!

MpLebron · 2024-03-24T12:15:31Z

I have the same problem! Hope the authors can sovle this!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to improve detection of sections? #65

How to improve detection of sections? #65

ldt commented Dec 26, 2023 •

edited

Loading

MpLebron commented Mar 24, 2024

How to improve detection of sections? #65

How to improve detection of sections? #65

Comments

ldt commented Dec 26, 2023 • edited Loading

MpLebron commented Mar 24, 2024

ldt commented Dec 26, 2023 •

edited

Loading