Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhancement: Report invalid unicode sequences encountered in PDF #815

Open
markpenny opened this issue Jan 18, 2023 · 5 comments
Open

Enhancement: Report invalid unicode sequences encountered in PDF #815

markpenny opened this issue Jan 18, 2023 · 5 comments
Assignees
Labels
enhancement New feature or request

Comments

@markpenny
Copy link
Collaborator

If a dotted circle character appears in the output PDF, it indicates the presence of dirty data, or broken output data. It would be extremely useful to alert the user to such problems, preferably by scanning the PDF for the glyph “◌” (U+25CC) and reporting which pages it occurs on. I would expect this to be an option on the Finishing tab, or possibly on the Advanced tab, which would be OFF by default, but when turned on would produce a status bar warning message, or even an error dialog.

@markpenny markpenny added the enhancement New feature or request label Jan 18, 2023
@mhosken
Copy link
Collaborator

mhosken commented Feb 6, 2023

I wonder if we need a "Test" tab in which people can press fun buttons like: scan PDF for dotted circles, or search for missing fonts, or things like that and a process runs and gives a report in that tab.

@mhosken
Copy link
Collaborator

mhosken commented Apr 26, 2023

This is quite a bit of work since the fonts in the PDF may not include a 25CC cmap entry. We would need to parse the font files to get the GID and then hope that the GIDs are not recoded by xdvipdfm-x or xetex.

@markpenny
Copy link
Collaborator Author

@mhosken Given that you're now doing a lot of this kind of scanning of the XDV file, is this a good time to add something like this to the Finishing tab? "Check for cracks" / "Test" PDF.

@mhosken
Copy link
Collaborator

mhosken commented Oct 23, 2024

This is part of #995

@markpenny
Copy link
Collaborator Author

We should also parse the log file for "missing character in font" messages and present these to the user.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants