Refactor decoding format, escape unconvertable #325

AustinMroz · 2024-11-29T16:57:32Z

I've had an underlying concern for a while now that universally using utf-8 isn't correct because it doesn't respect the locale, and this serves as fairly strong confirmation. From a little bit of digging, checking seems as simple as calling `locale.getencoding()`, but the documentation claims that this is the ANSI code page on Windows (meaning changing it would affect the majority of users). As a result, I've refactored all the string decoding to use a common variable and set it to display an escaped version of any unconvertible characters, but am leaving the format as utf-8 until I have further information.

See #324
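
For anyone reading along, here is a rough sketch of the pattern described above, not the actual diff. The names `ENCODING`, `DECODE_ERRORS`, `decode_output`, and `locale_encoding` are hypothetical, and `backslashreplace` is an assumption about which error handler produces the "escaped version"; the PR text does not name one.

```python
import locale

# Hypothetical shared settings (names are not taken from the PR).
# The format stays utf-8 for now, as described above.
ENCODING = "utf-8"
# Assumption: "backslashreplace" is one standard handler that shows
# undecodable bytes as escapes instead of raising UnicodeDecodeError.
DECODE_ERRORS = "backslashreplace"


def decode_output(raw: bytes) -> str:
    """Decode bytes with the shared encoding, escaping bad bytes."""
    return raw.decode(ENCODING, errors=DECODE_ERRORS)


def locale_encoding() -> str:
    """Report the locale's encoding without switching to it."""
    # locale.getencoding() was added in Python 3.11; fall back on older versions.
    if hasattr(locale, "getencoding"):
        return locale.getencoding()
    return locale.getpreferredencoding(False)


if __name__ == "__main__":
    # b"\xe9" is "é" in Latin-1 / cp1252 but is not valid utf-8 on its own.
    print(decode_output(b"caf\xe9"))
    print(locale_encoding())
```

Keeping the encoding in one place means a later switch to the locale's encoding would be a single-line change, should that turn out to be the right call.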

AustinMroz merged commit b0f9796 into main Nov 29, 2024
