Skip to content

unicodedata with extensions for additional properties

License

Notifications You must be signed in to change notification settings

iwsfutcmd/unicodedataplus

 
 

Repository files navigation

Github CI Status PyPI

unicodedataplus

Drop-in replacement for unicodedata with extensions for additional Unicode properties.

Currently supported additional Unicode properties:

  • Script: script(chr)
  • Script Extensions: script_extensions(chr)
  • Block: block(chr)
  • Indic Conjunct Break: indic_conjunct_break(chr)
  • Indic Positional Category: indic_positional_category(chr)
  • Indic Syllabic Category: indic_syllabic_category(chr)
  • Grapheme Cluster Break: grapheme_cluster_break(chr)
  • Vertical Orientation: vertical_orientation(chr)
  • Age: age(chr)
  • Total Strokes (CJK): total_strokes(chr, source='G')
    • source='G' = Simplified stroke count. source='T' = Traditional stroke count.
  • Emoji: is_emoji(chr)
  • Emoji Presentation: is_emoji_presentation(chr)
  • Emoji Modifier: is_emoji_modifier(chr)
  • Emoji Modifier Base: is_emoji_modifier_base(chr)
  • Emoji Component: is_emoji_component(chr)
  • Extended Pictographic: is_extended_pictographic(chr)

Additionally, two dictionaries (property_value_aliases and property_value_by_alias) are provided for Property Value Alias lookup.

The versions of this package match unicode versions, so unicodedataplus==16.0.0 is data from unicode 16.0.0.

Forked from https://github.com/mikekap/unicodedata2

About

unicodedata with extensions for additional properties

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C 98.1%
  • Python 1.9%