Drop-in replacement for unicodedata
with extensions for additional Unicode properties.
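As a minimal sketch of what "drop-in" means in practice, the module can be imported under the standard library's name. The fallback to the built-in unicodedata is an assumption added here for illustration, so the snippet also runs where the package is not installed. Only standard unicodedata functions are called.

```python
# Prefer unicodedataplus when it is installed; otherwise use the standard library.
# The fallback is an illustrative assumption, not something the package requires.
try:
    import unicodedataplus as unicodedata
except ImportError:
    import unicodedata

# These calls belong to the standard unicodedata API, so they behave the same
# with either module.
name = unicodedata.name("A")
category = unicodedata.category("A")
composed = unicodedata.normalize("NFC", "e\u0301")  # e + combining acute accent

print(name, category, composed)
```

Code written against the standard module should keep working after the import line is swapped.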
Currently supported additional Unicode properties:
- Script: script(chr)
- Script Extensions: script_extensions(chr)
- Block: block(chr)
- Indic Conjunct Break: indic_conjunct_break(chr)
- Indic Positional Category: indic_positional_category(chr)
- Indic Syllabic Category: indic_syllabic_category(chr)
- Grapheme Cluster Break: grapheme_cluster_break(chr)
- Vertical Orientation: vertical_orientation(chr)
- Age: age(chr)
- Total Strokes (CJK): total_strokes(chr, source='G')
  - source='G' returns the Simplified Chinese stroke count; source='T' returns the Traditional Chinese stroke count.
- Emoji: is_emoji(chr)
- Emoji Presentation: is_emoji_presentation(chr)
- Emoji Modifier: is_emoji_modifier(chr)
- Emoji Modifier Base: is_emoji_modifier_base(chr)
- Emoji Component: is_emoji_component(chr)
- Extended Pictographic: is_extended_pictographic(chr)
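The functions listed above each take a single character. The following is a rough sketch of calling a few of them. The helper extended_properties is hypothetical (not part of the package), and the guard around the import is an assumption so the snippet degrades gracefully when unicodedataplus is not installed. Return values are not shown because their exact form depends on the installed version.

```python
# The extension functions exist only in unicodedataplus, not in the standard library.
try:
    import unicodedataplus as udp
except ImportError:
    udp = None


def extended_properties(ch):
    """Hypothetical helper: gather a few extension properties for one character.

    Returns None when unicodedataplus is not installed.
    """
    if udp is None:
        return None
    return {
        "script": udp.script(ch),
        "script_extensions": udp.script_extensions(ch),
        "block": udp.block(ch),
        "age": udp.age(ch),
        "is_emoji": udp.is_emoji(ch),
    }


# A Latin letter, a Devanagari letter, and an emoji.
for ch in ("A", "\u0915", "\U0001F600"):
    print(repr(ch), extended_properties(ch))
```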
Additionally, two dictionaries (property_value_aliases and property_value_by_alias) are provided for Property Value Alias lookup.
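The layout of these two dictionaries is not described here, so the sketch below only inspects their top-level keys instead of assuming a nested structure. The empty-dictionary fallback is an assumption added so the snippet runs without the package installed.

```python
# Import the two alias dictionaries; fall back to empty dicts if the package is missing.
try:
    from unicodedataplus import property_value_aliases, property_value_by_alias
except ImportError:
    property_value_aliases = {}
    property_value_by_alias = {}

# List the top-level keys of each dictionary to see what can be looked up.
alias_keys = sorted(property_value_aliases, key=str)
by_alias_keys = sorted(property_value_by_alias, key=str)

print("property_value_aliases keys:", alias_keys[:10])
print("property_value_by_alias keys:", by_alias_keys[:10])
```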
Package versions match Unicode versions, so unicodedataplus==16.0.0 contains data from Unicode 16.0.0.
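One way to confirm which Unicode data is in use is to read unidata_version, an attribute of the standard unicodedata API that a drop-in replacement is expected to expose as well. This is a sketch under that assumption. The fallback import and the installed-package lookup are illustrative additions.

```python
from importlib import metadata

# Use unicodedataplus when available, else the standard library module.
try:
    import unicodedataplus as unicodedata
except ImportError:
    import unicodedata

# Version of the Unicode Character Database backing the imported module.
data_version = unicodedata.unidata_version

# Version of the installed unicodedataplus distribution, if any.
try:
    package_version = metadata.version("unicodedataplus")
except metadata.PackageNotFoundError:
    package_version = None

print("Unicode data version:", data_version)
print("unicodedataplus package version:", package_version)
```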
Forked from https://github.com/mikekap/unicodedata2