Skip to content

MotorolaMobilityLLC/external-hyphenation-patterns

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

78 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

URL: https://github.com/hyphenation/tex-hyphen
Version: ecf976ab6995acb653d38ab1af0b9b9829ec0c77
License: various
License File: NOTICE

The tex-hyphen package contains hyphenation patterns under several different licenses.
Patterns licensed under the following open source licenses can be imported here:
* Apache 2.0
* Unlicense
* MIT
* BSD
* MPL
* LGPL 2.1

Description:
Hyphenation patterns for automatic hyphenation from TeX hyphenation patterns
project at http://www.hyphenation.org.

A few languages are from other sources with the individual repositories as listed below:
Czech (cs) - https://github.com/tensojka/cshyphen
Dutch (nl) - https://github.com/OpenTaal/opentaal-hunspell
Swedish (sv) - https://cgit.freedesktop.org/libreoffice/dictionaries/tree/sv_SE/hyph_sv.dic
Russian (ru) - https://github.com/laboratory50/russian-spellpack

Local Modifications:
The patterns are taken from the hyph-utf8/tex/generic/hyph-utf8/patterns/txt directory.

Only the following patterns are included from that directory:

* Amharic (am)
* Armenian (hy)
* Assamese (as)
* Basque (eu)
* Belarusian (be)
* Bengali (bn)
* Bulgarian (bg)
* Church Slavonic (cu)
* Croation (hr)
* Danish (da)
* Estonian (et)
* Ethiopic script (und-Ethi)
* French (fr)
* Galician (gl)
* German (de)
* Gujarati (gu)
* Hindi (hi)
* Hungarian (hu)
* Irish (ga)
* Kannada (kn)
* Latin (la)
* Latvian (lv)
* Lithuanian (lt)
* Malayalam (ml)
* Marathi (mr)
* Mongolian written in Cyrillic (mn-cyrl)
* Norwegian Bokmål (nb)
* Norwegian Nynorsk (nn)
* Oriya (or)
* Portuguese (pt)
* Punjabi (pa)
* Sanskrit (sa)
* Slovenian (sl)
* Spanish (es)
* Tamil (ta)
* Telugu (te)
* Turkmen (tk)
* Ukrainian (uk)
* US English (en-US)
* Welsh (cy)

The Ethiopic script pattern files have been renamed from mul-ethi to und-ethi,
to properly follow BCP 47 naming.

In case hyphenation patterns are licensed under dual licenses,
hyph-{lang}.lic.txt files are edited to keep only the license Android applies
as listed in the corresponding NOTICE files.

For the German (de) patterns, the *.chr.txt has been automatically generated
based on the contents of the pattern files, using the the create_chr.py script
created by Android engineers.

British English (en-GB) patterns are taken from a different part of the
tex-hyphen repository. Please see the README.android file in the 'en-GB'
directory.

US English (en-US) patterns have been modified to remove patterns added to
the end of the file that had been incorrectly computed based on the TUGboat
hyphenation exception list. To compensate for that, the TUGboat hyphenation
exception list for US English has been downloaded from
<http://mirror.ctan.org/info/digests/tugboat/hyphenex/ushyphex.tex> and
explicitly added to the end of the hyphenation exception file for US
English.

Swedish (sv) patterns are converted to utf-8 and the first three
lines(encoding, leftmin, rightmin) are removed to preserve the hyphenation
pattern only in *.pat.txt

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published