-
Notifications
You must be signed in to change notification settings - Fork 8
/
README
71 lines (48 loc) · 1.86 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
===========================================================
Unicode-LineBreak - UAX #14 Unicode Line Breaking Algorithm
===========================================================
Unicode-LineBreak Package is Copyright (C) 2009-2018,
by Hatuka*nezumi - IKEDA Soji.
This package is free software; you can redistribute it and/or modify it
under the same terms as Perl itself.
Prerequisites
=============
Perl 5.8.0 or later is required.
Sombok library package is required. If Sombok had not been installed,
bundled source will be used.
https://sourceforge.net/projects/linefold/files/
Optionally, LibThai package is needed to support Thai word segmentation:
http://linux.thai.net/projects/libthai/
Additionally, pkg-config will be required for libthai and/or shared
sombok library.
Install
=======
To build and install Unicode-LineBreak package, do:
$ perl Makefile.PL
$ make
$ make test
# make install
If you wish to disable libthai feature explicitly, do:
$ perl Makefile.PL --with-bundled-sombok --disable-libthai
$ make
$ make test
# make install
Documentations
==============
Three main modules and some supporting program files are contained. For
more details read following POD documentations:
Text::LineFold - Line Folding for Plain Text
Unicode::GCString - String as Sequence of UAX #29 Grapheme Clusters
Unicode::LineBreak - UAX #14 Unicode Line Breaking Algorithm
For japonophones, PODs in Japanese language are also included:
POD2::JA::Text::LineFold - プレインテキストの行折り
POD2::JA::Unicode::GCString - UAX #29 書記素クラスタの列としての文字列
POD2::JA::Unicode::LineBreak - UAX #14 Unicode 行分割アルゴリズム
For Other Language
==================
Python
pytextseg: http://pypi.python.org/pypi/pytextseg/
Author
======
Hatuka*nezumi - IKEDA Soji <hatuka(at)nezumi.nu>.
$$