|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
See:
Description
| Class Summary | |
|---|---|
| DefaultICUTokenizerConfig | Default ICUTokenizerConfig that is generally applicable
to many languages. |
| ICUTokenizer | Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/) |
| ICUTokenizerConfig | Class that allows for tailored Unicode Text Segmentation on a per-writing system basis. |
| LaoBreakIterator | Syllable iterator for Lao text. |
Tokenizer that breaks text into words with the Unicode Text Segmentation algorithm.
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||