Character properties and textual analysis.
Modules§
- cluster
- Script aware cluster segmentation.
Structs§
- Analyze
- Iterator that yields Unicode properties and boundary analysis. This iterator is created by the analyze function.
- Decompose
- Decomposition of a character.
- Language
- Representation of a language and its associated script and region.
- Properties
- Compact, constant time reference to Unicode properties for a character.
Enums§
- BidiClass
- Property for the bidirectional algorithm.
- Block
- Unicode block property.
- BracketType
- Bracket type of a character.
- Category
- Unicode general category property.
- Cjk
- Chinese, Japanese and Korean languages.
- ClusterBreak
- Property for the cluster breaking algorithm.
- JoiningType
- Joining type property.
- LineBreak
- Property for the line breaking algorithm.
- Script
- Unicode script property.
- WordBreak
- Property for the word breaking algorithm.
- WordBreakStrength
- Word breaking strength (corresponds to https://drafts.csswg.org/css-text/#word-break-property).
Constants§
- UNICODE_VERSION
- The version of the Unicode Character Database used to generate character properties.
Traits§
- Codepoint
- Trait that exposes Unicode properties for the char type.
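The pattern of a trait that adds property lookups to `char` can be sketched with the standard library alone. The names `SimpleCodepoint`, `SimpleCategory`, and `simple_category` below are hypothetical and exist only for illustration; they are not this module's `Codepoint` trait or `Category` enum, and the classification relies on `char`'s built-in predicates rather than this crate's property tables.

```rust
/// Hypothetical, coarse classification used only for this sketch.
/// It is not the module's `Category` enum.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum SimpleCategory {
    Alphabetic,
    Numeric,
    Whitespace,
    Other,
}

/// Hypothetical extension trait: implementing it for `char` makes the
/// lookup available as a method on any character value.
pub trait SimpleCodepoint {
    fn simple_category(self) -> SimpleCategory;
}

impl SimpleCodepoint for char {
    fn simple_category(self) -> SimpleCategory {
        // Standard-library predicates stand in for real property tables.
        if self.is_alphabetic() {
            SimpleCategory::Alphabetic
        } else if self.is_numeric() {
            SimpleCategory::Numeric
        } else if self.is_whitespace() {
            SimpleCategory::Whitespace
        } else {
            SimpleCategory::Other
        }
    }
}
```

With the trait in scope, a call such as `'a'.simple_category()` reads like a built-in method on `char`, which is the ergonomic point of the extension-trait approach.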
Functions§
- analyze
- Returns an iterator yielding Unicode properties and boundary analysis for each character in the specified sequence.
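The general shape of such a function, an adapter that takes a character sequence and yields one analysis result per character, can be sketched as follows. The function `analyze_simple` and its `(char, bool)` item type are assumptions made for this example, not the signature of `analyze`, and the boundary rule here (a word starts at the first non-whitespace character after whitespace) is a deliberately naive stand-in for the Unicode segmentation algorithms.

```rust
/// Hypothetical sketch: yields each character together with a flag that
/// marks whether it starts a whitespace-delimited word.
/// This is not the module's `analyze` function.
pub fn analyze_simple<I>(chars: I) -> impl Iterator<Item = (char, bool)>
where
    I: IntoIterator<Item = char>,
{
    // Treat the start of input as if it followed whitespace, so the
    // first non-whitespace character is flagged as a word start.
    let mut prev_was_space = true;
    chars.into_iter().map(move |ch| {
        let is_space = ch.is_whitespace();
        let starts_word = prev_was_space && !is_space;
        prev_was_space = is_space;
        (ch, starts_word)
    })
}
```

Because the result is a lazy iterator, callers can consume it with a `for` loop or collect it, for example `analyze_simple("hi yo".chars()).collect::<Vec<_>>()`.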