ICU 4.8.1.1
4.8.1.1
|
Normalization filtered by a UnicodeSet. More...
#include <normalizer2.h>
Public Member Functions | |
FilteredNormalizer2 (const Normalizer2 &n2, const UnicodeSet &filterSet) | |
Constructs a filtered normalizer wrapping any Normalizer2 instance and a filter set. | |
virtual UnicodeString & | normalize (const UnicodeString &src, UnicodeString &dest, UErrorCode &errorCode) const |
Writes the normalized form of the source string to the destination string (replacing its contents) and returns the destination string. | |
virtual UnicodeString & | normalizeSecondAndAppend (UnicodeString &first, const UnicodeString &second, UErrorCode &errorCode) const |
Appends the normalized form of the second string to the first string (merging them at the boundary) and returns the first string. | |
virtual UnicodeString & | append (UnicodeString &first, const UnicodeString &second, UErrorCode &errorCode) const |
Appends the second string to the first string (merging them at the boundary) and returns the first string. | |
virtual UBool | getDecomposition (UChar32 c, UnicodeString &decomposition) const |
Gets the decomposition mapping of c. | |
virtual UBool | isNormalized (const UnicodeString &s, UErrorCode &errorCode) const |
Tests if the string is normalized. | |
virtual UNormalizationCheckResult | quickCheck (const UnicodeString &s, UErrorCode &errorCode) const |
Tests if the string is normalized. | |
virtual int32_t | spanQuickCheckYes (const UnicodeString &s, UErrorCode &errorCode) const |
Returns the end of the normalized substring of the input string. | |
virtual UBool | hasBoundaryBefore (UChar32 c) const |
Tests if the character always has a normalization boundary before it, regardless of context. | |
virtual UBool | hasBoundaryAfter (UChar32 c) const |
Tests if the character always has a normalization boundary after it, regardless of context. | |
virtual UBool | isInert (UChar32 c) const |
Tests if the character is normalization-inert. |
Normalization filtered by a UnicodeSet.
Normalizes portions of the text contained in the filter set and leaves portions not contained in the filter set unchanged. Filtering is done via UnicodeSet::span(..., USET_SPAN_SIMPLE). Not-in-the-filter text is treated as "is normalized" and "quick check yes". This class implements all of (and only) the Normalizer2 API. An instance of this class is unmodifiable/immutable but is constructed and must be destructed by the owner.
Definition at line 315 of file normalizer2.h.
FilteredNormalizer2::FilteredNormalizer2 | ( | const Normalizer2 & | n2, |
const UnicodeSet & | filterSet | ||
) | [inline] |
Constructs a filtered normalizer wrapping any Normalizer2 instance and a filter set.
Both are aliased and must not be modified or deleted while this object is used. The filter set should be frozen; otherwise the performance will suffer greatly.
n2 | wrapped Normalizer2 instance |
filterSet | UnicodeSet which determines the characters to be normalized |
Definition at line 327 of file normalizer2.h.
virtual UnicodeString& FilteredNormalizer2::append | ( | UnicodeString & | first, |
const UnicodeString & | second, | ||
UErrorCode & | errorCode | ||
) | const [virtual] |
Appends the second string to the first string (merging them at the boundary) and returns the first string.
The result is normalized if both the strings were normalized. The first and second strings must be different objects.
first | string, should be normalized |
second | string, should be normalized |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.
virtual UBool FilteredNormalizer2::getDecomposition | ( | UChar32 | c, |
UnicodeString & | decomposition | ||
) | const [virtual] |
Gets the decomposition mapping of c.
Equivalent to normalize(UnicodeString(c)) on a UNORM2_DECOMPOSE Normalizer2 instance, but much faster. This function is independent of the mode of the Normalizer2.
c | code point |
decomposition | String object which will be set to c's decomposition mapping, if there is one. |
Implements Normalizer2.
virtual UBool FilteredNormalizer2::hasBoundaryAfter | ( | UChar32 | c | ) | const [virtual] |
Tests if the character always has a normalization boundary after it, regardless of context.
For details see the Normalizer2 base class documentation.
c | character to test |
Implements Normalizer2.
virtual UBool FilteredNormalizer2::hasBoundaryBefore | ( | UChar32 | c | ) | const [virtual] |
Tests if the character always has a normalization boundary before it, regardless of context.
For details see the Normalizer2 base class documentation.
c | character to test |
Implements Normalizer2.
virtual UBool FilteredNormalizer2::isInert | ( | UChar32 | c | ) | const [virtual] |
Tests if the character is normalization-inert.
For details see the Normalizer2 base class documentation.
c | character to test |
Implements Normalizer2.
virtual UBool FilteredNormalizer2::isNormalized | ( | const UnicodeString & | s, |
UErrorCode & | errorCode | ||
) | const [virtual] |
Tests if the string is normalized.
For details see the Normalizer2 base class documentation.
s | input string |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.
virtual UnicodeString& FilteredNormalizer2::normalize | ( | const UnicodeString & | src, |
UnicodeString & | dest, | ||
UErrorCode & | errorCode | ||
) | const [virtual] |
Writes the normalized form of the source string to the destination string (replacing its contents) and returns the destination string.
The source and destination strings must be different objects.
src | source string |
dest | destination string; its contents is replaced with normalized src |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.
virtual UnicodeString& FilteredNormalizer2::normalizeSecondAndAppend | ( | UnicodeString & | first, |
const UnicodeString & | second, | ||
UErrorCode & | errorCode | ||
) | const [virtual] |
Appends the normalized form of the second string to the first string (merging them at the boundary) and returns the first string.
The result is normalized if the first string was normalized. The first and second strings must be different objects.
first | string, should be normalized |
second | string, will be normalized |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.
virtual UNormalizationCheckResult FilteredNormalizer2::quickCheck | ( | const UnicodeString & | s, |
UErrorCode & | errorCode | ||
) | const [virtual] |
Tests if the string is normalized.
For details see the Normalizer2 base class documentation.
s | input string |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.
virtual int32_t FilteredNormalizer2::spanQuickCheckYes | ( | const UnicodeString & | s, |
UErrorCode & | errorCode | ||
) | const [virtual] |
Returns the end of the normalized substring of the input string.
For details see the Normalizer2 base class documentation.
s | input string |
errorCode | Standard ICU error code. Its input value must pass the U_SUCCESS() test, or else the function returns immediately. Check for U_FAILURE() on output or use with function chaining. (See User Guide for details.) |
Implements Normalizer2.