Call for feedback on UTS #18: Unicode Regular Expressions
Mark Davis ☕️ via Unicode
unicode at unicode.org
Thu Jan 2 13:22:00 CST 2020
The line just above that is:
Name matching rules follow Matching Rules
<http://unicode.org/reports/tr44/#Matching_Rules> from [UAX44#UAX44-LM2
The deletion was based on feedback that the deleted text was a recap of the
above line, but a recap that didn't have precisely the same description.
It's best to point to the exact description, and have that be in one place.
On Thu, Jan 2, 2020 at 6:40 PM Karl Williamson via Unicode <
unicode at unicode.org> wrote:
> One thing I noticed in reviewing this is the removal of text about loose
> matching of the name property. But I didn't see an explanation for this
> removal. Please point me to the explanation, or tell me what it is.
> Specifically these lines were removed:
> As with other property values, names should use a loose match,
> disregarding case, spaces and hyphen (the underbar character "_" cannot
> occur in Unicode character names). An implementation may also choose to
> allow namespaces, where some prefix like "LATIN LETTER" is set globally
> and used if there is no match otherwise.
> There are, however, three instances that require special-casing with
> loose matching, where an extra test shall be made for the presence or
> absence of a hyphen.
> U+0F68 TIBETAN LETTER A and
> U+0F60 TIBETAN LETTER -A
> U+0FB8 TIBETAN SUBJOINED LETTER A and
> U+0FB0 TIBETAN SUBJOINED LETTER -A
> U+116C HANGUL JUNGSEONG OE and
> U+1180 HANGUL JUNGSEONG O-E
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Unicode