Can NFKC turn valid UAX 31 identifiers into non-identifiers?
Richard Wordingham via Unicode
unicode at unicode.org
Thu Jun 7 08:08:48 CDT 2018
On Thu, 7 Jun 2018 13:32:13 +0200
Joan Montané via Unicode <unicode at unicode.org> wrote:
> 2018-06-04 21:49 GMT+02:00 Manish Goregaokar via Unicode <
> unicode at unicode.org>:
> * Ŀ, LATIN CAPITAL LETTER L WITH MIDDLE DOT <U+013F> NFKC decomposes
> to LATIN CAPITAL LETTER L (U+004C) MIDDLE DOT (U+00B7): <L,·>
> * ŀ, LATIN SMALL LETTER L WITH MIDDLE DOT <U+0140> NFKC decomposes to
> LATIN SMALL LETTER L (U+006C) MIDDLE DOT (U+00B7): <l,·>
This is only a problem if U+00B7 has a role of its own in Rust's
syntax. U+00B7 has the properties ID_Continue and XID_Continue, so the
NFKC forms <L,·> and <l,·> are still valid UAX 31 identifiers and
there is no formal problem.
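
The point can be checked with a short Python sketch. It rests on an
assumption that is not from this thread: Python's str.isidentifier()
is used as a stand-in for a UAX 31 default identifier test based on
XID_Start and XID_Continue. It says nothing about how Rust's own lexer
treats U+00B7. The helper name nfkc_report is hypothetical.

```python
import unicodedata


def nfkc_report(text):
    """Return (NFKC form, identifier before?, identifier after?).

    Assumption: str.isidentifier() approximates the UAX 31 default
    identifier syntax (XID_Start followed by XID_Continue*). It is not
    a model of Rust's lexer.
    """
    normalized = unicodedata.normalize("NFKC", text)
    return normalized, text.isidentifier(), normalized.isidentifier()


# The two characters from the quoted message.
for ch in ("\u013F", "\u0140"):
    normalized, before, after = nfkc_report(ch)
    codepoints = " ".join(f"U+{ord(c):04X}" for c in normalized)
    print(f"U+{ord(ch):04X} -> {codepoints}; "
          f"identifier before: {before}, after: {after}")

# U+00B7 alone is not an identifier here, since it is only a
# continuation character, not a start character.
print("U+00B7 alone:", "\u00B7".isidentifier())
```

The check is applied to the whole normalized string because U+00B7
can only continue an identifier, never start one.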
Richard.