Can NFKC turn valid UAX 31 identifiers into non-identifiers?

Richard Wordingham via Unicode unicode at unicode.org
Thu Jun 7 08:08:48 CDT 2018


On Thu, 7 Jun 2018 13:32:13 +0200
Joan Montané via Unicode <unicode at unicode.org> wrote:

> 2018-06-04 21:49 GMT+02:00 Manish Goregaokar via Unicode <
> unicode at unicode.org>:  

> * Ŀ, LATIN CAPITAL LETTER L WITH MIDDEL DOT <U+013F> NFKC decomposes
> to LATIN CAPITAL LETTER L (U+004C) MIDDLE DOT (U+00B7): <L,·>
> * ŀ, LATIN SMALL LETTER L WITH MIDDLE DOT <U+0140> NFKC decomposes to
> LATIN SMALL LETTER L (U+006C) MIDDLE DOT (U+00B7): <l,·>

This is only a problem if U+00B7 is part of Rust's syntax.  U+00B7 has
the properties (X)ID_continue, so there is no formal problem.

Richard.



More information about the Unicode mailing list