"A Programmer's Introduction to Unicode"

Philippe Verdy verdy_p at wanadoo.fr
Tue Mar 14 10:35:48 CDT 2017


Per definition yes, but UTC-4 is not Unicode.
As well (any UCS-4 code unit) & 0xFFE00000 == 0 (i.e. 21 bits) is not
Unicode, UTF-32 is Unicode (more restrictive than just 21 bits which would
allow 32 planes instead of just the 17 first ones).
I suppose he meant 21 bits, not 11 bits which covers only a small part of
the BMP.

2017-03-14 16:14 GMT+01:00 Doug Ewell <doug at ewellic.org>:

> Steffen Nurpmeso wrote:
>
> >> I didn’t say you never needed to work with code points. What I said
> >> is that there’s no advantage to UCS-4 as an encoding, and that
> >
> > Well, you do have eleven bits for flags per codepoint, for example.
>
> That's not UCS-4; that's a custom encoding.
>
> (any UCS-4 code unit) & 0xFFE00000 == 0
>
> --
> Doug Ewell | Thornton, CO, US | ewellic.org
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/unicode/attachments/20170314/8075ff5b/attachment.html>


More information about the Unicode mailing list