Unpaired surrogates (was: Re: Why Work at Encoding Level?)

Philippe Verdy verdy_p at wanadoo.fr
Mon Oct 19 16:17:46 CDT 2015


2015-10-19 22:32 GMT+02:00 Doug Ewell <doug at ewellic.org>:

> Philippe Verdy wrote:
>
> > No ! The "supplementary code points" (or "supplementary characters"
> > when they are assigned to characters) are represented in UTF-16 as two
> > **code units**, NOT as two "code points" (even if their binary value
> > are related).
>
> Surrogate values are not abstract characters,


I did NOT write that.


> but they are code points
>

That is what I wrote; you are only rephrasing it.


> (D10). Note that Surrogate is one of the seven types of code points
> (D10a).
>

I have not denied this. What I denied was Richard's assertion that a single
(supplementary) code point could be represented as two (surrogate) code
points. That statement was wrong only in its last word ("point" instead of
"unit").
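
To make the "point" vs. "unit" distinction concrete, here is a minimal Python
sketch. It is only an illustration, not text from the standard: the helper
name utf16_code_units and the choice of U+10400 as the sample are my own
assumptions. It shows one supplementary code point encoded as two UTF-16 code
units whose numeric values fall in the surrogate range.

```python
import struct

def utf16_code_units(cp):
    """Return the UTF-16 code units encoding one Unicode scalar value.

    Hypothetical helper written for this illustration only.
    """
    if not (0 <= cp <= 0x10FFFF) or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("not a Unicode scalar value")
    if cp < 0x10000:
        # BMP code point: a single 16-bit code unit.
        return [cp]
    # Supplementary code point: split the 20-bit offset into two code units.
    v = cp - 0x10000
    return [0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF)]

cp = 0x10400                      # one supplementary code point
units = utf16_code_units(cp)      # two 16-bit code units

# Cross-check against Python's own UTF-16 codec (big-endian, no BOM).
encoded = chr(cp).encode("utf-16-be")
codec_units = list(struct.unpack(">%dH" % (len(encoded) // 2), encoded))

print([hex(u) for u in units])    # the two code units
print(len(chr(cp)))               # number of code points in the string
```

The string still contains exactly one code point; only its UTF-16 encoded
form contains two code units.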