Question about Perl5 extended UTF-8 design

Markus Scherer at
Thu Nov 5 12:15:28 CST 2015

On Thu, Nov 5, 2015 at 9:25 AM, Philippe Verdy <verdy_p at> wrote:

> (0xFF was reserved only in the old RFC version of UTF-8 when it allowed
> code points up to 31 bits, but even this RFC is obsolete and should no
> longer be used and it has never been approved by Unicode).

No, even in the original UTF-8 definition, "The octet values FE and FF
never appear."
The highest lead byte was 0xFD.

(For the "really original" version see

In the current definition, "The octet values C0, C1, F5 to FF never
appear." =

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list