Encoding italic

Doug Ewell via Unicode unicode at unicode.org
Wed Jan 30 13:06:02 CST 2019

Martin J. Dürst wrote:
> Here's a little dirty secret about these tag characters: They were
> placed in one of the astral planes explicitly to make sure they'd use
> 4 bytes per tag character, and thus quite a few bytes for any actual
> complete tags.
<theory type="conspiracy" serious="false">

Aha. That explains why SCSU had to be banished to the hut, right around
the same time the Plane 14 language tags were deprecated. In SCSU,
astral characters can be 1 byte just like BMP characters.

Doug Ewell | Thornton, CO, US | ewellic.org

More information about the Unicode mailing list