RE: “plain text styling”…

Doug Ewell doug at ewellic.org
Mon Jan 9 23:09:51 CST 2023


Mark E. Shoulson wrote:

> That was sort of my question at the outset.  It doesn't make sense to
> call this "plain text" anymore, when it's formatted and styled.
> Styling is almost the *definition* of non-plain text. Unicode is all
> about plain text, where characters represent glyphs (or spaces) that
> represent text.  There are some exceptions to this: [...]
>
> 3. Emoji vs text presentation.
>
> 4. "Extreme" ligaturing involving emoji ZWJ sequences, regional tags
> becoming flags, and other pseudo-encoding.

I would actually consider things like bold, italics, and color to be less of an affront to “plain text” than an emoji presentation form or a sequence that adds up to “woman firefighter with medium-dark skin tone.” Granted ECMA-48 can be used for effects that are less plain-texty than bold, italics, and color.

> So I guess maybe we should restrict the discussion to those "updates
> and clarifications that are specific to Unicode" which you mention.
> What aspects would you consider those to be?  Things like what
> characters are valid to use in the codes or something?

Well, for one, redefining ECMA-48 in terms of Unicode characters instead of bytes, so that (say) one can have UTF-16 with styling, where the Escape character and the bracket and all that are 16 bits wide.

--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org




More information about the Unicode mailing list