Combining Characters
Don Hosek
don.hosek at gmail.com
Sun Dec 14 14:25:22 CST 2025
>
> When a Combining Backslash is added for Jovian, well, now that character
> is new and normalization adventures abound.
>
Just one additional note on this: Everything around combining characters,
normalization and grapheme segmentation is data-driven. Other than when new
rules for Indic scripts were introduced with Unicode 15.1.0, the only thing
I’ve needed to update for my Unicode grapheme library has been to import
the newest Unicode data tables. I’ve not written normalization code (yet),
but from everything that I’ve seen on that front, it looks like a similar
thing where again, everything is data-driven.
The only case I can see where things could get weird would be if there
suddenly became some weird case where, e.g., the Jovians insisted that the
combining backslash must appear before the letter and not after it (and
it’s been a few years since I had to really look at the rules and this
might be possible with the existing combining character classes anyway).
-dh
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://corp.unicode.org/pipermail/unicode/attachments/20251214/d636db7d/attachment.htm>
More information about the Unicode
mailing list