Combining characters
Martin J. Dürst
duerst at it.aoyama.ac.jp
Sun Dec 14 18:31:33 CST 2025
On 2025-12-15 08:07, Alex Shpilkin via Unicode wrote:
>
> On Sun, Dec 14 2025 at 14:02:41 -08:00:00, Asmus Freytag via Unicode
> <unicode at corp.unicode.org> wrote:
>> To make matters more complex, some combining marks are defined to not
>> reorder. Those can be in any order defined by the author and could
>> lead to duplicate encoding for the same display. The reasons behind
>> supporting that are a bit complex, but generally it's done for scripts
>> other than Latin.
>
> Amusingly, study of literal Latin, the language, uses two combining
> marks of the same CCC together as a matter of course: dictionaries mark
> a vowel with (what in NFD would be) the sequence COMBINING MACRON,
> COMBINING BREVE to tell the reader that a syllable’s length either
> varies or cannot be determined.
These two characters are indeed not reordered, but that's not a problem,
because they are stacked. The sequence COMBINING MACRON, COMBINING BREVE
will have the macron between the character and the breve, whereas the
sequence COMBINING BREVE, COMBINING MACRON will have the macron above
the breve. Not an expert, but my assumption is that only the first one
is customary for Latin.
Regards, Martin.
More information about the Unicode
mailing list