How to disable Indic syllable form editing in MS word

maxwell via Indic indic at unicode.org
Thu Dec 7 16:18:23 CST 2017


On 2017-12-07 16:38, Richard Wordingham wrote:
> On Thu, 07 Dec 2017 11:33:57 -0500
> maxwell via Indic <indic at unicode.org> wrote:
> 
>> Why are these not treated in the Unicode standard as analogous to
>> base+diacritic pairs with respect to NFC and NFD?  E.g. when you
>> convert text to NFC, why isn't a sequence of U+09C7 + consonant +
>> U+09BE converted to consonant + U+09CB, and vice versa when
>> converting to NFD?
> 
> I'm puzzled by what you say.  What *looks* like <U+09C7, consonant,
> U+09BE> should, if it is represented by three characters, be encoded as
> <consonant, U+09C7, U+09BE>,

You're of course right, I got the underlying order wrong.

> which is indeed canonically equivalent to
> <consonant, U+09CB>.

It's canonically equivalent--that was what I was trying to say--but the 
last time I tested this using Python's Unicode conversion between NFC 
and NFD, I was sure it did not handle this case.  But I tried it just 
now, and it did work; so either my test was wrong before, or this has 
now been fixed in Python.  In either case, my earlier message was wrong.

    Mike Maxwell
    University of Maryland


