How to disable Indic syllable form editing in MS word

Shriramana Sharma via Indic indic at unicode.org
Sun Dec 10 00:24:06 CST 2017


On 12/8/17, Richard Wordingham via Indic <indic at unicode.org> wrote:
>> In LibreOffice, one benefits from the fact that
>> grapheme clusters only include one full consonant - K.SSA is the
>> grapheme cluster <KA, VIRAMA> followed by a grapheme cluster starting
>> with SSA.
>
> However, if the current proposals for UAX#29 go through and there are
> then no longer any extended grapheme cluster breaks in <KA, VIRAMA,
> SSA>, I fear it will no longer be easy to insert ZWNJ between virama
> and a following consonant.

I noticed the other thread in which you are participating, but unable
to read through and understand due to lack of time to grok the
technicalities.

I notice that in the case of some platforms/apps, for example the
Firefox on Kubuntu 16.04 that I'm using right now, if I place the
cursor before or after a cluster like क्ष्य and use the cursor keys,
the visual cursor doesn't jump the cluster but traverses it
progressively in N steps where N is one more than the number of
viramas inside it. At each step, the logical cursor seems to be placed
*after* a virama.

Are you saying the proposed update to UAX#29 is going to prohibit this
behaviour? That may not be advisable. Why are they trying to do it?

However, I should also note that while this behaviour seems quite
sensible for C1 conjoining cases, it won't help to insert joiners to
request C2-conjoining forms where the ZWJ needs to be put *before* the
virama. For instance in Kannada to get RA + post-base YA as in ರ‍್ಯ
the sequence is ರ, ZWJ, ್, ಯ. This can only be achieved in initial
input as I said earlier, because post-input, the cursor will only be
placed internally *after* the virama, and putting a ZWJ there just
breaks the cluster like ZWNJ: ರ್‍ಯ. This is because there is no
defined behaviour for Virama + ZWJ in Kannada.

But I presume Kannadigas can live with that (though I am not one
myself) because such sequences aren't frequently used at all. (In fact
most common users probably aren't aware that they exist.)

OTOH the requirement of inputting ZWJ in Devanagari to inhibit
ligatures in some over-enthusiastic fonts (since such ligatures are
sometimes not uniquely identifiable at  12 points) is a somewhat
*more* often experienced one among those typesetting Devanagari
documents, especially Sanskrit language ones with heavy cluster use.
So it would be useful to retain the behaviour of placing cursors after
intra-cluster virama-s.

-- 
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा ������������������������



More information about the Indic mailing list