Counting Devanagari Aksharas

Richard Wordingham via Unicode unicode at
Thu Apr 20 14:14:10 CDT 2017

On Thu, 20 Apr 2017 11:17:05 -0700
Manish Goregaokar via Unicode <unicode at> wrote:

> When given a rendered representation people seem to uniformly count
> conjuncts as multiple aksharas if rendered with visible halant, and as
> a single akshara if they are rendered conjoined.

Now, that's what I expected.

> I'm of the opinion that Unicode should start considering devanagari
> (and possibly other indic) consonant clusters as single extended
> grapheme clusters. Yes, sometimes it's not rendered as a single glyph,
> but sometimes family emoji will not render as a single glyph either
> (if you use skin tones or more than 4 family members) and we still
> consider those EGCs.

You won't like it if cursor movement granularity is reduced to one
extended grapheme cluster.  I'm grateful that Emacs allows me to
delete and replace the first NFC character of a grapheme cluster.


More information about the Unicode mailing list