Long standing problem with Vedic tone markers and post-base visarga/anusvara
Richard Wordingham via Unicode
unicode at unicode.org
Sat Jan 4 06:50:09 CST 2020
On Thu, 2 Jan 2020 20:20:34 +0000
Richard Wordingham via Unicode <unicode at unicode.org> wrote:
> There's a project whose basis I can't find to convert Indian Indic
> rendering at least to use the USE. Now, according to the
> specification of the USE, visarga, anusvara and cantillation marks
> are all classified as vowel modifiers, and are so ordered relative to
> one another in the Indian Indic order: left, top, bottom, right. So,
> the problem should already be solved for Grantha, and, if the plans
> come to fruition, will work with a font whose Devanagari script tag
> is 'dev3'. However, I may have overlooked a set of overrides to the
> USE categorisations.
I've now knocked up a partial* representation* of a Devanagari dev3 and
a Grantha font (which I'm dubbing 'Mock Indic 3'). The supported
orders of COMBINING DIGIT ONE and VISARGA, as in Firefox on
Linux, are:
dev2: कः꣡ <U+0915 LETTER KA, U+0903 VISARGA, U+A8E1 COMBINING DEVANAGARI
DIGIT ONE>
dev3: क꣡ः <U+0915, U+A8E1, U+0903>
Grantha: (1) <U+11315 LETTER KA, U+11367 COMBINING GRANTHA DIGIT
ONE, U+11303 VISARGA>
(2) <U+11315, U+11303, U+11367>
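For anyone wanting to check the sequences by code point rather than by
eye, here is a minimal Python sketch. It assumes only the standard
unicodedata module; the names SPELLINGS, describe and nfc are made up for
illustration. It lists each spelling and shows that NFC normalisation
leaves the two orders distinct, because the visarga has canonical
combining class 0 and so blocks any reordering.

```python
import unicodedata

# The four spellings discussed above, written as explicit code points.
SPELLINGS = {
    "dev2": "\u0915\u0903\uA8E1",
    "dev3": "\u0915\uA8E1\u0903",
    "Grantha (1)": "\U00011315\U00011367\U00011303",
    "Grantha (2)": "\U00011315\U00011303\U00011367",
}


def describe(text):
    """Return (code point, character name, canonical combining class) per character."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"), unicodedata.combining(ch))
        for ch in text
    ]


def nfc(text):
    """Normalise to NFC; marks are only reordered across non-zero combining classes."""
    return unicodedata.normalize("NFC", text)


if __name__ == "__main__":
    for label, text in SPELLINGS.items():
        print(label)
        for code_point, name, ccc in describe(text):
            print(f"  {code_point} {name} (ccc={ccc})")
    # The two orders are not canonically equivalent, so they stay distinct.
    print(nfc(SPELLINGS["dev2"]) == nfc(SPELLINGS["dev3"]))
    print(nfc(SPELLINGS["Grantha (1)"]) == nfc(SPELLINGS["Grantha (2)"]))
```

Since normalisation will not merge the two orders, text stored in one
order has to be matched to a font that supports that order.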
The second Grantha spelling is enabled by a HarfBuzz-only change to
the USE categorisations. It treats Grantha visarga and spacing
anusvara as though inpc=Top rather than inpc=Right. As I am using
Ubuntu 16.04, this override isn't supported in applications that use the
system HarfBuzz library, such as my email client.
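The effect of that override can be illustrated with a toy model in
Python. This is not HarfBuzz's code or the USE cluster grammar. It
assumes only the rule quoted above, that vowel modifiers are ordered
left, top, bottom, right, and it assumes the combining digit counts as
Top. All names in it (ORDER, DEFAULT_POSITION, OVERRIDE_POSITION,
is_accepted) are hypothetical.

```python
# Rank of each positional class in the quoted order: left, top, bottom, right.
ORDER = {"Left": 0, "Top": 1, "Bottom": 2, "Right": 3}

# Assumed default classification: visarga and spacing anusvara are Right,
# the combining digit (a cantillation mark) is Top.
DEFAULT_POSITION = {"visarga": "Right", "anusvara": "Right", "digit_one": "Top"}

# The override described above: treat visarga and spacing anusvara as Top.
OVERRIDE_POSITION = dict(DEFAULT_POSITION, visarga="Top", anusvara="Top")


def is_accepted(marks, position):
    """Accept a mark sequence if its positional ranks never decrease."""
    ranks = [ORDER[position[mark]] for mark in marks]
    return all(a <= b for a, b in zip(ranks, ranks[1:]))


if __name__ == "__main__":
    spellings = {
        "Grantha (1)": ["digit_one", "visarga"],
        "Grantha (2)": ["visarga", "digit_one"],
    }
    for label, marks in spellings.items():
        print(label, is_accepted(marks, DEFAULT_POSITION), is_accepted(marks, OVERRIDE_POSITION))
```

In this model spelling (1) is accepted under both classifications, while
spelling (2) is accepted only once the visarga is treated as Top, which
matches the behaviour reported above.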
We are now establishing incompatible Devanagari font-specific
encodings fully compliant with TUS!
Richard.
* Partial = much is not handled
* Representation = glyphs are wrong, merely showing arrangement. (I've
actually re-used a Tai Tham font.)