Armenian Mijaket (Armenian colon)

Philippe Verdy via Unicode unicode at unicode.org
Tue Dec 5 13:46:22 CST 2017


Note that "Noto Sans Armenian" does not even map U+2024 (I doubt it is
accepted as a real replacement for the missing Armenian mijaket which plays
a role similar to a Latin semicolon or colon), it does match the hyphen at
U+2010. But U+0589 (Armenian "versakjet", the Armenian full stop that looks
like a ":" Latin colon) is mapped.

My opinion is that the one dot leader has only been used by some sources
that don't need to render tabular data or TOCs: these sources needing these
traditional distinctions are probably religious texts, and clearly they
don't even look like what is in the Unicode PDF for the representative
glyph, and "Noto Sans Armenian" is designed for modern use on display and
even there we'll need a better distinction and better metrics than going
with the possible "Noto Sans" mapping of the leader dot at U+2024 (which
still does not exist: in fact leaders are better represented another way
than by repeating this character: leaders are essially parsed in arbitrary
lengths like a tabulation whitespace and so the leader dot is not
semantically suitable at all as a mijaket (it's just like if we wanted to
replace ASCII full stops or colons and semicolons in English by SPACE or
TAB: in Armenian this just causes havoc).


2017-12-05 20:28 GMT+01:00 Philippe Verdy <verdy_p at wanadoo.fr>:

> U+2024 is not supported in any fonts I have loaded. A websearch of mijaket
> gives nothing.
> U+20224 is used as a "leader dot", and does not match the expected metrics
> (it is  certainly not a mijaket, it should be more like U+0589, i.e. as a
> bold parallelogram, and not a thin leader dot).
>
> Leader dots are NOT used as real punctuation, they are presentational, for
> example in TOC (table of contents), where they are aligned in arbitrarily
> long rows.
>
> The note in http://www.unicode.org/charts/PDF/U2000.pdf is absolutely not
> normative and in fact it is wrong in my opinion.
>
> The mijaket (Armenian colon) should be encoded (preferably at U+0588 in
> the Armenian block) as it also has to be distinguisdhed from leader dots in
> Armenian TOC, exactly like the vertsaket was distinguished at U+0589.
>
>
> 2017-12-05 19:59 GMT+01:00 S. Gilles <sgilles at math.umd.edu>:
>
>> On 2017-12-05T18:44:05+0100, Philippe Verdy via Unicode wrote:
>> > The Armenian script has its own distinctive punctuation (vertsaket) for
>> the
>> > standard full stop at end of sentence (whose glyph looks very much like
>> the
>> > Basic Latin/ASCII colon, however slighly more bold and slanted and whose
>> > dots are rectangular). It is encoded at U+0589. And used in traditional
>> > texts instead of the "modern" full stop.
>> >
>> > But Armenian also has its own distinctive puctuation (mijaket) for the
>> > introductory colon between two phrases of the same sentence (whose glyph
>> > looks very much like the Basic Latin/ASCII full stop). It is not encoded
>> > and I don't like using the ASCII full stop where it causes confusion.
>> >
>> > Where is the Armenian distinctive mijaket? Shouldn't it be encoded at
>> > U+0588?
>>
>> Off-list because I generally don't know what I'm talking about, but
>> grepping NamesList.txt for ‘mijaket’ gives U+2024. If this isn't
>> what you're looking for, my apologies.
>>
>> --
>> S. Gilles
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/unicode/attachments/20171205/983b9e06/attachment.html>


More information about the Unicode mailing list