Is ARMENIAN ABBREVIATION MARK (՟, U+055F) misclassified?

Richard Wordingham via Unicode unicode at
Sat Apr 27 03:38:54 CDT 2019

On Sat, 27 Apr 2019 00:08:52 +0000
James Kass via Unicode <unicode at> wrote:

> On 2019-04-26 11:08 PM, Doug Ewell via Unicode wrote:
> > This is a small percentage of the number of fonts that have all
> > four of these Armenian glyphs, but show the abbreviation mark as a
> > spacing glyph. It looks like Unicode is right, Wikipedia is right,
> > and the fonts are wrong.  
> If the Wikipedia page(s) are correct, then Unicode isn't.  Unicode 
> charts don't show the glyph on the dotted circle and the canonical 
> combining class is shown as "spacing".  The fact that Doug Ewell
> found some installed fonts displaying the character as a combining
> mark suggests that the Wikipedia pages are correct.  This character
> is listed as being unused in modern Armenian, but you'd think that it
> would have been exposed before now since the charcter has been in
> Unicode since version 1.0.

Well, ccc=0 is entirely permissible for non-spacing marks, though I find
it an invitation to misspell words.

I think the most important admissible issue is one of word boundaries.
U+055F has line_break=alphabetic, but word_break=other.  The latter
doesn't seem very friendly towards spell checkers, but perhaps there is
a good reason for it.  Word_break=other is not compatible with being a
non-spacing mark.

Another important, but probably inadmissible, issue is that of the
effect on editing. Life is easier if one can easily change the character
preceding the abbreviation sign; this would often be difficult if the
abbreviation sign were a combining mark.

What advantages would accrue from changing U+055F from Po to Mn?


More information about the Unicode mailing list