Another UAX #29 bug: property tables need updating
manish at mozilla.com
Thu Dec 22 12:35:55 CST 2016
The spec lists GraphemeBreakProperty.txt and
WordBreakProperty.txt as the normative source for grapheme and word
However, the spec also gives non-normative definitions of these
properties. In particular, it defines Glue_After_Zwj as
> Emoji characters that do not break from a previous ZWJ in a defined emoji zwj sequence, and are not listed as Emoji_Modifier_Base=Yes in emoji-data.txt. See [UTR51].
Going through emoji-zwj-sequences.txt, there are a lot of emoji
characters that satisfy this property. The kiss/heart emojis are like
this, as well as every object emoji in the "Gendered Role, with
object" section. However, we only count the kiss, heart, and speech
bubble emoji as GAZ in the property table.
The property table should include all role and gender modifiers as GAZ.
Could this be updated?
More information about the Unicode