UAX29 9.0.0 Grapheme cluster spec & test discrepancy
Daniel Bünzli
daniel.buenzli at erratique.ch
Tue Jun 21 16:19:31 CDT 2016
Hello,
It seems there's a discrepancy between the tests and the spec for grapheme clusters. In
http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.txt
we have:
÷ 261D × 0308 × 1F3FB ÷
# ÷ [0.2] WHITE UP POINTING INDEX (E_Base)
# × [9.0] COMBINING DIAERESIS (Extend)
# × [10.0] EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier) ÷ [0.3]
which is
http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.html#r10.0
but the spec doesn't talk about interleaved Extend*:
http://www.unicode.org/reports/tr29/proposed.html#GB10
It seems following the spec this would be:
÷ 261D × 0308 ÷ 1F3FB ÷
which one is right ?
Best,
Daniel
More information about the Unicode
mailing list