UAX29 9.0.0 Grapheme cluster spec & test discrepancy

Daniel Bünzli daniel.buenzli at erratique.ch
Tue Jun 21 16:19:31 CDT 2016


Hello,

It seems there's a discrepancy between the tests and the spec for grapheme clusters. In

 http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.txt  

we have:  

÷ 261D × 0308 × 1F3FB ÷  
# ÷ [0.2] WHITE UP POINTING INDEX (E_Base)  
# × [9.0] COMBINING DIAERESIS (Extend)  
# × [10.0] EMOJI MODIFIER FITZPATRICK TYPE-1-2 (E_Modifier) ÷ [0.3]

which is  

 http://www.unicode.org/Public/9.0.0/ucd/auxiliary/GraphemeBreakTest.html#r10.0

but the spec doesn't talk about interleaved Extend*:  

 http://www.unicode.org/reports/tr29/proposed.html#GB10

It seems following the spec this would be:  

÷ 261D × 0308 ÷ 1F3FB ÷

which one is right ?

Best,  

Daniel





More information about the Unicode mailing list