Potential contradiction between the WordBreak test data and UAX #29

Daniel Bünzli daniel.buenzli at erratique.ch
Wed Nov 23 04:52:56 CST 2016


On Wednesday 23 November 2016 at 11:22, Tom Hacohen wrote:
> Thank you for your reply, but I don't think the UAX, specifically the
> line you quoted implies that. The line you quoted says that the process 
> is terminated when a rule matches and produces a boundary status. In 
> Table 1[1], the right-arrow (which is used in rule 4) is listed as a 
> boundary symbol, 

Precisely, rules with this *symbol* do not produce a boundary *status* which is either boundary or not boundary as mentioned in parens in the line I quoted.
 
> so I would argue that one should stop the process and start it again from the start.

At least in the current UAX there is no mention of an idea of stopping and restarting the process at all.

Best, 

Daniel


More information about the Unicode mailing list