Ancient Greek apostrophe marking elision

Richard Wordingham via Unicode unicode at unicode.org
Sun Jan 27 17:38:40 CST 2019


On Sun, 27 Jan 2019 19:57:37 +0000
James Kass via Unicode <unicode at unicode.org> wrote:

> On 2019-01-27 7:09 PM, James Tauber via Unicode wrote:
> > In my original post, I asked if a language-specific tailoring of
> > the text segmentation algorithm was the solution but no one here
> > has agreed so far.  
> If there are likely to be many languages requiring exceptions to the 
> segmentation algorithm wrt U+2019, then perhaps it would be better to 
> establish conventions using ZWJ/ZWNJ and adjust the algorithm 
> accordingly so that it would be cross-languages.  (Rather than
> requiring additional and open ended language-specific tailorings.) (I
> inserted several combinations of ZWJ/ZWNJ into James Tauber's
> example, but couldn't improve the segmentation in LibreOffice,
> although it was possible to make it worse.)

If you look at TR29, you will see that ZWJ should only affect word
boundaries for emoji.  ZWNJ shall have no effect.  What you want is a
control that joins words, but we don't have that.

Richard.



More information about the Unicode mailing list