Richard Wordingham richard.wordingham at ntlworld.com
Tue Jun 30 01:47:46 CDT 2015

On Sat, 27 Jun 2015 17:48:41 +0200 (CEST)
Marcel Schneider <charupdate at orange.fr> wrote:

> On Fri, Jun 26, Richard Wordingham  wrote:
> > On Fri, 26 Jun 2015 12:48:39 +0200 (CEST) Marcel Schneider  wrote:

>>> Still in French, the letter apostrophe, when used as current
>>> apostrophe, prevents the following word from being identified as a
>>> word because of the missing word boundary and, subsequently,
>>> prevents the autoexpand from working. This can be fixed by adding
>>> a word joiner after the apostrophe, thanks to an autocorrect entry
>>> that replaces U+02BC inserted by default in typographic mode, with
>>> U+02BC U+2060.

>> No, this doesn't work. While the primary purpose of U+2060 is to
>> prevent line breaks, it is also used to overrule word boundary
>> detectors in scriptio continua. (It works quite well for
>> spell-checking Thai in LibreOffice). It's name implies to me that it
>> is intended to prevent a word boundary being deduced, through the
>> strong correlation between word boundaries and line break opportunities.
>> There doesn't seem to be a code for 'zero-width word boundary at
>> which lines should not normally be broken'.

> Well, I extrapolated from U+FEFF, which works fine for me, even in
> this particular context.

Does the tool misinterpret U+FEFF between Thai characters as a word
boundary?  Incidentally, which tool are you talking of?


More information about the Unicode mailing list