Ambiguous hyphenation cases with

Kess Vargavind vargavind at
Tue Jul 22 11:24:36 CDT 2014

There actually is one simple solution that I sometimes use: do not contract
three consecutive same-letter consonants at all! That is, do like Icelandic
and write food thief as <mattjuv> and carpet thief as <matttjuv>. Then
there is no trouble hyphenating.

Yes, this goes against current spelling rules in Swedish, but it works. And
until there is better hyphenation support for corner cases like this
(either at character level or higher) that is how I have ‘solved’ it when
unable to do manual tweaking.

Would it be logical to add a character similar to U+00AD SOFT HYPHEN (shy)
that says “you can break me here, but unless you do please skip the
previous character (however such would be defined in a case like this)”?
Such that <matt[SHY-LIKE-CHAR]tjuv> is either rendered <mattjuv> or broken
up as <matt-tjuv>.


2014-07-22 16:03 GMT+02:00 fantasai <fantasai.lists at>:

> On 05/12/2014 12:43 AM, Håkan Save Hansson wrote:
>> Hi fantasai,
>> Regarding your answer to my second suggestion (if you are referring
>> to James Clarks first answer):
>> The problem is that the hyphenation system in itself can't decide how
>> to change the spelling, without any "dictionary"   functionality. It
>> can't know if I meant "mat-tjuv" ("food thief" in Swedish) or "matt-tjuv"
>> ("carpet thief") when I wrote "mat­tjuv". So there has to be a way
>> to tell the hyphenation system that.
> Hm. I don't think I have a solution for that problem. :/ Currently you'd
> just have to not hyphenate that word.
> CCing Unicode, in case anyone there has a solution
> Up-reference:
> html
> ~fantasai
> _______________________________________________
> Unicode mailing list
> Unicode at
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list