a character for an unknown character

Richard Wordingham richard.wordingham at ntlworld.com
Fri Dec 30 16:17:12 CST 2016

On Fri, 30 Dec 2016 20:13:41 +0100 (CET)
Marcel Schneider <charupdate at orange.fr> wrote:


> These seem to me identical to U+00B7 and U+2022 respectively. Perhaps
> weʼre here faced with two examples of what Asmus referred to as
> “incorrectly encoded more than once” (talking of “Many other "simple"
> marks: lines, circles, triangles, hooks, and squares, or groups of
> them”).

I was talking about what "fuels the misperception that Unicode somehow
encodes symbols based on a single conventional usage".

> > However, I still don't know whether to spell the word «fiʼ» or
> > «fi’». I've only seen it in print.   

> That depends on the spelling convention.

That's the problem.  I'm not aware of any literature in the language of
the Chtaptisk Fithp in any Terran script - there's more Punic in
Plautus's Poenulus.

> If the apostrophe and the
> single comma quote are disunified, then U+02BC is used to spell the
> word «fiʼ» (your first option). You might also wish to ask the
> publisher, but Iʼm unsure whether he will appreciate to have to join
> publicly one or the other spelling current.

You obviously haven't read the story's discussion of whether the fithp
would honour a peace treaty!

I think the general understanding of the difference is very limited.
For instance, the English wikipedia article about Klingon says, "The
apostrophe, denoting the glottal stop, is considered a letter, not a
punctuation mark", and then goes on to encode it as U+2019!  The French
wikipedia also uses U+2019.


More information about the Unicode mailing list