Tengwar on a general purpose translation site

Richard Wordingham richard.wordingham at ntlworld.com
Wed Mar 16 02:37:16 CDT 2022

On Tue, 15 Mar 2022 17:51:04 -0400
"Mark E. Shoulson via Unicode" <unicode at corp.unicode.org> wrote:

> On 3/15/22 17:07, Richard Wordingham via Unicode wrote:
> > On Sun, 13 Mar 2022 17:41:20 -0600
> > Doug Ewell via Unicode <unicode at corp.unicode.org> wrote:
> >  
> >> Richard Wordingham wrote:  
> >>> Under the 2001 scheme, which proposes an encoding in the SMP, not
> >>> in a PUA, the tehtar would merit being letters, just like the
> >>> non-spacing letter U+0D4E MALAYALAM LETTER DOT REPH.  
> >> The section “Rendering” in the 2001 document seems to me to make
> >> the same statements about modes and tehtar as the CSUR proposal.  
> > Under the former, cons1-tehta-cons2 has tehta displayed on cons1.
> > In the 2001 proposal, a Sindarin font would display the tehta on
> > cons2.  
> If you ask me, it's pretty clear that tehtar are/should be combining 
> characters, like accents or Hebrew vowels.  And yes, then Sindarin
> gets encoded with a non-obvious ordering.  But really, in the context
> of all the various input-method pain people get put through for other
> scripts, is that really so terrible? Even Hebrew codes the furtive
> PATAH after the letter even though it's pronounced before it.
> (That's only one vowel, and not a very common one at that, but still.)

That's an oddity I couldn't find called out in TUS.  I was considering
asking about that.  Apparently furtive pathah is sometimes written
bottom right rather than below.

But the Sindarin vowel positioning also applies to English, which may
still be the language most often used in new Tengwar text.


More information about the Unicode mailing list