Interpreting t-h0- mechanism

Mark Davis ☕️ via CLDR-Users cldr-users at unicode.org
Sun Oct 20 08:46:56 CDT 2019


On Sat, Oct 19, 2019 at 1:48 PM Matthew Stuckwisch via CLDR-Users <
cldr-users at unicode.org> wrote:

> The T extension for BCP-47 are contained in CLDR and described at length
> in TR35, so I hope this is the correct forum for the question.  I'm
> implementing a module for handling language tags and have a few questions
> relating to the h0 tag:
>
> 1. What is the difference between 'en-t-es-h0-hybrid' and 'en-t-h0-es'?
> Both styles are given as example encodings for what would be Spanglish
> (Spanish-English hybrid), but surely there ought to be a difference
>

The latter style is illegal. The key h0 only currently takes one possible
value, h0-hybrid.

https://unicode.org/reports/tr35/tr35.html#Hybrid_Locale

If you are seeing "h0-es" someplace in the spec, please let us know, since
that would be a typo.

2. What would be the interpretation of 'en-t-pt-h0-es'?  English,
> transformed from Portuñol (Spanish-Portuguese hybrid), or Spanglish,
> transformed from Portuguese?
>

That is illegal, so it doesn't have any interpretation.

> 3. What would the correct method for indicating, for instance, Portuñol
> transformed from Franglish (French-English hybrid)?
>

That cannot currently be represented. You can file a ticket if you want to
present a case for why that would be important.

> 4. What is the format for a triple language hybrid?  Would
> Asturian-Portuguese-Spanish be 'ast-t-h0-pt-h0-es'?
>

That is illegal, so it doesn't have any interpretation. There is no syntax
for a mixture of more than 2 languages. You can file a ticket if you want
to present a case for why that would be important.


> TR35 is fairly anemic in the h0 mechanism documentation, and particularly
> with regards to (4), it seems like much was intentionally left as an open
> question to be handled at a later date, hence my confusion.
>

The validity for key-type combinations is established by referring to the
https://github.com/unicode-org/cldr/tree/master/common/bcp47 data. Between
that and https://unicode.org/reports/tr35/tr35.html#Hybrid_Locale, if there
are any "open questions", please file a new ticket so that we can fix them.


> Matéu
> _______________________________________________
> CLDR-Users mailing list
> CLDR-Users at unicode.org
> http://unicode.org/mailman/listinfo/cldr-users
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/cldr-users/attachments/20191020/06c28200/attachment.html>


More information about the CLDR-Users mailing list