Unicode locale ID vs. POSIX variant

Markus Scherer markus.icu at gmail.com
Tue Sep 30 15:01:44 CDT 2014


Please help me understand the POSIX locale variant.

I assume that en_US_POSIX is still valid in the old syntax. For example,
see common/collation/en_US_POSIX.xml
<http://unicode.org/cldr/trac/browser/trunk/common/collation/en_US_POSIX.xml>

I assume that en-US-POSIX is a valid Unicode Language Identifier
<http://www.unicode.org/reports/tr35/tr35.html#Unicode_language_identifier>
(new
syntax) because CLDR supplemental metadata includes POSIX as a valid
variant.

CLDR also defines -u-va-posix.

It looks like in non-CLDR BCP 47, en-US-POSIX is not valid because POSIX is
not a registered language subtag.

Legacy Variants
<http://www.unicode.org/reports/tr35/tr35.html#Legacy_Variants> says to
convert the old-syntax variant POSIX to -u-va-posix.

Should a Unicode Language Identifier use variant POSIX but when converting
to non-CLDR BCP 47 convert that variant to -u-va-posix? Or should one
always convert from old syntax to new -u-va-posix (just in case the
recipient only understands BCP 47)?

When and where else should one convert between the POSIX variant and the
-u-va-posix extension?

Part of the problem is that old and new syntax are indistinguishable when
there is no '@' and no singleton subtag, and - and _ are both accepted as
separators, as usual.

Is en-US-POSIX-u-va-posix valid?
Is it the same as en-US-u-va-posix?

*References:*

supplementalMetadata.xml
<http://unicode.org/cldr/trac/browser/trunk/common/supplemental/supplementalMetadata.xml>
includes
"POSIX" in <variable id="$variant" type="choice">

I see http://www.unicode.org/reports/tr35/tr35.html#Key_Type_Definitions

Locale variant

*bcp47/variant.xml*
"va"Common variant type"posix"POSIX style locale variant
and http://unicode.org/repos/cldr/trunk/common/bcp47/variant.xml
<key name="va" description="Common locale variant type key">
<type name="posix" description="POSIX style locale variant"/>

https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry
does not mention POSIX.

markus
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/cldr-users/attachments/20140930/4521db80/attachment.html>


More information about the CLDR-Users mailing list