Question on BCP 47
Patrick Andries via CLDR-Users
cldr-users at unicode.org
Sat Nov 18 11:48:38 CST 2017
In Windows, certain cultures have an "alternate sort" order (collation).
This list , according to the LCID reference, is:
LCID
Languagetag
Collationtype
0x1007F
x-IV-mathan
Math alphanumeric
0x10407
de-DE_phoneb
Phonebook
0x1040E
hu-HU_tchncl
Technical
0x10437
ka-GE_modern
Modern
0x20804
zh-CN_stroke
Stroke count
0x21404
zh-MO_stroke
"
0x21004
zh-SG_stroke
"
0x30404
zh-TW_pronun
Pronunciation
0x40404
zh-TW_radstr
Radical/stroke
0x40411
ja-JP_radstr
"
0x40C04
zh-HK_radstr
"
0x41404
zh-MO_radstr
"
Some of these "collation"s have equivalent entries among the collation
identifiers defined in the "Unicode locale extension"
(See:unicode.org/repos/
<https://www.unicode.org/repos/cldr/tags/latest/common/bcp47/collation.xml>cldr
<https://www.unicode.org/repos/cldr/tags/latest/common/bcp47/collation.xml>/tags/latest/common/bcp47/collation.xml
<https://www.unicode.org/repos/cldr/tags/latest/common/bcp47/collation.xml>
):
Identifier
Description
big5han
Pinyin orderingfor Latin, big5 charsetorderingfor CJK characters(usedin
Chinese)
compat
A previousversion of the ordering, for compatibility
dict
Dictionarystyle ordering
ducet
The default Unicode collation elementtable order
emoji
Recommendedorderingfor emojicharacters
eor
Europeanorderingrules
gb2312
Pinyin orderingfor Latin, gb2312han charsetorderingfor CJK
characters(usedin Chinese)
phonebk
Phonebookstyle ordering(suchas in German)
phonetic
Phoneticordering(sortingbasedon pronunciation)
pinyin
Pinyin orderingfor Latin and for CJK characters(usedin Chinese)
reformed
Reformedordering(suchas in Swedish)
search
Specialcollation type for string search
searchjl
Specialcollation type for Koreaninitial consonant search
standard
Default orderingfor eachlanguage
stroke
Pinyin orderingfor Latin, stroke orderfor CJK characters(usedin Chinese)
trad
Traditionalstyle ordering(suchas in Spanish)
unihan
Pinyin orderingfor Latin, Unihanradical-stroke orderingfor CJK
characters(usedin Chinese)
zhuyin
Pinyin orderingfor Latin, zhuyinorderfor Bopomofoand CJK
characters(usedin Chinese)
The question is thus the following: if one wants to create a BCP 47
string representing the locale and the options a user has chosen in a
Windows environment, one should be able to represent the "Windows
alternate sorts" in BCP 47 syntax. Some such as "phoneb" have equivalent
entries ("phonebk") but some don't apparently.
If some Windows alternate sorts do not have equivalent entries, should
we request for these to be added to the CLDR, or rather use a "variant
tag", or yet use a "private use" tag in the BCP 47 format?
Patrick Andries
---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel antivirus Avast.
https://www.avast.com/antivirus
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/cldr-users/attachments/20171118/acbf15cd/attachment-0001.html>
More information about the CLDR-Users
mailing list