Question on BCP 47

Patrick Andries via CLDR-Users cldr-users at unicode.org
Sat Nov 18 11:48:38 CST 2017


In Windows, certain cultures have an "alternate sort" order (collation).
According to the LCID reference, the list is:

LCID      Language tag    Collation type
0x1007F   x-IV-mathan     Math alphanumeric
0x10407   de-DE_phoneb    Phonebook
0x1040E   hu-HU_tchncl    Technical
0x10437   ka-GE_modern    Modern
0x20804   zh-CN_stroke    Stroke count
0x21404   zh-MO_stroke    Stroke count
0x21004   zh-SG_stroke    Stroke count
0x30404   zh-TW_pronun    Pronunciation
0x40404   zh-TW_radstr    Radical/stroke
0x40411   ja-JP_radstr    Radical/stroke
0x40C04   zh-HK_radstr    Radical/stroke
0x41404   zh-MO_radstr    Radical/stroke

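For reading the LCID column: in the Windows LCID layout, bits 16 to 19
hold the sort ID and the low 16 bits hold the language ID, so the leading
hex digit of each value above is the alternate sort. A minimal Python
sketch of that split (the helper name split_lcid is hypothetical, chosen
only for illustration):

```python
def split_lcid(lcid: int) -> tuple[int, int]:
    """Split a Windows LCID into (sort_id, language_id)."""
    sort_id = (lcid >> 16) & 0xF   # bits 16-19: sort ID
    language_id = lcid & 0xFFFF    # bits 0-15: language ID
    return sort_id, language_id


# Three LCIDs taken from the table above.
for lcid in (0x10407, 0x20804, 0x40411):
    sort_id, language_id = split_lcid(lcid)
    print(f"0x{lcid:05X}: sort ID {sort_id}, language ID 0x{language_id:04X}")
```

For example, 0x10407 splits into sort ID 1 and language ID 0x0407, the
language ID of de-DE.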

Some of these collations have equivalent entries among the collation
identifiers defined in the "Unicode locale extension" (see
<https://www.unicode.org/repos/cldr/tags/latest/common/bcp47/collation.xml>):

Identifier   Description
big5han      Pinyin ordering for Latin, big5 charset ordering for CJK
             characters (used in Chinese)
compat       A previous version of the ordering, for compatibility
dict         Dictionary style ordering
ducet        The default Unicode collation element table order
emoji        Recommended ordering for emoji characters
eor          European ordering rules
gb2312       Pinyin ordering for Latin, gb2312han charset ordering for
             CJK characters (used in Chinese)
phonebk      Phonebook style ordering (such as in German)
phonetic     Phonetic ordering (sorting based on pronunciation)
pinyin       Pinyin ordering for Latin and for CJK characters (used in
             Chinese)
reformed     Reformed ordering (such as in Swedish)
search       Special collation type for string search
searchjl     Special collation type for Korean initial consonant search
standard     Default ordering for each language
stroke       Pinyin ordering for Latin, stroke order for CJK characters
             (used in Chinese)
trad         Traditional style ordering (such as in Spanish)
unihan       Pinyin ordering for Latin, Unihan radical-stroke ordering
             for CJK characters (used in Chinese)
zhuyin       Pinyin ordering for Latin, zhuyin order for Bopomofo and
             CJK characters (used in Chinese)



The question is thus the following: if one wants to build a BCP 47
string representing the locale and the options a user has chosen in a
Windows environment, one should be able to represent the Windows
"alternate sorts" in BCP 47 syntax. Some, such as "phoneb", have an
equivalent entry ("phonebk"), but others apparently do not.

If some Windows alternate sorts have no equivalent entry, should we
request that they be added to CLDR, use a "variant" subtag instead, or
fall back on a "private use" subtag in the BCP 47 tag?
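To make two of these options concrete, here is a rough Python sketch that
turns a Windows name such as "de-DE_phoneb" into a BCP 47 tag: it uses
the "-u-co-" collation key when an equivalent identifier is known, and a
private use "-x-" subtag otherwise. The function name and the mapping
table are hypothetical. Only "phoneb" -> "phonebk" comes from the text
above; "stroke" -> "stroke" is an assumption based on the matching name,
and whether the two orderings really agree would need to be checked.

```python
# Windows sort suffix -> CLDR collation identifier.
# "phoneb" -> "phonebk" is the pair named in the post; "stroke" -> "stroke"
# is an ASSUMPTION based only on the identical name.
ASSUMED_EQUIVALENTS = {
    "phoneb": "phonebk",
    "stroke": "stroke",
}


def windows_sort_to_bcp47(windows_name: str) -> str:
    """Convert e.g. 'de-DE_phoneb' to a BCP 47 tag (illustrative only)."""
    locale, separator, sort = windows_name.partition("_")
    if not separator:
        # No alternate sort suffix: return the tag unchanged.
        return locale
    collation = ASSUMED_EQUIVALENTS.get(sort)
    if collation is not None:
        # Known equivalent: use the Unicode locale extension collation key.
        return f"{locale}-u-co-{collation}"
    # No known equivalent: fall back on a private use subtag.
    return f"{locale}-x-{sort}"


for name in ("de-DE_phoneb", "zh-CN_stroke", "hu-HU_tchncl"):
    print(name, "->", windows_sort_to_bcp47(name))
```

With this mapping, "de-DE_phoneb" becomes "de-DE-u-co-phonebk", while
"hu-HU_tchncl" falls through to "hu-HU-x-tchncl". The sketch does not
cover the "variant" option, and it leaves a name without an underscore,
such as "x-IV-mathan", unchanged.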

Patrick Andries


