The Unicode Standard and ISO

Marcel Schneider via Unicode unicode at unicode.org
Sat Jun 9 01:23:33 CDT 2018


On Fri, 8 Jun 2018 09:20:09 -0700, Steven R. Loomis via Unicode wrote:
[…]
> But, it sounds like the CLDR process was successful in this case. Thank you for contributing.
 
You are welcome, but thanks are due to the actual corporate contributors.

[…]
> Actually, I think the particular data item you found is relatively new. The first values entered
> for it in any language were May 18th of this year.  Were there votes for "keycap" earlier?

The "keycap" category is found as soon as in v30 (released 2016-10-05).

> Rather than a tracer finding evidence of neglect, you are at the forefront of progressing the translated data for French. Congratulations!

The neglect is on my part as I neglected to check the data history. 
Please note that I did not make accusations of neglect. Again: The historic Code Charts translators, partly still active, sulk CLDR 
because Unicode is perceived as sulking ISO/IEC 15897, so that minimal staff is actively translating CLDR for the French locale and can 
legitimately feel forsaken. I even made detailed suppositions as of how it could happen that "keycap" remained untranslated.
 
[…] [Unanswered questions (please refer to my other e‐mails in this thread)]

> The registry for ISO/IEC 15897 has neither data for French, nor structure that would translate the term "Characters | Category | Label | keycap". 
> So there would be nothing to merge with there.

Correct. The only data for French is an ISO/IEC 646 charset:
http://std.dkuug.dk/cultreg/registrations/number/156
As far as I can see there are available data to merge for Danish, Faroese, Finnish Greenlandic, Norwegian, and Swedish.

> So, historically, CLDR began not a part of Unicode, but as part of Li18nx under the Free Standards Group. See the bottom of the page 
> http://cldr.unicode.org/index/acknowledgments
> "The founding members of the workgroup were IBM, Sun and OpenOffice.org". 
> What we were trying to do was to provide internationalized content for Linux, and also, to resolve the then-disparity between locale data
> across platforms. Locale data was very divergent between platforms - spelling and word choice changes, etc.  Comparisons were done
> and a Common locale data repository  (with its attendant XML formats) emerged. That's the C in CLDR. Seed data came from IBM’s ICIR
> which dates many decades before 15897 (example 
> http://www.computinghistory.org.uk/det/13342/IBM-National-Language-Support-Reference-Manual-Volume-2/
> - 4th edition published in 1994.) 100 locales we contributed to glibc as well.

Thank you for the account and resources. The Linux Internationalization Initiative appears to have issued a last release on August 23, 2000:
https://www.redhat.com/en/about/press-releases/83
the year before ISO/IEC 15897 was lastly updated:
http://std.dkuug.dk/cultreg/registrations/chreg.htm

> Where there is opportunity for productive sync and merging with is glibc. We have had some discussions, but more needs to be
> done- especially a lot of tooling work. Currently many bug reports are duplicated between glibc and cldr, a sort of manual synchronization.
> Help wanted here. 

Noted. For my part, sadly for C libraries I’m unlikely to be of any help.

Marcel



More information about the Unicode mailing list