Corrigendum #9

Richard Wordingham richard.wordingham at ntlworld.com
Mon Jun 2 13:05:11 CDT 2014


On Mon, 2 Jun 2014 10:17:04 -0700
Markus Scherer <markus.icu at gmail.com> wrote:

> CLDR collation data defines special contraction mappings that start
> with a noncharacter, for
> http://www.unicode.org/reports/tr35/tr35-collation.html#CJK_Index_Markers

> In CLDR 23 and before (when we were still using XML collation syntax),
> these were raw noncharacters in the .xml files.

> As I said earlier:
> it should be ok to include noncharacters in CLDR data files for
> processing by CLDR implementations, and it should be possible to edit
> and diff and version-control and web-view those files etc.

They come as a nasty shock when someone thinks XML files are marked-up
text files.  I'm still surprised that the published human-readable form
of CLDR files should contain automatically applied non-Unicode copyright
claims.

Richard.


More information about the Unicode mailing list