Mapping Unicode script name to CLDR script code

Kip Cole kipcole9 at gmail.com
Sat Mar 13 19:41:22 CST 2021


I note that https://unicode-org.github.io/cldr-staging/charts/39/supplemental/languages_and_scripts.html <https://unicode-org.github.io/cldr-staging/charts/39/supplemental/languages_and_scripts.html> does map from Unicode language name (at least informally) to CLDR language code but that mapping isn’t, as far as I can see, in supplementalData.xml.


> On 14 Mar 2021, at 9:35 am, Kip Cole <kipcole9 at gmail.com> wrote:
> 
> Using the script properties (from scripts.txt in the Unicode repo for example), the script of some text can be detected. 
> 
> However I am not able to find a mapping from Unicode script names to CLDR script codes.  Ie a way to map "Hirigana -> Jpan" or "Javanese -> Java".
> 
> I’ve checked supplementalData.xml and scriptMetadata.txt to no avail.
> 
> Is there a canonical mapping somewhere?
> 
> Many thanks, —Kip
> 
> 
> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://corp.unicode.org/pipermail/cldr-users/attachments/20210314/ebadf1a6/attachment-0001.htm>


More information about the CLDR-Users mailing list