question about identifying CLDR coverage % for Amharic
Tom Bishop, Wenlin Institute
tangmu at wenlin.com
Thu Feb 23 15:58:01 CST 2017
In this part of am.xml the content all looks Amharic:
<type key="collation" type="phonebook">የስልክ ደብተር ድርድር ቅደም ተከተል</type>
<type key="collation" type="phonetic">የፎነቲክ ድርደራ ቅደም ተከተል</type>
<type key="collation" type="pinyin">ፒንይን የድርድር ቅደም ተከተል</type>
In this part English and Amharic content are mixed:
<characterLabel type="modifier">Modifier</characterLabel>
<characterLabel type="musical_symbols">የሙዚቃ ምልክቶች</characterLabel>
<characterLabel type="nature">ተፈጥሮ</characterLabel>
<characterLabel type="nonspacing">Nonspacing</characterLabel>
<characterLabel type="numbers">ቁጥሮች</characterLabel>
<characterLabel type="objects">Objects</characterLabel>
<characterLabel type="other">ሌሎች</characterLabel>
Is there an established system to derive a meaningful "percentage of CLDR coverage for Amharic" from the data? Just from these (not really random) examples one might estimate 70%.
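For what it is worth, here is a rough Python sketch of how the 70% guess above could be counted mechanically. It is only a crude heuristic and not an established CLDR coverage measure: it counts leaf elements whose text contains at least one character from the Ethiopic blocks. The names SAMPLE, has_ethiopic and script_ratio are made up for this illustration, and the assumption that "contains Ethiopic letters" means "translated" is mine. It will miscount values that legitimately contain no Ethiopic letters, and it ignores anything inherited from parent locales.

```python
import xml.etree.ElementTree as ET

# Unicode blocks: Ethiopic, Ethiopic Supplement, Ethiopic Extended, Ethiopic Extended-A
ETHIOPIC_RANGES = [(0x1200, 0x137F), (0x1380, 0x139F),
                   (0x2D80, 0x2DDF), (0xAB00, 0xAB2F)]

# The ten elements quoted in this message, wrapped in a root element.
SAMPLE = """<ldml>
<type key="collation" type="phonebook">የስልክ ደብተር ድርድር ቅደም ተከተል</type>
<type key="collation" type="phonetic">የፎነቲክ ድርደራ ቅደም ተከተል</type>
<type key="collation" type="pinyin">ፒንይን የድርድር ቅደም ተከተል</type>
<characterLabel type="modifier">Modifier</characterLabel>
<characterLabel type="musical_symbols">የሙዚቃ ምልክቶች</characterLabel>
<characterLabel type="nature">ተፈጥሮ</characterLabel>
<characterLabel type="nonspacing">Nonspacing</characterLabel>
<characterLabel type="numbers">ቁጥሮች</characterLabel>
<characterLabel type="objects">Objects</characterLabel>
<characterLabel type="other">ሌሎች</characterLabel>
</ldml>"""


def has_ethiopic(text):
    """True if any character of text falls in one of the Ethiopic blocks."""
    return any(lo <= ord(ch) <= hi for ch in text for lo, hi in ETHIOPIC_RANGES)


def script_ratio(root):
    """Return (ethiopic, total) over leaf elements whose text contains letters."""
    ethiopic = total = 0
    for elem in root.iter():
        if len(elem):  # skip container elements, count only leaves
            continue
        value = (elem.text or "").strip()
        if not any(ch.isalpha() for ch in value):  # skip empty or symbol-only values
            continue
        total += 1
        if has_ethiopic(value):
            ethiopic += 1
    return ethiopic, total


ethiopic, total = script_ratio(ET.fromstring(SAMPLE))
print(f"{ethiopic}/{total} = {100 * ethiopic / total:.0f}%")  # prints "7/10 = 70%"
```

To try it on the whole file, one could pass ET.parse("common/main/am.xml").getroot() to script_ratio, assuming core.zip was unpacked in the current directory.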
Tom
> On Feb 23, 2017, at 2:01 PM, Chris Leonard <cjl at sugarlabs.org> wrote:
>
> Download the latest version of CLDR
>
> http://unicode.org/Public/cldr/30.0.3/
>
> specifically the core.zip file
>
> Unzip the core.zip file, open the common folder, then the main folder, and look for am.xml and am_ET.xml. Both files are attached.
>
> Is that what you are looking for?
>
> cjl
>
> On Thu, Feb 23, 2017 at 12:18 PM, Isabelle Zaugg <iz6445a at student.american.edu> wrote:
> Dear All,
>
> I am working on my dissertation research and would like to identify the percentage of CLDR coverage for Amharic and the other languages utilizing the Ethiopic script. I would like to get a percentage coverage for today, as well as look at the increase over time. So far here is what I have been told:
>
> The files in common/main are organized by locale, each named by a code, e.g. fr.xml for French. So you can look for the ones you want. To get the codes, you can look at http://www.unicode.org/cldr/charts/latest/supplemental/languages_and_scripts.html. For example, Amharic is 'am' (the Ethiopic script itself has the script code 'Ethi'). For more information see the spec: http://unicode.org/reports/tr35/
> To get changes over versions, you would download successive versions. Just the am.xml should be good enough; the regional variants typically inherit.
> Following these guidelines, I was still unable to identify the percentage of CLDR coverage for Amharic. Is there anyone who could help me with this issue?
>
> Thank you,
>
> Isabelle
>
>
> Isabelle Zaugg
> Fulbright-Hays Doctoral Dissertation Research Abroad Fellow in ኢትዮጵያ
> PhD Candidate in Communication
> American University
> Washington, D.C.
>
>
> _______________________________________________
> CLDR-Users mailing list
> CLDR-Users at unicode.org
> http://unicode.org/mailman/listinfo/cldr-users
Wenlin Institute, Inc. SPC (a Social Purpose Corporation)
文林研究所社会目的公司
Software for Learning Chinese
E-mail: wenlin at wenlin.com Web: http://www.wenlin.com
Telephone: 1-877-4-WENLIN (1-877-493-6546)
☯