Inconsistent RBNF Data?

Philippe Verdy verdy_p at wanadoo.fr
Fri Nov 11 03:53:58 CST 2016


Basically the needed changes are there for French ordinals:

[...]
  %%spellout-ordinal:
[...]
- 200: <%spellout-cardinal-masculine< cent>%%cents-o>;
+ 200: <%spellout-cardinal-masculine<-cent>%%cents-o>;
  1000: mill>%%mille-o>;
- 2000: <%%spellout-leading< mill>%%mille-o>;
+ 2000: <%%spellout-leading<-mill>%%mille-o>;
- 1000000: <%%spellout-leading< million>%%cents-o>;
+ 1000000: <%%spellout-leading<-million>%%cents-o>;
- 1000000000: <%%spellout-leading< milliard>%%cents-o>;
+ 1000000000: <%%spellout-leading<-milliard>%%cents-o>;
- 1000000000000: <%%spellout-leading< billion>%%cents-o>;
+ 1000000000000: <%%spellout-leading<-billion>%%cents-o>;
- 1000000000000000: <%%spellout-leading< billiard>%%cents-o>;
- 1000000000000000: <%%spellout-leading<-billiard>%%cents-o>;
  1000000000000000000: =#,##0=;
[...]

2016-11-11 8:44 GMT+01:00 Philippe Verdy <verdy_p at wanadoo.fr>:

> Well for French ordinals, this still uses old rules using spaces instead
> of hyphens.
>
> Fractions in French are spelled (like in English) as : cardinal
> (numerator) + space + ordinal (denominator)
> where the ordinal (denominator) here is taking the singular or plural
> form, according to the value of the leading cardinal (numerator) and French
> plural rules.
>
> With the old spelling rules of ordinals still using spaces (like in
> cardinals), it is difficult to guess the values being meant in
> - "(les) deux cent sept millièmes"
>
> But this ambiguity is solved cleanly in all cases by always using hyphens
> instead of spaces within ordinals.
>
> So a fraction will contain its last space just before the full ordinal for
> the denominator :
>
> - "(le) deux-cent-sept-millième" = (the) 1 / 207,000 = the 207,000th
> (singular)
> - "(les) deux-cent-sept-millièmes" = (the) 1 / 207,000 = the 207,000th
> (plural)
> - "(les) deux cent-sept-millièmes" = (the) 2 / 107,000 (plural only)
> - "(les) deux cent sept-millièmes" = (the) 200 / 7,000 (plural only)
> - "(les) deux-cent-sept millièmes" = (the) 207 / 1,000 (plural only)
>
> ----
> Note:  The ordinal (for the numerator) traditionally keeps using spaces
> instead of hyphens, except between tens and units, as in:
>
> - "(tens)-et-un" (10n+1), "(tens)-deux" (10n+2), ... "(tens)-neuf" (10n+9),
> - "soixante-dix" (70) = "septante" (in Belgian French and Swiss French),
> - "soixante-et-onze" (71) = "septante-et-un" (in Belgian French and Swiss
> French),
> - "soixante-douze" (72) = "septante-deux" (in Belgian French and Swiss
> French),
> - "soixante-treize" (73) = "septante-trois" (in Belgian French and Swiss
> French),
> - "soixante-quatorze" (74) = "septante-quatre" (in Belgian French and
> Swiss French),
> - "soixante-quinze" (75) = "septante-cinq" (in Belgian French and Swiss
> French),
> - "soixante-seize" (76) = "septante-six" (in Belgian French and Swiss
> French),
> - "soixante dix-sept" (77) = "septante-sept" (in Belgian French and Swiss
> French),
> - "soixante dix-huit" (78) = "septante-huit" (in Belgian French and Swiss
> French),
> - "soixante dix-neuf" (79) = "septante-neuf" (in Belgian French and Swiss
> French),
> - "quatre-vingt" (80) = "octante" (in Belgian French and Swiss French),
> - "quatre-vingt onze" (81) = "octante-et-un" (in Belgian French and Swiss
> French),
> - "quatre-vingt douze" (82) = "octante-deux" (in Belgian French and Swiss
> French),
> - "quatre-vingt treize" (83) = "octante-trois" (in Belgian French and
> Swiss French),
> - "quatre-vingt quatorze" (84) = "octante-quatre" (in Belgian French and
> Swiss French),
> - "quatre-vingt quinze" (85) = "octante-cinq" (in Belgian French and Swiss
> French),
> - "quatre-vingt seize" (86) = "octante-six" (in Belgian French and Swiss
> French),
> - "quatre-vingt dix-sept" (87) = "octante-sept" (in Belgian French and
> Swiss French),
> - "quatre-vingt dix-huit" (88) = "octante-huit" (in Belgian French and
> Swiss French),
> - "quatre-vingt dix-neuf" (89) = "octante-neuf" (in Belgian French and
> Swiss French),
>
> But even in this list of cardinals spaces are also possible everywhere
> instead of hyphens between tens and units, the hyphen is strongly
> recommended only in "quatre-vingt". It is only the most common usage to use
> hyphens between tens an unit in cardinals.
>
> In legal documents, cardinals are written using hyphens everywhere instead
> of spaces.
>
> In all cases (traditional, most common, or legal), the space remains
> mandatory (an hyphen is strictly forbidden there) only between the
> numerator cardinal and the denominator ordinal of a fraction.
>
>
> 2016-11-11 2:10 GMT+01:00 Kent Karlsson <kent.karlsson14 at telia.com>:
>
>> Looking at the RBNF source can be hard to follow (especially for the more
>> complicated cases, even Italian is
>> quite complex).
>>
>> Though I used my own program for testing nearly a decade ago when I
>> worked on this, there is now a public
>> web page (not made by me, but by the person that took over maintaining
>> the RBNF rules) for testing RBNFs:
>>
>> http://st.unicode.org/cldr-apps/numbers.jsp
>>
>> This is easier to follow than the rules themselves and can be used to
>> find errors and test fixes to the RBNF rules.
>> Note that the rules are in ICU format, not in the XML format found in
>> CLDR. You can edit the rules, and the
>> numbers to be used for testing.
>>
>> /Kent K
>>
>> Den 2016-11-10 11:30, skrev "Rafael Xavier" <rxaviers at gmail.com>:
>>
>> That's not quite true for Portuguese - the forms are a bit different. I
>> had a quick look at the RBNF though and it looks correct there.
>>
>> http://unicode.org/cldr/trac/browser/tags/release-30-d05/com
>> mon/rbnf/pt.xml#L78
>>
>>
>> +1 confirming existing RBNF for pt is correct (as a native Portuguese
>> speaker and [1]).
>>
>> 1: http://veja.abril.com.br/blog/sobre-palavras/consultorio/duz
>> entas-mil-pessoas-ou-duzentos-mil-pessoas/
>>
>> On Thu, Nov 10, 2016 at 6:42 AM, Fòram na Gàidhlig <
>> fios at foramnagaidhlig.net> wrote:
>>
>> Sgrìobh Kent Karlsson na leanas 09/11/2016 aig 21:23:
>> > Right, I misread.
>> >
>> > I note that both Spanish and Portuguese has "ciento" as corrections to
>> > the "-feminine" cases.
>>
>> That's not quite true for Portuguese - the forms are a bit different. I
>> had a quick look at the RBNF though and it looks correct there.
>>
>> http://unicode.org/cldr/trac/browser/tags/release-30-d05/com
>> mon/rbnf/pt.xml#L78
>> _______________________________________________
>> CLDR-Users mailing list
>> CLDR-Users at unicode.org
>> http://unicode.org/mailman/listinfo/cldr-users
>>
>>
>>
>>
>> _______________________________________________
>> CLDR-Users mailing list
>> CLDR-Users at unicode.org
>> http://unicode.org/mailman/listinfo/cldr-users
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/cldr-users/attachments/20161111/08c350aa/attachment-0001.html>


More information about the CLDR-Users mailing list