Inconsistent RBNF Data?
Philippe Verdy
verdy_p at wanadoo.fr
Fri Nov 11 04:03:12 CST 2016
As well this additional subrule:
[...]
%%mille-o:
0: ième;
1: e-=%%et-unieme=;
- 2: e =%%spellout-ordinal=;
+ 2: e-=%%spellout-ordinal=;
11: e-et-onzième;
- 12: e =%%spellout-ordinal=;
+ 12: e-=%%spellout-ordinal=;
[...]
2016-11-11 10:59 GMT+01:00 Philippe Verdy <verdy_p at wanadoo.fr>:
> I forgot this additional subrule to replace spaces by hyphens in ordinals:
>
> [...]
> %%cents-o
> 0: ième;
> 1: -=%%et-unieme=;
> - 2: ' =%%spellout-ordinal=;
> + 2: -=%%spellout-ordinal=;
> 11: -et-onzième;
> - 12: ' =%%spellout-ordinal=;
> - 12: -=%%spellout-ordinal=;
> [...]
>
> 2016-11-11 10:53 GMT+01:00 Philippe Verdy <verdy_p at wanadoo.fr>:
>
>> Basically the needed changes are there for French ordinals:
>>
>> [...]
>> %%spellout-ordinal:
>> [...]
>> - 200: <%spellout-cardinal-masculine< cent>%%cents-o>;
>> + 200: <%spellout-cardinal-masculine<-cent>%%cents-o>;
>> 1000: mill>%%mille-o>;
>> - 2000: <%%spellout-leading< mill>%%mille-o>;
>> + 2000: <%%spellout-leading<-mill>%%mille-o>;
>> - 1000000: <%%spellout-leading< million>%%cents-o>;
>> + 1000000: <%%spellout-leading<-million>%%cents-o>;
>> - 1000000000: <%%spellout-leading< milliard>%%cents-o>;
>> + 1000000000: <%%spellout-leading<-milliard>%%cents-o>;
>> - 1000000000000: <%%spellout-leading< billion>%%cents-o>;
>> + 1000000000000: <%%spellout-leading<-billion>%%cents-o>;
>> - 1000000000000000: <%%spellout-leading< billiard>%%cents-o>;
>> - 1000000000000000: <%%spellout-leading<-billiard>%%cents-o>;
>> 1000000000000000000: =#,##0=;
>> [...]
>>
>> 2016-11-11 8:44 GMT+01:00 Philippe Verdy <verdy_p at wanadoo.fr>:
>>
>>> Well for French ordinals, this still uses old rules using spaces instead
>>> of hyphens.
>>>
>>> Fractions in French are spelled (like in English) as : cardinal
>>> (numerator) + space + ordinal (denominator)
>>> where the ordinal (denominator) here is taking the singular or plural
>>> form, according to the value of the leading cardinal (numerator) and French
>>> plural rules.
>>>
>>> With the old spelling rules of ordinals still using spaces (like in
>>> cardinals), it is difficult to guess the values being meant in
>>> - "(les) deux cent sept millièmes"
>>>
>>> But this ambiguity is solved cleanly in all cases by always using
>>> hyphens instead of spaces within ordinals.
>>>
>>> So a fraction will contain its last space just before the full ordinal
>>> for the denominator :
>>>
>>> - "(le) deux-cent-sept-millième" = (the) 1 / 207,000 = the 207,000th
>>> (singular)
>>> - "(les) deux-cent-sept-millièmes" = (the) 1 / 207,000 = the 207,000th
>>> (plural)
>>> - "(les) deux cent-sept-millièmes" = (the) 2 / 107,000 (plural only)
>>> - "(les) deux cent sept-millièmes" = (the) 200 / 7,000 (plural only)
>>> - "(les) deux-cent-sept millièmes" = (the) 207 / 1,000 (plural only)
>>>
>>> ----
>>> Note: The ordinal (for the numerator) traditionally keeps using spaces
>>> instead of hyphens, except between tens and units, as in:
>>>
>>> - "(tens)-et-un" (10n+1), "(tens)-deux" (10n+2), ... "(tens)-neuf"
>>> (10n+9),
>>> - "soixante-dix" (70) = "septante" (in Belgian French and Swiss French),
>>> - "soixante-et-onze" (71) = "septante-et-un" (in Belgian French and
>>> Swiss French),
>>> - "soixante-douze" (72) = "septante-deux" (in Belgian French and Swiss
>>> French),
>>> - "soixante-treize" (73) = "septante-trois" (in Belgian French and Swiss
>>> French),
>>> - "soixante-quatorze" (74) = "septante-quatre" (in Belgian French and
>>> Swiss French),
>>> - "soixante-quinze" (75) = "septante-cinq" (in Belgian French and Swiss
>>> French),
>>> - "soixante-seize" (76) = "septante-six" (in Belgian French and Swiss
>>> French),
>>> - "soixante dix-sept" (77) = "septante-sept" (in Belgian French and
>>> Swiss French),
>>> - "soixante dix-huit" (78) = "septante-huit" (in Belgian French and
>>> Swiss French),
>>> - "soixante dix-neuf" (79) = "septante-neuf" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt" (80) = "octante" (in Belgian French and Swiss French),
>>> - "quatre-vingt onze" (81) = "octante-et-un" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt douze" (82) = "octante-deux" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt treize" (83) = "octante-trois" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt quatorze" (84) = "octante-quatre" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt quinze" (85) = "octante-cinq" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt seize" (86) = "octante-six" (in Belgian French and Swiss
>>> French),
>>> - "quatre-vingt dix-sept" (87) = "octante-sept" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt dix-huit" (88) = "octante-huit" (in Belgian French and
>>> Swiss French),
>>> - "quatre-vingt dix-neuf" (89) = "octante-neuf" (in Belgian French and
>>> Swiss French),
>>>
>>> But even in this list of cardinals spaces are also possible everywhere
>>> instead of hyphens between tens and units, the hyphen is strongly
>>> recommended only in "quatre-vingt". It is only the most common usage to use
>>> hyphens between tens an unit in cardinals.
>>>
>>> In legal documents, cardinals are written using hyphens everywhere
>>> instead of spaces.
>>>
>>> In all cases (traditional, most common, or legal), the space remains
>>> mandatory (an hyphen is strictly forbidden there) only between the
>>> numerator cardinal and the denominator ordinal of a fraction.
>>>
>>>
>>> 2016-11-11 2:10 GMT+01:00 Kent Karlsson <kent.karlsson14 at telia.com>:
>>>
>>>> Looking at the RBNF source can be hard to follow (especially for the
>>>> more complicated cases, even Italian is
>>>> quite complex).
>>>>
>>>> Though I used my own program for testing nearly a decade ago when I
>>>> worked on this, there is now a public
>>>> web page (not made by me, but by the person that took over maintaining
>>>> the RBNF rules) for testing RBNFs:
>>>>
>>>> http://st.unicode.org/cldr-apps/numbers.jsp
>>>>
>>>> This is easier to follow than the rules themselves and can be used to
>>>> find errors and test fixes to the RBNF rules.
>>>> Note that the rules are in ICU format, not in the XML format found in
>>>> CLDR. You can edit the rules, and the
>>>> numbers to be used for testing.
>>>>
>>>> /Kent K
>>>>
>>>> Den 2016-11-10 11:30, skrev "Rafael Xavier" <rxaviers at gmail.com>:
>>>>
>>>> That's not quite true for Portuguese - the forms are a bit different. I
>>>> had a quick look at the RBNF though and it looks correct there.
>>>>
>>>> http://unicode.org/cldr/trac/browser/tags/release-30-d05/com
>>>> mon/rbnf/pt.xml#L78
>>>>
>>>>
>>>> +1 confirming existing RBNF for pt is correct (as a native Portuguese
>>>> speaker and [1]).
>>>>
>>>> 1: http://veja.abril.com.br/blog/sobre-palavras/consultorio/duz
>>>> entas-mil-pessoas-ou-duzentos-mil-pessoas/
>>>>
>>>> On Thu, Nov 10, 2016 at 6:42 AM, Fòram na Gàidhlig <
>>>> fios at foramnagaidhlig.net> wrote:
>>>>
>>>> Sgrìobh Kent Karlsson na leanas 09/11/2016 aig 21:23:
>>>> > Right, I misread.
>>>> >
>>>> > I note that both Spanish and Portuguese has "ciento" as corrections to
>>>> > the "-feminine" cases.
>>>>
>>>> That's not quite true for Portuguese - the forms are a bit different. I
>>>> had a quick look at the RBNF though and it looks correct there.
>>>>
>>>> http://unicode.org/cldr/trac/browser/tags/release-30-d05/com
>>>> mon/rbnf/pt.xml#L78
>>>> _______________________________________________
>>>> CLDR-Users mailing list
>>>> CLDR-Users at unicode.org
>>>> http://unicode.org/mailman/listinfo/cldr-users
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> CLDR-Users mailing list
>>>> CLDR-Users at unicode.org
>>>> http://unicode.org/mailman/listinfo/cldr-users
>>>>
>>>>
>>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/cldr-users/attachments/20161111/3172b402/attachment.html>
More information about the CLDR-Users
mailing list