Basic Latin digits, not everything else (was: RE: How the C programming language bridges the man-machine gap)
Doug Ewell
doug at ewellic.org
Mon Apr 18 12:41:59 CDT 2022
Hans Åberg wrote:
>> I think it's abundantly clear that the C standard, specifically "the
>> above list of decimal digits," applies to the Basic Latin digits
>> U+0030 through U+0039, and not to superscript digits, subscript
>> digits, negative circled digits, mathematical sans-serif bold digits,
>> or any other digits encoded in Unicode.
>
> The standard only says that from the point of view of C that those
> should be available, not how they should be represented.
The superscript European digits are not the same characters as the regular, full-size European digits, by either Unicode's definition of "same" or that of any other character encoding standard. Thus the C standard is only talking about 0123456789, not ⁰¹²³⁴⁵⁶⁷⁸⁹.
--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org
More information about the Unicode
mailing list