Basic Latin digits, not everything else (was: RE: How the C programming language bridges the man-machine gap)

Doug Ewell doug at
Mon Apr 18 12:41:59 CDT 2022

Hans Åberg wrote:

>> I think it's abundantly clear that the C standard, specifically "the
>> above list of decimal digits," applies to the Basic Latin digits
>> U+0030 through U+0039, and not to superscript digits, subscript
>> digits, negative circled digits, mathematical sans-serif bold digits,
>> or any other digits encoded in Unicode.
> The standard only says that, from the point of view of C, those
> should be available, not how they should be represented.

The superscript European digits are not the same characters as the regular, full-size European digits, by either Unicode's definition of "same" or that of any other character encoding standard. Thus the C standard is only talking about 0123456789, not ⁰¹²³⁴⁵⁶⁷⁸⁹.
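
To make the distinction concrete, here is a minimal sketch in C, assuming code points arrive as 32-bit integers. The helper names basic_latin_digit_value and check_examples are made up for this illustration and come from no standard or library. The sketch leans on two facts: the C standard requires the values of '0' through '9' to be consecutive, and the superscript digits sit at unrelated code points (U+2070, U+00B9, U+00B2, U+00B3, U+2074 through U+2079).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper (not from any standard or library):
 * returns 0..9 for the Basic Latin digits U+0030..U+0039,
 * and -1 for every other code point. */
static int basic_latin_digit_value(uint32_t cp)
{
    if (cp >= 0x0030 && cp <= 0x0039) {
        return (int)(cp - 0x0030);
    }
    return -1;
}

/* Illustrative checks; the function name is an assumption of this sketch. */
static void check_examples(void)
{
    /* C guarantees that each decimal digit after '0' has a value
     * one greater than the previous digit. */
    assert('9' - '0' == 9);

    /* U+0037 DIGIT SEVEN is one of "the above list of decimal digits". */
    assert(basic_latin_digit_value(0x0037) == 7);

    /* U+00B2 SUPERSCRIPT TWO and U+2077 SUPERSCRIPT SEVEN are
     * different characters, so they are not recognized. */
    assert(basic_latin_digit_value(0x00B2) == -1);
    assert(basic_latin_digit_value(0x2077) == -1);
}
```

Calling check_examples() from a main function should pass silently. Note that the superscript digits are not even contiguous among themselves, so the "subtract the zero digit" idiom could not work for them in any case.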

Doug Ewell, CC, ALB | Lakewood, CO, US |
