Basic Latin digits, not everything else (was: RE: How the C programming language bridges the man-machine gap)

Hans Åberg haberg-1 at
Mon Apr 18 02:46:09 CDT 2022

> On 18 Apr 2022, at 07:51, Doug Ewell via Unicode <unicode at> wrote:
> Hans Åberg wrote:
>>>> One can't use say the Unicode superscript numbers and their code
>>>> points directly for C.
>>> Was that part of the use case?
>> 5.2.1 Character sets
>>>> In both the source and execution basic character sets, the value of
>> each character after 0 in the above list of decimal digits shall be
>> one greater than the value of the previous.
> I think it's abundantly clear that the C standard, specifically "the above list of decimal digits," applies to the Basic Latin digits U+0030 through U+0039, and not to superscript digits, subscript digits, negative circled digits, mathematical sans-serif bold digits, or any other digits encoded in Unicode.

The standard only says that from the point of view of C that those should be available, not how they should be represented.

More information about the Unicode mailing list