Concise term for non-ASCII Unicode characters

Richard Wordingham richard.wordingham at ntlworld.com
Tue Sep 22 02:43:36 CDT 2015


On Sun, 20 Sep 2015 16:52:29 +0000
Peter Constable <petercon at microsoft.com> wrote:

> You already have been using "non-ASCII Unicode", which is about as
> concise and sufficiently accurate as you'll get. There's no term
> specifically defined in any standard or conventionally used for this.

As to standards, UTS#18 'Unicode Regular Expression' Requirement RL1.2
requires the support of the 'property' it calls 'ASCII', which is
defined in Section 1.2.1 as the property of being in the range U+0000 to
U+007F. This implicitly makes 'not ASCII' a derived property held by all
the other codepoints. If you fear that your audience will think that
Latin-1 characters are ASCII, you'll just have to go for the clumsy
'not 7-bit ASCII'  and accept that there isn't an unambiguous way in
English of turning that into an adjective or noun.

If a term were invented, you'd generally have to explain it, and you
would do better just to remind readers what ASCII is.

Richard.


More information about the Unicode mailing list