Bit arithmetic on Unicode characters?

Shawn Steele Shawn.Steele at
Thu Oct 6 18:39:37 CDT 2016

You can't even case Latin that way.  Unless maybe you only care about English.

-----Original Message-----
From: Unicode [mailto:unicode-bounces at] On Behalf Of Richard Wordingham
Sent: Thursday, October 6, 2016 4:33 PM
To: unicode at
Subject: Re: Bit arithmetic on Unicode characters?

On Thu, 6 Oct 2016 12:44:05 -0700
Garth Wallace <gwalla at> wrote:

> Other than converting between UTFs, is bit arithmetic commonly 
> performed on Unicode characters? I was under the impression that it's 
> a rarity if it is done at all.

It's possible to use it for the bulk of case folding, especially if the program only supports a specific repertoire.

For specialist tasks, exploiting arithmetic relationships make sense.
I would expect that most ASCII clones are handled that way.  The problem is that manually constructed lookup tables are prone to human error.


More information about the Unicode mailing list