Bit arithmetic on Unicode characters?

Shawn Steele Shawn.Steele at microsoft.com
Thu Oct 6 18:39:37 CDT 2016


You can't even case Latin that way, unless maybe you only care about English.
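
A minimal sketch of why the ASCII trick breaks down for Latin, assuming "that way" means clearing or setting bit 0x20 of the code point. The helper name `clear_case_bit` is made up for illustration, and Python's built-in `str.upper()` stands in for real Unicode case mapping.

```python
def clear_case_bit(ch: str) -> str:
    """Attempt to uppercase one code point by clearing bit 0x20 (the ASCII a-z trick)."""
    return chr(ord(ch) & ~0x20)


# Sample Latin letters, written as escapes so the file encoding does not matter:
#   U+0061 a, U+00E9 e-acute  : the trick happens to work
#   U+00FF y-diaeresis        : uppercase is U+0178, far outside Latin-1
#   U+00DF sharp s            : uppercase is the two-letter string "SS"
#   U+0131 dotless i          : uppercase is plain ASCII "I"
#   U+0101 a-macron           : case pairs in Latin Extended-A differ by 1, not 0x20
SAMPLES = ["a", "\u00e9", "\u00ff", "\u00df", "\u0131", "\u0101"]

for ch in SAMPLES:
    print(
        f"U+{ord(ch):04X}: bit trick -> {clear_case_bit(ch)!r}, "
        f"str.upper() -> {ch.upper()!r}"
    )
```

Only the first two samples agree with `str.upper()`; the rest need table data or special cases.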

-----Original Message-----
From: Unicode [mailto:unicode-bounces at unicode.org] On Behalf Of Richard Wordingham
Sent: Thursday, October 6, 2016 4:33 PM
To: unicode at unicode.org
Subject: Re: Bit arithmetic on Unicode characters?

On Thu, 6 Oct 2016 12:44:05 -0700
Garth Wallace <gwalla at gmail.com> wrote:

> Other than converting between UTFs, is bit arithmetic commonly 
> performed on Unicode characters? I was under the impression that it's 
> a rarity if it is done at all.

It's possible to use it for the bulk of case folding, especially if the program only supports a specific repertoire.
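
As a rough illustration of the "specific repertoire" case, here is a sketch that lowercases using only range checks and a fixed offset. The repertoire (ASCII, Latin-1, basic Greek and basic Cyrillic capitals) and the function name `fold_limited` are assumptions chosen for the example, not something from this thread.

```python
def fold_limited(ch: str) -> str:
    """Lowercase one code point by arithmetic alone.

    Assumption: only the capitals in the four ranges below are supported;
    every other code point is returned unchanged.
    """
    cp = ord(ch)
    in_ascii = 0x41 <= cp <= 0x5A                        # A-Z
    in_latin1 = 0xC0 <= cp <= 0xDE and cp != 0xD7        # skip U+00D7 MULTIPLICATION SIGN
    in_greek = 0x391 <= cp <= 0x3A9 and cp != 0x3A2      # U+03A2 is unassigned
    in_cyrillic = 0x410 <= cp <= 0x42F                   # basic Cyrillic capitals
    if in_ascii or in_latin1 or in_greek or in_cyrillic:
        # Addition rather than OR: Greek and Cyrillic capitals straddle a
        # 0x20 boundary, so `cp | 0x20` would leave e.g. U+03A0 unchanged.
        return chr(cp + 0x20)
    return ch


print("".join(fold_limited(c) for c in "ABC \u0391\u03a0\u03a9 \u0410\u042f"))
```

This covers only the bulk: full case folding also maps U+00DF to "ss", U+00B5 to U+03BC and U+03C2 to U+03C3, none of which fit a fixed offset.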

For specialist tasks, exploiting arithmetic relationships makes sense.
I would expect that most ASCII clones are handled that way.  The problem with the alternative is that manually constructed lookup tables are prone to human error.
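
One possible reading of "ASCII clones" is a block such as the fullwidth forms, where U+FF01..U+FF5E mirror U+0021..U+007E at a constant offset of 0xFEE0. The sketch below takes that reading as an assumption. It maps by subtraction and then cross-checks the arithmetic against Python's `unicodedata` module instead of a hand-typed table. The function names are hypothetical.

```python
import unicodedata

FULLWIDTH_FIRST = 0xFF01  # FULLWIDTH EXCLAMATION MARK
FULLWIDTH_LAST = 0xFF5E   # FULLWIDTH TILDE
OFFSET = FULLWIDTH_FIRST - 0x21  # 0xFEE0


def fullwidth_to_ascii(text: str) -> str:
    """Map fullwidth forms to ASCII by subtracting a constant; leave the rest alone."""
    return "".join(
        chr(ord(ch) - OFFSET) if FULLWIDTH_FIRST <= ord(ch) <= FULLWIDTH_LAST else ch
        for ch in text
    )


def offset_matches_ucd() -> bool:
    """Check the arithmetic against the compatibility mappings applied by NFKC."""
    return all(
        unicodedata.normalize("NFKC", chr(cp)) == chr(cp - OFFSET)
        for cp in range(FULLWIDTH_FIRST, FULLWIDTH_LAST + 1)
    )


print(fullwidth_to_ascii("\uff21\uff22\uff23\uff11"))
print(offset_matches_ucd())
```

Verifying against the character database like this is one way to get the compactness of arithmetic without trusting a manually maintained table.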

Richard.


