Bit arithmetic on Unicode characters?
Shawn.Steele at microsoft.com
Thu Oct 6 18:39:37 CDT 2016
You can't even case Latin that way. Unless maybe you only care about English.
From: Unicode [mailto:unicode-bounces at unicode.org] On Behalf Of Richard Wordingham
Sent: Thursday, October 6, 2016 4:33 PM
To: unicode at unicode.org
Subject: Re: Bit arithmetic on Unicode characters?
On Thu, 6 Oct 2016 12:44:05 -0700
Garth Wallace <gwalla at gmail.com> wrote:
> Other than converting between UTFs, is bit arithmetic commonly
> performed on Unicode characters? I was under the impression that it's
> a rarity if it is done at all.
It's possible to use it for the bulk of case folding, especially if the program only supports a specific repertoire.
For specialist tasks, exploiting arithmetic relationships make sense.
I would expect that most ASCII clones are handled that way. The problem is that manually constructed lookup tables are prone to human error.
More information about the Unicode