German sharp S uppercase mapping
Asmus Freytag
asmusf at ix.netcom.com
Tue Nov 26 15:18:11 CST 2024
On 11/26/2024 8:59 AM, Peter Constable via Unicode wrote:
> The case pair stability requirement applies to default data. Conforming applications can still tailor case mapping behaviour use language-specific overrides. However, the default case pairs as defined in the SpecialCasing.txt and UnicodeData.txt files must remain stable so as not to break existing implementations of file systems or other identifier systems that depend on the default case pairs.
>
>
> Peter
The case pair stability is indeed important, if not essential for
identifiers. However, in the past, we may have been a bit cavalier on
how an identifier-safe casing (default) intersects with text processing
in general.
I think there's a case to be maded for someone writing a short proposal
to clarify whatever language we are using and to make clear, among other
things, that **text processing** should be using the appropriate
locale-based casing (from CLDR).
We should probably go further and explicitly reposition the "default"
casing as "identifier-safe" because that is what motivates the stability
requirements.
There's one more wrinkle. Because the sharp S is not natively used
outside German, it would be reasonable to have a multicultural case
mapping that treats this, and all cases where local usages do not
conflict, according to the evolving rules of of the local culture(s)
that use the character, so that there's an optional level between
identtifier stable and fully local case mappings.
To borrow a bit of terminology from some other implementation we need
these levels:
* InvariantCulture (stable default/identifier-safe)
* CommonCulture (current culture, common defaul texcept for
conflicting use)
* LocalCulture (current culture... one per locale ID)
* (Throw in regional if you have to, somewhere).
The ideal case would be one where out of the box text processing
operations can default to something that performs according to the
current (evolving) needs for as many cultures as possible (except where
they have contradictory treatment of the same character).
And where the fully invariant case is positioned to make clear where
it's applicable (identifier) and where its use is therefore mandatory.
A./
>
> -----Original Message-----
> From: Unicode<unicode-bounces at corp.unicode.org> On Behalf Of Daniel Buncic via Unicode
> Sent: November 26, 2024 2:44 AM
> To:unicode at corp.unicode.org
> Subject: Re: German sharp S uppercase mapping
>
> <snip>
>
> As to the case pair stability guarantee, ... if case folding changes in that language, then Unicode has to adjust. It would be poor service to the public to stick to a case mapping that is no longer valid just because Unicode came into existence at a time when it was still valid. ...
>
> <snip>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://corp.unicode.org/pipermail/unicode/attachments/20241126/5554bdbc/attachment.htm>
More information about the Unicode
mailing list