Why does the spelling (capitalization) of decomposition types differ in DerivedDecompositionType.txt from UnicodeData.txt?

Asmus Freytag asmusf at ix.netcom.com
Tue Feb 18 21:26:52 CST 2025


On 2/18/2025 10:44 AM, Asmus Freytag via Unicode wrote:
> The spellings are equivalent under the naming rules. That's all that 
> formally matters. Fixing this now, would break any literal-minded 
> parsers for whichever file is changed, while not making a formal 
> difference.
>
> There are enough other idiosyncrasies in the way these files are 
> organized, that this one is far from the worst.
>
> The only rule that matters is that any of the values in 
> PropertyValueAliases.txt, when matched without regard to case, 
> hyphens, or underscore, matches all the other ones for the same 
> property value.
Sorry, badly phased: any string that matches any of the ...
>
> For character names, spaces also don't count (but there are 2-3 odd 
> exceptional names that need to be handled specially).
>
> A./
>
> On 2/18/2025 8:04 AM, Phil Smith III via Unicode wrote:
>> This sounds interesting, but with no links or other references is a 
>> bit opaque. Can you add more information?
>>
>> -----Original Message-----
>> From: Unicode <unicode-bounces at corp.unicode.org> On Behalf Of 
>> prospero via Unicode
>> Sent: Monday, February 17, 2025 3:11 PM
>> To: unicode at corp.unicode.org
>> Subject: Why does the spelling (capitalization) of decomposition 
>> types differ in DerivedDecompositionType.txt from UnicodeData.txt?
>>
>> For example, "Nobreak" in DerivedDecompositionType.txt vs "noBreak" 
>> in UnicodeData.txt. If the former is derived from the latter, 
>> shouldn't the spelling be identical?
>>
>>
>



More information about the Unicode mailing list