<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);">Dear Daniel, dear all,</span><br><br></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);">the machine-readable area of ID cards and passports only supports a limited character set, so it stores a transcription of the name. For example, umlauts like ä, ö and ü are mapped to AE, OE and UE. ß is mapped to SS, which sometimes causes problems at borders, because officers may confuse the ß with other letters like the Cyrillic в. In Austrian passports this is solved by the remark „‚ß‘ entspricht / is equal to / correspond à ‚ss‘“. The capital ẞ is not supported in ISO/IEC 7810.</span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);"> </span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);">I agree, that the default mapping should be kept as SS for compatibility reasons, but a language-specific mapping could be implemented in ICU, similar to the special case handling of dotless and dotted I in Turkish. Unicode should annotate the ß character that its default uppercase mapping is SS, but the preferred mapping is <span style="background-color: rgb(255, 255, 255);">ẞ.</span></span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);"><span style="background-color: rgb(255, 255, 255);"> </span></span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);"><span style="background-color: rgb(255, 255, 255);">Best regards,</span></span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);"><span style="background-color: rgb(255, 255, 255);"> </span></span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"><span style="background-color: rgb(255, 255, 255);"><span style="background-color: rgb(255, 255, 255);">Marius Spix</span></span></div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"> </div>
<div style="font-family: 'verdana'; font-size: 12px; color: #000;"> </div>
<div id="sub-body-container" style="margin: 10px 5px 5px 10px; padding: 10px 0px 10px 10px; border-left: 2px solid rgb(195, 217, 229);">
<div style="margin: 0px 0px 10px;">
<div><strong>Gesendet: </strong>Dienstag, 26. November 2024 um 10:43</div>
<div><strong>Von: </strong>"Daniel Buncic via Unicode" <unicode@corp.unicode.org></div>
<div><strong>An: </strong>unicode@corp.unicode.org</div>
<div><strong>Betreff: </strong>Re: German sharp S uppercase mapping</div>
</div>
Dear Jules, dear all,<br><br>Thank you for bringing this up. As a German linguist and main editor of <br>two journals, I would like to give my opinion on this.<br><br>First of all, for the context. German ⟨ß⟩ never occurs at the beginning <br>of words. Consequently, the case mapping is necessary exclusively for <br>all-caps and small caps contexts, not for sentence case (or title case, <br>which is not used in German anyway). Originally, ⟨ß⟩ emerged in <br>‘German’ variants of the Latin alphabet, blackletter and Kurrent. <br>Although the exact origin is still unclear, it was always interpreted as <br>a ligature of either ⟨ſz⟩ (or rather ⟨ſʒ⟩) or ⟨ſs⟩. On the one hand, <br>the ⟨ß⟩ was also used in other languages like Polish or Hungarian (for <br>modern ⟨sz⟩) and even, in roman type (antiqua), for Latin or French ⟨ss⟩ <br>(cf. <a href="https://en.wikipedia.org/wiki/" target="_blank" rel="noopener noreferrer">https://en.wikipedia.org/wiki/</a>ß#Use_in_Roman_type). On the other <br>hand, traditionally, roman type did not have the letter ⟨ß⟩, which is <br>why it has a long tradition of being replaced with ⟨ss⟩.<br><br>This is why in lots of surnames, the spellings ⟨ß⟩ and ⟨ss⟩ exist side <br>by side. However, names like Geßler and Gessler, Meißner and Meissner, <br>Weiß and Weiss, Voß and Voss, although historically having <br>differentiated through chance spellings at times when the spelling was <br>not fixed yet, nowadays have a fixed spelling and are therefore regarded <br>as being different names, with their bearers insisting on the correct <br>spelling of their name as ⟨Heß⟩, not ⟨Hess⟩, etc.<br><br>Therefore it is important to keep ⟨ß⟩ and ⟨ss⟩ apart even in all-caps or <br>small caps (e.g. in forms that have to be filled out in capital letters, <br>in headings that only have capitals, etc.). Until 1998, the <br>then-normative dictionary, Duden, recommended generally using ⟨SS⟩ for <br>⟨ß⟩ but ⟨SZ⟩ for ⟨ß⟩ wherever the difference was important, giving the <br>example of ⟨in Massen⟩ /ʔɪnˈmasən/ ‘en masse’ vs. ⟨in Maßen⟩ <br>/ʔɪnˈmaːsən/ ‘in moderation’, where the latter would have to be <br>capitalized as ⟨IN MASZEN⟩. With proper names, however, this method <br>does not work, because many names also have a variant with ⟨sz⟩ (which <br>in some cases occurred through a different interpretation of ⟨ſʒ⟩ in the <br>old documents, in some cases through Hungarian mediation). See <br><a href="https://de.wikipedia.org/wiki/Geszler" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Geszler</a> vs. <br><a href="https://de.wikipedia.org/wiki/Ge" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Ge</a>ßler, <br><a href="https://de.wikipedia.org/wiki/Meiszner" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Meiszner</a> vs. <br><a href="https://de.wikipedia.org/wiki/Mei" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Mei</a>ßner_(Familienname), <br><a href="https://de.wikipedia.org/wiki/Weisz" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Weisz</a> vs. <br><a href="https://de.wikipedia.org/wiki/Wei" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Wei</a>ß_(Familienname), <br><a href="https://de.wikipedia.org/wiki/Vosz" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Vosz</a> vs. <br><a href="https://de.wikipedia.org/wiki/Vo" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Vo</a>ß, <a href="https://de.wikipedia.org/wiki/Hesz" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/Hesz</a> <br>vs. <a href="https://de.wikipedia.org/wiki/He" target="_blank" rel="noopener noreferrer">https://de.wikipedia.org/wiki/He</a>ß. This ⟨SZ⟩ rule (which was never <br>implemented in Unicode) was deservedly abolished in the spelling reform <br>of 1998.<br><br>For a long time, the practice therefore was that people printed their <br>name in forms as WEIß, VOß, GEßLER, etc., which was often misread as <br>WEIB, VOB, GEBLER, etc. The CAPS LOCK key on German computer keyboards <br>had no effect on ⟨ß⟩ and word processors also left ß unchanged when <br>changing a text to all-caps or small caps; in order to achieve the <br>desired result ⟨FUSSBALL⟩ ‘soccer’ or ⟨WEISSHAUSSTRASSE⟩ (literally <br>‘Whitehouse Street’, a streetcar stop in Cologne, which is definitely <br>lacking in readability in this all-caps version with 3×⟨SS⟩), you had to <br>change ⟨Fußball⟩ to ⟨Fussball⟩ or ⟨Weißhausstraße⟩ to ⟨Weisshausstrasse⟩ <br>manually, which many people did not do, so ⟨FUßBALL⟩ or ⟨WEIßHAUSSTRAßE⟩ <br>were non-normative spellings seen extremely often, even in texts printed <br>by renowned publishers or on official signs.<br><br>The solution for all this mess was the introduction of capital ⟨ẞ⟩, <br>which enables us to spell GEẞLER, MEIẞNER, WEIẞ, VOẞ, HEẞ, FUẞBALL, <br>WEIẞHAUSSTRAẞE, etc. in all-caps (or small caps) with an unambiguous <br>capital version of ⟨ß⟩. On contemporary computer keyboards, the CAPS <br>LOCK key produces capital ⟨ẞ⟩ when the ⟨ß⟩ key is pushed. (Note that <br>this is completely sufficient and that no separate capital ⟨ẞ⟩ key is <br>needed because the capital letter never has to be typed outside all-caps <br>environments, so SHIFT + ⟨ß⟩ can continue to produce the question mark.)<br><br>The council regulating official German orthography, which consists of <br>experts from Germany, Austria, Switzerland, Liechtenstein, northern <br>Italy and eastern Belgium, admitted the spelling with capital ⟨ẞ⟩ in <br>2017 and made it the standard form, with ⟨SS⟩ to be used only where <br>capital ⟨ẞ⟩ is not available, on 15 December 2023 (see <br><a href="https://grammis.ids-mannheim.de/rechtschreibung/6180" target="_blank" rel="noopener noreferrer">https://grammis.ids-mannheim.de/rechtschreibung/6180</a>).<br><br>The practice is also changing rapidly (and this is why the orthography <br>council made this decision, because they have a very descriptive rather <br>than prescriptive policy). Although many fonts still do not contain <br>capital ⟨ẞ⟩ and some word processors have not updated their <br>capitalization algorithms yet (maybe because they rely on Unicode?), you <br>see capital ⟨ẞ⟩ used more and more frequently, and it is eagerly taken <br>up by people called Weiß, Voß, Geßler, etc. The ambiguity of ⟨WEISS⟩, <br>⟨VOSS⟩, ⟨GESSLER⟩, etc., which leads to a real loss of information, <br>should be reason enough to change the capitalization rule of Unicode <br>from ß → SS to ß → ẞ, i.e. to change the relevant line in <br><a href="https://www.unicode.org/Public/16.0.0/ucd/CaseFolding.txt" target="_blank" rel="noopener noreferrer">https://www.unicode.org/Public/16.0.0/ucd/CaseFolding.txt</a> from<br>00DF; F; 0073 0073; # LATIN SMALL LETTER SHARP S<br>(actually, why isn’t it 0053 0053?)<br>to<br>00DF; C; 1E9E; # LATIN SMALL LETTER SHARP S,<br>delete the line<br>1E9E; F; 0073 0073; # LATIN CAPITAL LETTER SHARP S,<br>change “S” to “C” in the line<br>1E9E; S; 00DF; # LATIN CAPITAL LETTER SHARP S,<br>and then delete the treatment of ⟨ß⟩ in <br><a href="https://www.unicode.org/Public/16.0.0/ucd/SpecialCasing.txt" target="_blank" rel="noopener noreferrer">https://www.unicode.org/Public/16.0.0/ucd/SpecialCasing.txt</a> because ⟨ß⟩ <br>does not need special treatment anymore.<br><br>As to the case pair stability guarantee, I would like to stress that ⟨ß⟩ <br>is used in no other modern language than German, and if case folding <br>changes in that language, then Unicode has to adjust. It would be poor <br>service to the public to stick to a case mapping that is no longer valid <br>just because Unicode came into existence at a time when it was still <br>valid. And since capital ⟨ẞ⟩ has existed in Unicode since 2008, the use <br>of ⟨SS⟩ instead of it in a string of Unicode characters has to be <br>regarded as no longer valid (although workarounds with ⟨SS⟩ for fonts <br>that do not contain capital ⟨ẞ⟩ are not regarded as misspellings).<br><br>Capital ⟨ẞ⟩ is officially the normative capital letter corresponding to <br>⟨ß⟩ now. People use it wherever they can. There is not reason not to <br>use it except insufficient technology (or knowledge). So let’s update <br>the technology. Let’s update the Unicode standard.<br><br>Best wishes,<br><br>Daniel<br><br>-- <br>Prof. Dr. Daniel Bunčić<br>===============================================================<br>Slavisches Institut der Universität zu Köln<br>Weyertal 137, D-50931 Köln<br>Telefon: +49 (0)221 470-90535<br>Sprechstunden: <a href="https://uni.koeln/ENZEB" target="_blank" rel="noopener noreferrer">https://uni.koeln/ENZEB</a><br>E-Mail: daniel.buncic@uni-koeln.de = daniel@buncic.de<br>Threema: <a href="https://threema.id/8M375R5K" target="_blank" rel="noopener noreferrer">https://threema.id/8M375R5K</a><br>===============================================================<br>Homepage: <a href="http://daniel.buncic.de/" target="_blank" rel="noopener noreferrer">http://daniel.buncic.de/</a><br>Academia: <a href="http://uni-koeln.academia.edu/buncic" target="_blank" rel="noopener noreferrer">http://uni-koeln.academia.edu/buncic</a><br>ResearchGate: <a href="https://researchgate.net/profile/Daniel-Buncic-2" target="_blank" rel="noopener noreferrer">https://researchgate.net/profile/Daniel-Buncic-2</a><br>===============================================================</div>