global password strategies

Harriet Riddle harjitmoe at outlook.com
Sat Apr 9 16:14:38 CDT 2022


> One problem is indeed that some smart keyboards will alter text
> themselves.  I had a keyboard that enabled one to enter much NFC Latin
> text.  I switched keyboard interpreter (from corrected fcitx to
> ibus) when 32-bit operating systems were discontinued, and now for the
> same M17N keyboard definition it generates NFD at least some of the
> time.

Of course, then you come to CJK input methods.  My favourite go-to 
example is (depending on both vendor and locale) that some will generate 
U+301C 〜 while others will generate U+FF5E ~ for what is supposed to be 
the same character.  Normalisation doesn't help here: U+FF5E NFKCs to 
U+007E, while U+301C NFKCs to itself.

Another example is U+2212 − and U+FF0D -, which was causing enough 
problems that they had to be explicitly written into W3C/WHATWG 
standards as converting to the same JIS, even though the WHATWG Encoding 
Standard doesn't normally include one-way mappings like that, since 
Apple Japanese IMEs were entering U+2212 and Microsoft ones were 
entering U+FF0D.  Although there are multiple other examples where the 
two vendors map the same Shift JIS character to different Unicode, and 
this also affects codepoint choice by their respective IMEs, apparently 
this particular example was causing problems with systems expecting 
Shift JIS postal codes or something like that.  
https://www.w3.org/Bugs/Public/show_bug.cgi?id=28661

Again, normalisation doesn't help there, since U+FF0D NFKCs to U+002D 
and U+2212 NFKCs to itself.  These inter-vendor issues are not unique to 
Japanese either.

Suffice it to say, you don't want your password to fail to match because 
you typed it on a different operating system.

-- Har.


More information about the Unicode mailing list