Confusables.txt might be too sensitive

Marius Spix marius.spix at
Mon Jun 7 07:16:25 CDT 2021

I guess, the problem is that m looks similar to rn. For example,
the domain "pomhub dot com" is easily confusable with a well-known
website. But that also would work the other way around, e. g.
"rnicrosoft dot com". Italic m also looks identical to italic т
(Cyrillic t). But I agree that m should not be considered to be a mock
letter at all, especially in cases where only identifiers [a-zA-Z_] are
allowed for user names. But this will be a task of the
individual implementation, not for Unicode. 

On Sun, 6 Jun 2021 20:48:22 -0700
Mark Dawson via Unicode <unicode at> wrote:

> Dear Unicode Mailing List,
> I am a user of the metamask <> browser extension
> (which is a cryptocurrency wallet). My name always gets flagged as a
> potential scam simply because it contains the small Latin letter "m"
> (codepoint 006D). Someone contributing to the metamask project had
> the idea <>
> to give a warning message if someone is using a name that might
> contain suspicious characters. Seems like a good idea to me.
> The contributor decided to use TR39's confusable.txt
> <>
> file to flag suspicious characters. On line 3344 of the
> confusables.txt, it lists the small Latin letter "m" (codepoint 006D)
> as a source character for a confusable. Is this intentional?
> No other small Latin letter is flagged as a confusable. (Not even the
> letter "o"). Would Unicode consider removing the small Latin letter
> "m" as a source on the confusable.txt?
> Thanks,
> Mark
> [image: image.png]

More information about the Unicode mailing list