Hanb in domain labels
Martin J. Dürst
duerst at it.aoyama.ac.jp
Fri Aug 16 09:29:31 CDT 2024
Hello Henri,
I don't know about Chinese and Bopomofo, but for Japanese, there surely
are e.g. company names that contain both Kana and Kanji. And company
names are one (although of course not the only) use case for domain names.
I'm cc'ing Arnt, who is one of the authors of
https://www.ietf.org/archive/id/draft-gulbrandsen-smtputf8-nice-addresses-00.html,
which is about email addresses (quite a bit related to domain names) and
discusses Chinese quite a bit (although it doesn't mention Bopomofo).
Regards, Martin.
P.S.: draft-gulbrandsen-smtputf8-nice-addresses-00.html is in my view
still in a very early stage; I have read through it but still have to
write up my comments.
On 2024-08-15 18:08, Henri Sivonen via Unicode wrote:
> UTS #39 is commonly used as the baseline for detecting IDN spoofs, and UTS
> #39 explicitly allows combining Han and Bopomofo. Considering that ㄚ looks
> confusable with 丫 and ㄠ looks confusable with 幺, I’m wondering if it’s
> appropriate to explicitly allow this combination in the spoof detection
> context. Is combining Han and Bopomofo in one domain label something that
> occurs commonly enough in domains that aren’t intended to be spoofs for it
> being necessary not to treat the script combination as triggering spoof
> detection in the domain name context?
>
More information about the Unicode
mailing list