Hanb in domain labels

Martin J. Dürst duerst at it.aoyama.ac.jp
Fri Aug 16 09:29:31 CDT 2024


Hello Henri,

I don't know about Chinese and Bopomofo, but for Japanese, there surely 
are e.g. company names that contain both Kana and Kanji. And company 
names are one (although of course not the only) use case for domain names.

I'm cc'ing Arnt, who is one of the authors of 
https://www.ietf.org/archive/id/draft-gulbrandsen-smtputf8-nice-addresses-00.html, 
which is about email addresses (quite a bit related to domain names) and 
discusses Chinese quite a bit (although it doesn't mention Bopomofo).

Regards,   Martin.

P.S.: draft-gulbrandsen-smtputf8-nice-addresses-00.html is in my view 
still in a very early stage; I have read through it but still have to 
write up my comments.

On 2024-08-15 18:08, Henri Sivonen via Unicode wrote:
> UTS #39 is commonly used as the baseline for detecting IDN spoofs, and UTS
> #39 explicitly allows combining Han and Bopomofo. Considering that ㄚ looks
> confusable with 丫 and ㄠ looks confusable with 幺, I’m wondering if it’s
> appropriate to explicitly allow this combination in the spoof detection
> context. Is combining Han and Bopomofo in one domain label something that
> occurs commonly enough in domains that aren’t intended to be spoofs for it
> being necessary not to treat the script combination as triggering spoof
> detection in the domain name context?
> 


More information about the Unicode mailing list