<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">12 okt. 2023 kl. 00:37 skrev Doug Ewell via Unicode <<a href="mailto:unicode@corp.unicode.org" class="">unicode@corp.unicode.org</a>>:</div><br class="Apple-interchange-newline"><div class=""><div class="">Kent Karlsson wrote:<br class=""><br class=""><blockquote type="cite" class=""><blockquote type="cite" class="">Letʼs consider an equation that youʼll probably recognize, font<br class="">support willing: 𝐸 = 𝑚𝑐². Thanks to the power of Unicode, we could<br class="">use it in the same plain‐text document as, say, ℰ = 𝐦𝕔² while<br class="">keeping both<br class=""></blockquote><br class="">That's not really a proper way of representing math expressions.<br class="">For one thing, compatibility normalisation would ruin them (true,<br class="">one is not supposed to apply that, which I agree with, but it<br class="">sometimes is anyway).<br class=""></blockquote><br class="">I see this claim from time to time, and not only from Kent: we must not use character (sequence) X, or must not use it in contrast with character (sequence) Y which is compatibility-equivalent to X, because some random, unknown process might surreptitiously apply NFKC or NFKD to the text, obliterating the distinction.<br class=""><br class="">Can Kent, or anyone else, please identify a *specific* program or process that does this?<br class=""><br class="">If there are no attested, real-world examples of processes actually applying NFKC or NFKD behind the user’s back (which would indeed be evil), I’m likely to write this off as an urban myth.<br class=""></div></div></blockquote><font color="#5856d6" class=""><div style="caret-color: rgb(88, 86, 214);"><font color="#5856d6" class=""><span style="caret-color: rgb(88, 86, 214);" class=""><br class=""></span></font></div>It would be absolutely wonderful if it could (now) be written off, perhaps not as urban myth, but as old bugs. There have been even worse cases, removing ”accents” on e.g. åäö (ICU even has support for such a mapping). Just today, I saw a brand new(!) message where single apostrophe (not the ASCII one) somehow had been automatically replaced by three(!) question marks, likewise for some bullet point character (don’t know which one it was originally). So, while not NFKD/NFKC, that kind of ”downgrading<span style="caret-color: rgb(88, 86, 214);" class="">” changes to text still happen.</span></font></div><div><font color="#5856d6" class=""><span style="caret-color: rgb(88, 86, 214);" class=""><br class=""></span></font></div><div><font color="#5856d6" class=""><span style="caret-color: rgb(88, 86, 214);" class="">Now, I do not have access to, let alone able to test, all the software in the world…</span></font></div><div><font color="#5856d6" class=""><span style="caret-color: rgb(88, 86, 214);" class=""><br class=""></span></font></div><div><font color="#5856d6" class=""><span style="caret-color: rgb(88, 86, 214);" class="">/Kent K</span></font></div><div><font color="#5856d6" class=""><br class=""></font><blockquote type="cite" class=""><div class=""><div class="">--<br class="">Doug Ewell, CC, ALB | Lakewood, CO, US | <a href="http://ewellic.org" class="">ewellic.org</a><br class=""><br class=""></div></div></blockquote></div></body></html>