Decomposition/Compatibility Mapping Issue

Andre Schappo A.Schappo at lboro.ac.uk
Thu Sep 3 03:22:33 CDT 2015


So ............... I was looking at http://unicode.org/cldr/utility/regex.jsp?a=%5Cp%7Bscript%3DHan%7D&b=? and getting a cool looking Modified Regex Pattern. The last range ??-?? is CJK Compatibility Ideographs Supplement U+2F800-2FA1D.


[?-??-??-????-??-??-??-??-??-???-????-????-????-????-??]


So ....... then ....... I decided to copy/paste the above Modified Regex Pattern into Richard Ishida's Uniview http://r12a.github.io/uniview/


So ........ I then noticed that ?? U+2F800 was listed as ? U+4E3D  [CJK Unified Ideographs]


Thus the decomposition/compatibility mapping U+4E3D was being substituted for the original U+2F800.


I was using Safari on OS X Yosemite. I repeated the above with Chrome and Firefox and there was no problem, no substitution occurred. Thus it appears to be a copy/paste problem with Safari or code used by Safari.


I could have so easily missed this problem. I wonder if there are similar decomposition/compatibility mapping issues.


André Schappo

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/unicode/attachments/20150903/0c750edf/attachment.html>


More information about the Unicode mailing list