Conflicts between UnicodeData.txt and EastAsianWidth.txt?

Mike FABIAN maiku.fabian at gmail.com
Thu Nov 6 06:12:32 CST 2014


http://www.unicode.org/Public/7.0.0/ucd/EastAsianWidth.txt
contains:    

    302A..302D;W     # Mn     [4] IDEOGRAPHIC LEVEL TONE MARK..IDEOGRAPHIC ENTERING TONE MARK
    
which gives us a width of 2 for these 4 characters (because of “W”).

But
http://www.unicode.org/Public/7.0.0/ucd/UnicodeData.txt
contains:

    302A;IDEOGRAPHIC LEVEL TONE MARK;Mn;218;NSM;;;;;N;;;;;
    302B;IDEOGRAPHIC RISING TONE MARK;Mn;228;NSM;;;;;N;;;;;
    302C;IDEOGRAPHIC DEPARTING TONE MARK;Mn;232;NSM;;;;;N;;;;;
    302D;IDEOGRAPHIC ENTERING TONE MARK;Mn;222;NSM;;;;;N;;;;;
    
Doesn’t “NSM” (non spacing mark) imply a with of 0?

Is that a contradition or is this on purpose?

-- 
Mike FABIAN <mfabian at redhat.com>
睡眠不足はいい仕事の敵だ。


More information about the Unicode mailing list