Should U+3248 ... U+324F be wide characters?

Mike FABIAN via Unicode unicode at unicode.org
Thu Aug 17 09:24:08 CDT 2017


Asmus Freytag via Unicode <unicode at unicode.org> さんはかきました:

> On 8/16/2017 6:26 AM, Mike FABIAN via Unicode wrote:
>
>     EastAsianWidth.txt contains:
>     
>     3248..324F;A     # No     [8] CIRCLED NUMBER TEN ON BLACK SQUARE..CIRCLED NUMBER EIGHTY ON BLACK SQUARE
>     
>     i.e. it classifies the width of the characters at codepoints
>     between 3248 and 324F as ambiguous.
>     
>     Is this really correct? Shouldn’t they be “W”, i.e. wide?
>     
>     In most fonts these characters seem to be square shaped wide characters.
>
> "W" not only implies display width, but also a different treatment in the context of line
> breaking and vertical layout of text.
>
> "W" characters behave more like Ideographs, for the most part, while "N" are treated as
> forming words (for the most part).

Most emoji now have "W", for example:

1F600..1F64F;W   # So    [80] GRINNING FACE..PERSON WITH FOLDED HANDS

That seems correct because emoji behave more like Ideographs.

Isn’t this the same for “CIRCLED NUMBER TEN ON BLACK SQUARE”?
This seems to me also more like an Ideograph.

> "A" means, you get to decide whether to treat these as "W" or "N" based on context. If
> used in a non ideographic context, they behave like all other symbols (but happen to fill
> an EM square).
>
> A./
>

-- 
Mike FABIAN <mfabian at redhat.com>
睡眠不足はいい仕事の敵だ。


More information about the Unicode mailing list