From everson at evertype.com Sat Apr 1 12:24:25 2017
From: everson at evertype.com (Michael Everson)
Date: Sat, 1 Apr 2017 19:24:25 +0200
Subject: Proposal to add standardized variation sequences for chess notation
Message-ID: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Variation Sequences have been implemented for a number of symbol characters recently to make them useful for specialized purposes.

Here is a proposal which solves a long-standing problem for an important set of symbols in the UCS.

https://www.dropbox.com/sh/p9vga1dc2t02pqw/AABL4XwI-ZERDbnLJmvJJvtja?dl=0

Enjoy,
Michael Everson

From verdy_p at wanadoo.fr Sat Apr 1 14:21:36 2017
From: verdy_p at wanadoo.fr (Philippe Verdy)
Date: Sat, 1 Apr 2017 21:21:36 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID:

I like these proposed border-box characters, which were clearly missing from the box-drawing set (where they exist only when they pass through the center of a cell). However, unless they are used in monospaced fonts, I don't think that all of them have to match the width of the checkerboard cells, notably the 2 vertical and 4 corner ones, which can clearly be narrower (only the 2 horizontal ones, top or bottom, need to match the cell).

Also, if a variation selector is used for a white or black square, the rendering should still extend the width of the pieces drawn inside to center them in a square board cell. Pieces without these background selectors can still use proportional widths (for example in texts showing game positions).

Note also that for draughts pieces, in French they are not called "homme" (=man) and "roi" (=king), but "pion" (=pawn) and "dame" (or "reine", both meaning "queen" in chess, draughts and card games; the draughts game itself is named "dames", in the plural).
Many draughts and chess players may use chess pieces to play draughts (if there are not enough kings/queens among the chess pieces, they can just as well use other pieces except pawns). The board itself may be any suitable grid. Some will use grains or small rocks for pawns and real coins (white metal vs. yellow/red metal) for kings/queens. In classrooms (where pieces are too frequently lost), children build their own pieces out of colored paper or cardboard, and every player has in fact played with friends or family using such substitutes; it is even easier and more friendly than playing with two small smartphones/tablets and a connected app (those apps don't need Unicode encoding at all; they use their own graphics).

2017-04-01 19:24 GMT+02:00 Michael Everson:

> Variation Sequences have been implemented for a number of symbol characters recently to make them useful for specialized purposes.
>
> Here is a proposal which solves a long-standing problem for an important set of symbols in the UCS.
>
> https://www.dropbox.com/sh/p9vga1dc2t02pqw/AABL4XwI-ZERDbnLJmvJJvtja?dl=0
>
> Enjoy,
> Michael Everson

From christoph.paeper at crissov.de Sat Apr 1 14:57:07 2017
From: christoph.paeper at crissov.de (Christoph Päper)
Date: Sat, 1 Apr 2017 21:57:07 +0200 (CEST)
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID: <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>

Michael Everson:

> Variation Sequences have been implemented for a number of symbol characters recently to make them useful for specialized purposes.

This is where I still suspected there was an April Fools joke coming up.

> Here is a proposal which solves a long-standing problem for an important set of symbols in the UCS.
# Chesspiece on white versus Chesspiece on black variation sequences
25A1 FE00; White chessboard square; # WHITE SQUARE
25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL
2654 FE00; Chesspiece on white; # WHITE CHESS KING
2654 FE01; Chesspiece on black; # WHITE CHESS KING
2655 FE00; Chesspiece on white; # WHITE CHESS QUEEN
2655 FE01; Chesspiece on black; # WHITE CHESS QUEEN
2656 FE00; Chesspiece on white; # WHITE CHESS ROOK
2656 FE01; Chesspiece on black; # WHITE CHESS ROOK
2657 FE00; Chesspiece on white; # WHITE CHESS BISHOP
2657 FE01; Chesspiece on black; # WHITE CHESS BISHOP
2658 FE00; Chesspiece on white; # WHITE CHESS KNIGHT
2658 FE01; Chesspiece on black; # WHITE CHESS KNIGHT
2659 FE00; Chesspiece on white; # WHITE CHESS PAWN
2659 FE01; Chesspiece on black; # WHITE CHESS PAWN
265A FE00; Chesspiece on white; # BLACK CHESS KING
265A FE01; Chesspiece on black; # BLACK CHESS KING
265B FE00; Chesspiece on white; # BLACK CHESS QUEEN
265B FE01; Chesspiece on black; # BLACK CHESS QUEEN
265C FE00; Chesspiece on white; # BLACK CHESS ROOK
265C FE01; Chesspiece on black; # BLACK CHESS ROOK
265D FE00; Chesspiece on white; # BLACK CHESS BISHOP
265D FE01; Chesspiece on black; # BLACK CHESS BISHOP
265E FE00; Chesspiece on white; # BLACK CHESS KNIGHT
265E FE01; Chesspiece on black; # BLACK CHESS KNIGHT
265F FE00; Chesspiece on white; # BLACK CHESS PAWN
265F FE01; Chesspiece on black; # BLACK CHESS PAWN
26C0 FE00; Draughts piece on white; # WHITE DRAUGHTS MAN
26C0 FE01; Draughts piece on black; # WHITE DRAUGHTS MAN
26C1 FE00; Draughts piece on white; # WHITE DRAUGHTS KING
26C1 FE01; Draughts piece on black; # WHITE DRAUGHTS KING
26C2 FE00; Draughts piece on white; # BLACK DRAUGHTS MAN
26C2 FE01; Draughts piece on black; # BLACK DRAUGHTS MAN
26C3 FE00; Draughts piece on white; # BLACK DRAUGHTS KING
26C3 FE01; Draughts piece on black; # BLACK DRAUGHTS KING

□ U+25A1 and, especially, ▨ U+25A8 for empty fields on a board make no sense.
U+25A8 always shows as diagonals from the lower left to the upper right (much like a forward slash /). Black fields are often hatched this way, but could also be shown with a solid fill ■ U+25A0, a reverse diagonal fill ▧ U+25A7, a diamond pattern (diagonal crosshatch) ▩ U+25A9, a square pattern (orthogonal crosshatch) ▦ U+25A6, a vertical pattern ▥ U+25A5 or a horizontal pattern ▤ U+25A4.

I suggest you adopt a space character instead, e.g. U+2003 Em Space or U+2001 Em Quad.

2003 FE00; White chessboard square; # EM SPACE
2003 FE01; Black chessboard square; # EM SPACE

2001 FE00; White chessboard square; # EM QUAD
2001 FE01; Black chessboard square; # EM QUAD

□ U+25A1, ▢ U+25A2 or ⬚ U+2B1A would also work if you wanted a minimal amount of ink but not none.

25A1 FE00; White chessboard square; # WHITE SQUARE
25A1 FE01; Black chessboard square; # WHITE SQUARE

25A2 FE00; White chessboard square; # WHITE SQUARE WITH ROUNDED CORNERS
25A2 FE01; Black chessboard square; # WHITE SQUARE WITH ROUNDED CORNERS

2B1A FE00; White chessboard square; # DOTTED SQUARE
2B1A FE01; Black chessboard square; # DOTTED SQUARE

You should also evaluate a different approach altogether:

20DE FE00; Combining white chessboard square; # COMBINING ENCLOSING SQUARE
20DE FE01; Combining black chessboard square; # COMBINING ENCLOSING SQUARE

20DE FE00; Combining white background; # COMBINING ENCLOSING SQUARE
20DE FE01; Combining black background; # COMBINING ENCLOSING SQUARE

Although one would need to combine it with a space character as a base for empty fields, this would require only two new entries in StandardizedVariants.txt and be more flexible regarding alternate (Fairy Chess) game pieces – including emojis.
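The entries listed in this message follow a semicolon-separated layout: a base code point and a variation selector, a description, and a trailing comment giving the base character's name. As a rough illustration of how such lines could be read by a program, here is a minimal Python sketch. The names `VariationSequence` and `parse_variation_sequences` and the sample data are assumptions made for this example; they are not part of the proposal or of any Unicode tooling.

```python
from typing import List, NamedTuple


class VariationSequence(NamedTuple):
    base: int          # base character code point
    selector: int      # variation selector code point
    description: str   # e.g. "Chesspiece on black"
    base_name: str     # character name taken from the trailing comment


def parse_variation_sequences(text: str) -> List[VariationSequence]:
    """Parse lines such as '2654 FE01; Chesspiece on black; # WHITE CHESS KING'."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and whole-line comments
        data, _, comment = line.partition("#")
        fields = [field.strip() for field in data.split(";")]
        base_hex, selector_hex = fields[0].split()
        entries.append(
            VariationSequence(int(base_hex, 16), int(selector_hex, 16),
                              fields[1], comment.strip())
        )
    return entries


# Hypothetical sample: four of the entries quoted in the thread.
SAMPLE = """\
# Chesspiece on white versus Chesspiece on black variation sequences
25A1 FE00; White chessboard square; # WHITE SQUARE
25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL
2654 FE00; Chesspiece on white; # WHITE CHESS KING
2654 FE01; Chesspiece on black; # WHITE CHESS KING
"""

sequences = parse_variation_sequences(SAMPLE)
```

Each parsed entry keeps the base character and the selector separate, which makes it easy to see that the same base (for example 2654) appears once per selector.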
From everson at evertype.com Sat Apr 1 15:03:31 2017
From: everson at evertype.com (Michael Everson)
Date: Sat, 1 Apr 2017 22:03:31 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID: <1D03D028-D29C-4846-BCF0-0D15A7C30A2D@evertype.com>

On 1 Apr 2017, at 21:21, Philippe Verdy wrote:

> I like these proposed border-box characters which were clearly missing in the box-drawing set (where they exist only when they pass through the center of a cell).

This document does not propose any new characters.

Michael Everson

From everson at evertype.com Sat Apr 1 15:35:59 2017
From: everson at evertype.com (Michael Everson)
Date: Sat, 1 Apr 2017 22:35:59 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
Message-ID:

On 1 Apr 2017, at 21:57, Christoph Päper wrote:

> □ U+25A1 and, especially, ▨ U+25A8 for empty fields on a board make no sense.

Not so. Think about the data.

> U+25A8 always shows as diagonals from the lower left to the upper right (much like a forward slash /). Black fields are often hatched this way, but could also be shown with a solid fill ■ U+25A0, a reverse diagonal fill ▧ U+25A7, a diamond pattern (diagonal crosshatch) ▩ U+25A9, a square pattern (orthogonal crosshatch) ▦ U+25A6, a vertical pattern ▥ U+25A5 or a horizontal pattern ▤ U+25A4.

The *conventional* glyph used in international chess diagrams uses the character I chose (with the /// diagonals). That's the character which should be used to represent chess boards in plain text.
Nothing prevents a font designer from choosing to render it (in a chess font supporting this protocol) with a vertical pattern, or with dots, or as solid black, or whatever. Please distinguish characters from glyphs.

> I suggest you adopt a space character instead, e.g. U+2003 Em Space or U+2001 Em Quad.

No. Absolutely not. Spaces have a variety of properties. Spaces separate things, but are not things themselves. The white square on a chessboard is not a separating nothingness. It's a white square. Even when a chessboard is made of green and brown marble, one is still a white square, and one a black square. Even when chess pieces are made of yellow and red plastic, one is still a white piece and one is still a black piece. In this proposal the squares and the pieces are all graphic symbols, all with the So (Symbol, Other) property. Using the space characters you suggest would be a mistake; they have the Zs (Space Separator) property.

> □ U+25A1, ▢ U+25A2 or ⬚ U+2B1A would also work if you wanted a minimal amount of ink but not none.

Christoph, I've already implemented this and it works well and robustly. Glyphs could be altered in a variety of ways, but the point is that this is the kind of simple higher-level protocol which will solve a long-standing problem simply and easily, and allow the parsing of chess problems as text for analysis, and allow the generation of chess problem images from other descriptions of chess problems and solutions.

> 25A1 FE00; White chessboard square; # WHITE SQUARE
> 25A1 FE01; Black chessboard square; # WHITE SQUARE
>
> 25A2 FE00; White chessboard square; # WHITE SQUARE WITH ROUNDED CORNERS
> 25A2 FE01; Black chessboard square; # WHITE SQUARE WITH ROUNDED CORNERS
>
> 2B1A FE00; White chessboard square; # DOTTED SQUARE
> 2B1A FE01; Black chessboard square; # DOTTED SQUARE

Again there's no need to use a variety of characters to represent the chess squares. If you want to draw them in your font with a certain non-/// fill, you could.
But that is cosmetic and irrelevant. The point is to have the underlying board data as plain text, and that means just using the chess pieces and conventional white and black squares. Remember, the /// is drawn around the chess pieces with FE01, but for those no 25A8 is used. Please examine the non-OpenType plain text representations in Figure. They're even readable by humans.

> You should also evaluate a different approach altogether:
>
> 20DE FE00; Combining white chessboard square; # COMBINING ENCLOSING SQUARE
> 20DE FE01; Combining black chessboard square; # COMBINING ENCLOSING SQUARE
>
> 20DE FE00; Combining white background; # COMBINING ENCLOSING SQUARE
> 20DE FE01; Combining black background; # COMBINING ENCLOSING SQUARE
>
> Although one would need to combine it with a space character as a base for empty fields,

That's not remotely tempting. It would offer no advantage and would needlessly complicate the system.

> this would require only two new entries in StandardizedVariants.txt and be more flexible regarding alternate (Fairy Chess) game pieces – including emojis.

This proposal has nothing to do with emojis. This is a plain-text protocol for the representation of chessboard data in a parseable fashion. Should fairy chess characters be added to the standard, some additional entries would be added to StandardizedVariants.txt, yes. This is a finite set, however, and this should not be problematic to anybody. It's certainly simpler than a number of other recommended sequences which have been added to the standard for other purposes.

I thank you, sincerely, for your interest in this proposal; it has been considered and tested, however, and it works better than what you have proposed. I could prepare additional fonts using dotted or black glyphs for the black squares as you suggest, but the strength of this proposal is that you could achieve those glyphs by simply switching from one font to another, with the underlying chess data preserved.
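The plain-text protocol described in this message can be sketched in a few lines of Python: every board cell becomes either an empty-square character or a chess-piece character, followed by U+FE00 on a white square or U+FE01 on a black square. The function names, the row/column convention (row 0 is the top rank and the top-left square is white) and the sample rank below are assumptions made for illustration; they are not taken from the proposal.

```python
from typing import List, Optional

VS_WHITE = "\uFE00"     # proposed: glyph variant "on white square"
VS_BLACK = "\uFE01"     # proposed: glyph variant "on black square"
EMPTY_WHITE = "\u25A1"  # WHITE SQUARE
EMPTY_BLACK = "\u25A8"  # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL


def encode_rank(cells: List[Optional[str]], row: int) -> str:
    """Encode one rank; each cell is a piece character, or None if empty."""
    out = []
    for col, piece in enumerate(cells):
        on_white = (row + col) % 2 == 0  # assumption: top-left square is white
        if piece is None:
            out.append(EMPTY_WHITE + VS_WHITE if on_white
                       else EMPTY_BLACK + VS_BLACK)
        else:
            out.append(piece + (VS_WHITE if on_white else VS_BLACK))
    return "".join(out)


def strip_selectors(text: str) -> str:
    """Remove both selectors, leaving the fallback plain text."""
    return text.replace(VS_WHITE, "").replace(VS_BLACK, "")


# Black's back rank in the starting position (row 0 = top of the diagram).
back_rank = ["\u265C", "\u265E", "\u265D", "\u265B",
             "\u265A", "\u265D", "\u265E", "\u265C"]
encoded = encode_rank(back_rank, 0)
empty = encode_rank([None] * 8, 2)
```

Stripping the selectors from `encoded` returns the bare piece characters, and stripping them from `empty` leaves alternating U+25A1 and U+25A8, which is the kind of fallback text the message describes as still readable by humans.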
Michael Everson

From gwalla at gmail.com Sat Apr 1 15:42:01 2017
From: gwalla at gmail.com (Garth Wallace)
Date: Sat, 1 Apr 2017 13:42:01 -0700
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
Message-ID:

On Sat, Apr 1, 2017 at 12:57 PM, Christoph Päper <christoph.paeper at crissov.de> wrote:

> [...]
> □ U+25A1 and, especially, ▨ U+25A8 for empty fields on a board make no sense. U+25A8 always shows as diagonals from the lower left to the upper right (much like a forward slash /). Black fields are often hatched this way, but could also be shown with a solid fill ■ U+25A0, a reverse diagonal fill ▧ U+25A7, a diamond pattern (diagonal crosshatch) ▩ U+25A9, a square pattern (orthogonal crosshatch) ▦ U+25A6, a vertical pattern ▥ U+25A5 or a horizontal pattern ▤ U+25A4.

Technically any of those shadings would be understood (and I doubt anyone would notice if the lines ran in the other diagonal direction), but in practice dark squares in typeset diagrams are almost invariably hatched in the bottom left to top right direction. Diagrams in image form may use solid color fill, but that's not relevant to Unicode: this proposal is meant to provide a standardized basis for the existing practice of typesetting chess diagrams in black-and-white text, not to supplant images.

> [...]
> You should also evaluate a different approach altogether:
>
> 20DE FE00; Combining white chessboard square; # COMBINING ENCLOSING SQUARE
> 20DE FE01; Combining black chessboard square; # COMBINING ENCLOSING SQUARE
>
> 20DE FE00; Combining white background; # COMBINING ENCLOSING SQUARE
> 20DE FE01; Combining black background; # COMBINING ENCLOSING SQUARE
>
> Although one would need to combine it with a space character as a base for empty fields, this would require only two new entries in StandardizedVariants.txt and be more flexible regarding alternate (Fairy Chess) game pieces

COMBINING ENCLOSING SQUARE already has its own uses in fairy chess problems, to mark pieces with additional properties (such as paralyzing pieces or magic pieces) or transient identities (chameleons and half-neutrals). It would not be appropriate for this purpose.

> including emojis.

No chess symbols, encoded or proposed, are emoji, nor should they be.
From 637275 at gmail.com Sat Apr 1 16:09:55 2017
From: 637275 at gmail.com (Rebecca T)
Date: Sat, 1 Apr 2017 17:09:55 -0400
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
Message-ID:

> No chess symbols, encoded or proposed, are emoji, nor should they be.

Except on Samsung .

From kent.karlsson14 at telia.com Sat Apr 1 16:30:39 2017
From: kent.karlsson14 at telia.com (Kent Karlsson)
Date: Sat, 01 Apr 2017 23:30:39 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID:

2654 FE00; Chesspiece on white; # WHITE CHESS KING

Why do the ones with white background need a variation selector?

25A1 FE00; White chessboard square; # WHITE SQUARE
25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL

I see that you want a fallback in case the variation selectors aren't supported; but isn't the convention that one "always" starts with FE00 for each character that may have variation selectors applied?

So in this case, one would only need variation selector FE00; if applied to 25A1 or 25A8 it gives the chess board variety, and if applied to a chess piece character it gives a "checkered" ("black") background (without it, one gets the white background).

Why not use 25A0 BLACK SQUARE with the variation selector? (I know that it would not be entirely black with the variation selector (if not fallback).) I mean, there is no absolute LOGICAL NEED to draw the "black" background as WITH UPPER RIGHT TO LOWER LEFT FILL; it could go the other direction or be just "gray" (or for that matter medium blue...); font maker choice.
Kind regards
/Kent K

From verdy_p at wanadoo.fr Sat Apr 1 16:35:33 2017
From: verdy_p at wanadoo.fr (Philippe Verdy)
Date: Sat, 1 Apr 2017 23:35:33 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
Message-ID:

2017-04-01 23:09 GMT+02:00 Rebecca T <637275 at gmail.com>:

> > No chess symbols, encoded or proposed, are emoji, nor should they be.
>
> Except on Samsung
>
> .

Except that the sample in this article mixes the colors (a black symbol vs. a white piece emoji, only slightly darkened by its 3D shadows). I hope that Samsung is making a clear distinction in its emojis; otherwise it is not a replacement of the symbol, and skin color modifiers "may" have been used too.

Note that the previous discussion talks about black and white patterns, but in reality the patterns are just there to emulate color or lightness/darkness. I don't think there's any real difference if the pattern hatches are oriented like /// or \\\, or if grid patterns are rotated 0°, 30°, 45°: these patterns are used to get a visual feeling, as the exact number of strokes is not significant (only the visual black vs. white coverage rate is significant, and high-resolution devices may freely use thinner stroke widths depending on pixel/subpixel sizes, optical filters or ink droplet/powder sizes and absorption/diffusion by the printing support). The same applies to human skin color modifiers.

On typical color (or grey) displays or polychromatic printing, these patterns will not be used; real colorized fills will be used for more clarity.
Today's printing techniques use much higher precision, and the patterns used in old books or maps are no longer needed (and paper surface quality/regularity today is much better than it was in the past, even for basic newspapers using cheap recycled paper, where polychromatic printing is also used, notably on pages related to leisure time, games, TV programs, weather maps, photos of celebrities... and advertising! Printing masks are generated using high-resolution lasers and ink quality is much better too).

From verdy_p at wanadoo.fr Sat Apr 1 16:49:30 2017
From: verdy_p at wanadoo.fr (Philippe Verdy)
Date: Sat, 1 Apr 2017 23:49:30 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID:

I think it's all about sizing so that white or black cells will align, independently of the piece that may be within them.

However, if FE00 and FE01 give the background color distinction, why is the base different (25A1 vs. 25A8) for the empty cell, when it is no different for the same piece (in the same white or black color)?

I see the separation of the base only for the borders (touching the outside of the checkerboard), which may also be reduced to a minimal thin edge over a small margin... or nothing at all (completely transparent) if the font already includes a thin contrasting cell on the squares (e.g. grey ridges with 3D effects). The outer background on which these borders are drawn may also already contrast with another color (yellow, green, blue), and checkerboards may also use other pairs of contrasting colors (e.g. beige/ivory vs. brown): the FE00 and FE01 select an Emoji style with more freedom in shapes and colors for the piece and more precise and coherent sizes, but a required square cell.
Their absence just means an isolated piece outside the checkerboard, without required backgrounds or monospaced margins, suitable for inclusion in text.

2017-04-01 23:30 GMT+02:00 Kent Karlsson:

> 2654 FE00; Chesspiece on white; # WHITE CHESS KING
>
> Why do the ones with white background need a variation selector?
>
> 25A1 FE00; White chessboard square; # WHITE SQUARE
> 25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL
>
> I see that you want a fallback in case the variation selectors aren't supported; but isn't the convention that one "always" start with FE00 for each character that may have variation selectors applied?
>
> So in this case, one would only need variation selector FE00; if applied to 25A1 or 25A8 giving the chess board variety, if applied to a chess piece character, gives "checkered" ("black") background (without, one gets the white background).
>
> Why not use 25A0 BLACK SQUARE with the variation selector? (I know that it would not entirely black with the variation selector (if not fallback).) I mean, there is no absolute LOGICAL NEED to draw the "black" background as WITH UPPER RIGHT TO LOWER LEFT FILL, it could go the other direction or be just "gray" (or for that matter medium blue...); font maker choice.
>
> Kind regards
> /Kent K

From kent.karlsson14 at telia.com Sat Apr 1 16:57:06 2017
From: kent.karlsson14 at telia.com (Kent Karlsson)
Date: Sat, 01 Apr 2017 23:57:06 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>
Message-ID:

In addition, not directly related to your proposal, why aren't chess pieces listed in http://unicode.org/emoji/charts/emoji-variants.html?

It seems to me that chess pieces would be very well suited to each have an emoji variant (not to be used for the chess boards, maybe).
/Kent K

PS
Remember that Emoji style (or not) uses two OTHER variation selectors, FE0F (and FE0E).

From everson at evertype.com Sat Apr 1 18:31:19 2017
From: everson at evertype.com (Michael Everson)
Date: Sun, 2 Apr 2017 01:31:19 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References:
Message-ID: <2AB2D979-2370-4E95-897D-4D499472B4B2@evertype.com>

Kent,

Please do not drag chess pieces into discussions about emoji right now. Do it later if you must. This proposal is designed to provide a standardized higher-level protocol for the use of chess characters in chess data, to enable the chess community to make good use of the long-encoded chess characters.

Michael

> On 1 Apr 2017, at 23:57, Kent Karlsson wrote:
>
> In addition, not directly related to your proposal, why aren't chess pieces listed in http://unicode.org/emoji/charts/emoji-variants.html.
>
> It seems to me that chess pieces would be very well suited to have each an emoji variant (not to be used for the chess boards, maybe).
>
> /Kent K
>
> PS
> Remember that Emoji style (or not) uses two OTHER variation selectors, FE0F (and FE0E).

From everson at evertype.com Sat Apr 1 18:33:14 2017
From: everson at evertype.com (Michael Everson)
Date: Sun, 2 Apr 2017 01:33:14 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References:
Message-ID: <2B915B48-06DB-4C80-AEEE-BE255D2DD407@evertype.com>

On 1 Apr 2017, at 23:30, Kent Karlsson wrote:

> 2654 FE00; Chesspiece on white; # WHITE CHESS KING
>
> Why do the ones with white background need a variation selector?

Because for the typesetting to work the glyph has to have the same precise square metrics as the ones on the black square (it is not a "background"), and the chess characters when used as ordinary symbols in text need not have such metrics. (And do not, in most fonts.)
> 25A1 FE00; White chessboard square; # WHITE SQUARE
> 25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL
>
> I see that you want a fallback in case the variation selectors aren't supported;

I am not sure what you mean. If the variation selector isn't supported then the glyph will not have metrics suitable for setting a chessboard.

> but isn't the convention that one "always" start with FE00 for each character that may have variation selectors applied?

I don't know what you mean by this. As shown in Figure 2, a white knight for instance may occur on its own, or may occur on a white board square or on a black board square. I don't think the first needs differentiation, which is why the variation sequences apply only to the on-board-square glyphs.

> So in this case, one would only need variation selector FE00; if applied to 25A1 or 25A8 giving the chess board variety, if applied to a chess piece character, gives "checkered" ("black") background (without, one gets the white background).

No, a chesspiece symbol can (and nearly always does) appear on its own in text without square metrics. "Being on a white square" is a specific glyph state, different from "being a symbol on its own".

> Why not use 25A0 BLACK SQUARE with the variation selector? (I know that it would not entirely black with the variation selector (if not fallback).)

Because the conventional international shading for a black square is the /// one, and using that facilitates legibility in environments where OpenType features are not enabled even if the VS characters are present.

> I mean, there is no absolute LOGICAL NEED to draw the "black" background as WITH UPPER RIGHT TO LOWER LEFT FILL, it could go the other direction or be just "gray" (or for that matter medium blue...); font maker choice.

Since it doesn't "matter" what character is used, I chose the one which is most typical, and I stand by that choice.
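The three glyph states distinguished in this message (a symbol on its own, a piece on a white square, a piece on a black square) can be recovered from text by looking at what follows each base character. The Python sketch below is only an illustration under that reading; the function name `classify`, the state labels and the sample string are assumptions, not part of the proposal.

```python
import unicodedata
from typing import List, Tuple

# Selector meanings as described in the proposal under discussion.
STATES = {"\uFE00": "on white square", "\uFE01": "on black square"}


def classify(text: str) -> List[Tuple[str, str]]:
    """Pair each base character's name with the state its selector requests."""
    result = []
    i = 0
    while i < len(text):
        base = text[i]
        following = text[i + 1] if i + 1 < len(text) else ""
        name = unicodedata.name(base, "UNNAMED")
        if following in STATES:
            result.append((name, STATES[following]))
            i += 2  # consume the base and its selector
        else:
            result.append((name, "standalone symbol"))
            i += 1
    return result


# A white knight on its own, then on a white square, then on a black square.
sample = "\u2658" + "\u2658\uFE00" + "\u2658\uFE01"
states = classify(sample)
```

The character identity (WHITE CHESS KNIGHT) is the same in all three cases; only the requested glyph state differs.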
All the best, Michael Everson > Kind regards > /Kent K From gwalla at gmail.com Sat Apr 1 19:50:16 2017 From: gwalla at gmail.com (Garth Wallace) Date: Sat, 1 Apr 2017 17:50:16 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de> Message-ID: On Sat, Apr 1, 2017 at 2:09 PM, Rebecca T <637275 at gmail.com> wrote: > > No chess symbols, encoded or proposed, are emoji, nor should they be. > > Except on Samsung > > . > They do not *officially* have emoji presentation. Samsung does what it wants. -------------- next part -------------- An HTML attachment was scrubbed... URL: From kent.karlsson14 at telia.com Sat Apr 1 20:16:39 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Sun, 02 Apr 2017 03:16:39 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <2B915B48-06DB-4C80-AEEE-BE255D2DD407@evertype.com> Message-ID: Den 2017-04-02 01:33, skrev "Michael Everson" : >> but isn't the convention that one "always" start with FE00 for each character >> that may have variation selectors applied? > > I don?t know what you mean by this. > 25A8 FE01; Black chessboard square; # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL In this case, the "set of variation selectors" for 25A8 excludes FE00. /Kent K From everson at evertype.com Sun Apr 2 01:19:00 2017 From: everson at evertype.com (Michael Everson) Date: Sun, 2 Apr 2017 08:19:00 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID:

> On 1 Apr 2017, at 23:49, Philippe Verdy wrote:
>
> I think it's all about sizing so that white or black cells will align, independently of the piece that may be within it.

A white knight may stand alone in text, in which case no variation exists for display beyond the base glyph in the font.

A white knight may need to be represented with specific em-square-related metrics within the font, in two variations, one with no fill on the background of the em-square indicating the piece on a white square, and one with a (typically ///-shaped) fill indicating the piece on a black square.

> However if FE00 and FE01 give the background color distinction, why is the base different (25A1 vs. 25A8) for the empty cell, when it is no different for the same piece (in the same white or black color)?

In the second row in Figure 3, a chessboard is given without any variation selectors at all. This is not beautiful presentation but it is nevertheless legible in plain text. This is more advantageous to the user than having e.g. WHITE SQUARE display both as white with em-square metrics and as hatched with em-square metrics.

> I see the separation of the base only for the borders (touching outside the checkers board),

The characters used for the horizontal and vertical borders and corners are optional and do not require variation selectors. In a chess font they only need to be drawn with the appropriate metrics to match up with the board squares.

> which may also be reduced to a minimal thin edge over a small margin... or nothing at all (completely transparent) if the font already includes a thin contrasting cell on the squares (e.g. grey ridges with 3D effects). The outer background on which these borders are drawn may also be already contrasting with another color (yellow, green, blue), and checkers may also use other pairs of contrasting colors (e.g. beige/ivory vs. brown):

This proposal is about black and white glyphs for ordinary printing of plain text with appropriate font glyphs in the conventional way of displaying chessboard data. In lead type, a white knight on a white square was a separate character from a white knight on a black square. This proposal uses variation selectors to select such glyphs, preserving character identity of chess pieces as already encoded. The alternative would be to encode *WHITE CHESS KNIGHT ON BLACK SQUARE which when mooted in the past was rejected.

> The FE00 and FE01 select an Emoji style

NO, THEY DO NOT.

> with more freedom in shapes and colors for the piece and more precise and coherent sizes but a required square cell. Their absence just means an isolated piece outside the checker board and without required backgrounds or without monospaced margins, suitable for inclusion in text.

The proposal does no more nor less than it says it does. It has been carefully thought out and tested in font implementation, typeset as you see in the proposal in a program which respects the OpenType features.

Michael Everson

From richard.wordingham at ntlworld.com Sun Apr 2 03:53:22 2017
From: richard.wordingham at ntlworld.com (Richard Wordingham)
Date: Sun, 2 Apr 2017 09:53:22 +0100
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID: <20170402095322.17526d87@JRWUBU2>

On Sun, 2 Apr 2017 08:19:00 +0200 Michael Everson wrote:

> > On 1 Apr 2017, at 23:49, Philippe Verdy wrote:
> >
> > I think it's all about sizing so that white or black cells will
> > align, independently of the piece that may be within it.
>
> A white knight may stand alone in text, in which case no variation
> exists for display beyond the base glyph in the font.
>
> A white knight may need to be represented with specific
> em-square-related metrics within the font, in two variations, one
> with no fill on the background of the em-square indicating the piece
> on a white square, and one with a (typically ///-shaped) fill
> indicating the piece on a black square.

I'm uneasy about the semantics of the sequences. To take the extreme example, would be the white bishop on a (or 'with its') white square while would be the white bishop on a black square. Perhaps someone can show me evidence from mathematical symbols or Japanese kanji that such semantic modifications are perfectly acceptable.

I have related unease about the glyph of , intended for a white rook on a black square, being used in text where the meaning is 'white rook' regardless of what square it is on.

If these variation sequences are accepted, I hope that the intention that they contribute to producing presentable populated chess boards in plain text will be captured in at least the Unicode Standard. I can see issues with line-spacing, which I believe is formally out of control in true plain text.

If my unease is well-founded, then I think we have a case for two combining marks akin to U+20E3 COMBINING ENCLOSING KEYCAP. Unfortunately, that would not be as simple to use (or define) as the proposed variation sequences.

I'm also bothered by the purposes of the Format 14 'cmap' subtable. For each supported variation selector, it has a direct and an indirect mapping of base character to glyph.
The direct mapping maps the character to the glyph to be used when qualified by the variation selector; that makes perfect sense. The indirect mapping gives a list of characters for which the 'default' glyph is to be used, i.e. the cmap subtables that do not directly support variation selectors are to be used. As I understand it, this list will in general not include all the characters supported by the font but without a non-default variation sequence mapping. Thus if a font supports the use of U+E0100 VARIATION SELECTOR-17, the table for U+E0100 will not have mention of U+0030, for is not a standardised variation sequence, and therefore no font should support it.

I believe the purpose of having the indirect mapping is so that one can query whether a font explicitly supports a sequence. Thus the cmap distinguishes two cases:

1) is explicitly catered for, and uses the same glyph as unqualified U+82A6.

2) is not catered for. If one needs to be sure of having its distinctive form, another font must be used.

If I have understood the intended use correctly, then we need another variation sequence to explicitly specify a glyph of U+2656 suitable for use in plain-looking running text, analogous to for a text-style '2'. A renderer can then ask whether a font supports plain text white rooks, as opposed to providing one dimensioned for assembling chess boards.

Richard.

From everson at evertype.com Sun Apr 2 04:57:49 2017
From: everson at evertype.com (Michael Everson)
Date: Sun, 2 Apr 2017 11:57:49 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <20170402095322.17526d87@JRWUBU2>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2>
Message-ID: <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>

On 2 Apr 2017, at 10:53, Richard Wordingham wrote:

>> A white knight may need to be represented with specific em-square-related metrics within the font, in two variations, one with no fill on the background of the em-square indicating the piece on a white square, and one with a (typically ///-shaped) fill indicating the piece on a black square.
>
> I'm uneasy about the semantics of the sequences.

There isn't any, not really. This isn't a problem.

> To take the extreme example, would be the white bishop on a (or 'with its') white square

No, "on a" is correct. It's a display mode. The semantics (such as they are) of a white knight have to do with its position in the 8 x 8 chessboard matrix (which can be parsed in plain text, which is why this proposal is useful in terms of chessboard data). In Figure 3, even in the inadequately formatted examples the plain text arrangement of the squares and chess characters can be read and understood. Proper formatting requires shadowing when a chess piece is on a black square, but whether that square is (in algebraic notation) c6 or d7 is a matter of the matrix, not the font display. This proposal permits a regular

> while would be the white bishop on a black square. Perhaps someone can show me evidence from mathematical symbols or Japanese kanji that such semantic modifications are perfectly acceptable.

There isn't a semantic modification. It's a graphic modification, just like Even the emoji VSes regulate display, not semantics. Thus:

2194 ↔ LEFT RIGHT ARROW
    = z notation relation
    ~ 2194 FE0E text style
    ~ 2194 FE0F emoji style

This is a matter of display, of font glyph selection.

> I have related unease about the glyph of , intended for a white rook on a black square, being used in text where the meaning is 'white rook' regardless of what square it is on.
That sequence selects a glyph in the font which draws the white rook surrounded by the diagonal lines which indicate a black square. It has no semantics but what you read into it. (OK, it "means" the rook could logically be on a1, c1, e1, g1, b2, d2, f2, h2, a3, c3, e3, g3, b4, d4, f4, h4 etc. - but this is nothing to feel uneasy about.) And what, pray, is the alternative? A full matrix like this:

??????????
??????????????????
??????????????????
??????????????????
??????????????????
??????????????????
??????????????????
??????????????????
??????????????????
??????????

- is a set of characters which can be parsed. It's text which can be parsed, and because it uses Unicode characters (rather than the ASCII and Symbol font hacks described in §2 of the proposal) it can be sent and received with fidelity and can be displayed nicely with a conformant font.

> If these variation sequences are accepted, I hope that the intention that they contribute to producing presentable populated chess boards in plain text will be captured in at least the Unicode Standard.

My intention would be to re-format the proposal document as a UTN for guidance to implementors, if that's what you mean.

> I can see issues with line-spacing, which I believe is formally out of control in true plain text.

So is the font rendering. The board which I have pasted in above in this e-mail doesn't look great in Everson Mono (which is what I use to view my plain-text e-mail) - because I haven't added the sequences to that font yet - but it is legible. And I can cut and paste it into another document where I have more font control. Yes, some control over line-spacing might be needed in some environments for optimum results. That's why the proposal says things like "set in Ludus in 24 points with 26-point leading" where relevant.

> If my unease is well-founded, then I think we have a case for two combining marks akin to U+20E3 COMBINING ENCLOSING KEYCAP.
> Unfortunately, that would not be as simple to use (or define) as the proposed variation sequences.

We could add some *COMBINING WHITE GAME SQUARE FILTER and *COMBINING BLACK GAME SQUARE FILTER, but this does not simplify matters. First, you would have to decide what base character to use for the squares on which no characters stand. I think that the proposed 25A1 WHITE SQUARE and 25A8 SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL make better sense because in environments where the OpenType features cannot be supplied the plain text is still legible, if not beautiful. Your suggestion is not going to alter the burden on the font with regard to display.

> I'm also bothered by the purposes of the Format 14 'cmap' subtable.

I took this text from a successful proposal dealing with variation selectors by Ken Lunde of Adobe. I am not attached to it. To me, the instruction:

sub uni2654 uniFE00 by uni2654FE00 ;
sub uni2654 uniFE01 by uni2654FE01 ;

is all that I implemented, and the result was what was expected.

> For each supported variation selector, it has a direct and an indirect mapping of base character to glyph. The direct mapping maps the character to the glyph to be used when qualified by the variation selector; that makes perfect sense. The indirect mapping gives a list of characters for which the 'default' glyph is to be used, i.e. the cmap subtables that do not directly support variation selectors are to be used. As I understand it, this list will in general not include all the characters supported by the font but without a non-default variation sequence mapping. Thus if a font supports the use of U+E0100 VARIATION SELECTOR-17, the table for U+E0100 will not have mention of U+0030, for is not a standardised variation sequence, and therefore no font should support it.

Um, I don't understand a word of what you've said here, whatever you mean by "direct mapping" and "indirect mapping".
All I know is that I used OpenType rules in my font to get sequences to point to certain glyphs, and the result works as intended. You can see this in the proposal document.

> I believe the purpose of having the indirect mapping is so that one can query whether a font explicitly supports a sequence. Thus the cmap distinguishes two cases:
>
> 1) is explicitly catered for, and used the same glyph as unqualified U+82A6.

I'm not using E0100. And I'm not using CJK character 芦. I did not propose sequences for unqualified chess pieces because I didn't see any reason why there should be a benefit for it. If there is some genuine benefit, obviously the sequences in my proposal could be altered from

2654 FE00; Chesspiece on white; # WHITE CHESS KING
2654 FE01; Chesspiece on black; # WHITE CHESS KING

(that is:

sub uni2654 uniFE00 by uni2654FE00 ;
sub uni2654 uniFE01 by uni2654FE01 ;)

to

2654 FE00; Unqualified chesspiece; # WHITE CHESS KING
2654 FE01; Chesspiece on white; # WHITE CHESS KING
2654 FE02; Chesspiece on black; # WHITE CHESS KING

(that is:

sub uni2654 uniFE00 by uni2654 ;
sub uni2654 uniFE01 by uni2654FE02 ;
sub uni2654 uniFE02 by uni2654FE01 ;)

But I didn't see any need for that, since 2654 is already the unqualified chesspiece. If there's a formal need for triplets rather than couplets here, I'll conform to it, but that seems to be incidental to the robustness of the proposal.

> 2) is not catered for. If one needs to be sure of having its distinctive form, another font must be used.
>
> If I have understood the intended use correctly, then we need another variation sequence to explicitly specify a glyph of U+2656 suitable for use in plain-looking running text, analogous to for a text-style '2'. A renderer can then ask whether a font supports plain text white rooks, as opposed to providing one dimensioned for assembling chess boards.

If a font doesn't support a glyph or a sequence, then operating systems substitute other glyphs or the .notdef glyph or whatever, no?
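The couplet entries quoted above follow a "base selector; description; # NAME" layout, which can be read mechanically. The following Python sketch shows one way to do that; parse_entry is a hypothetical helper written for this illustration, and the two entries are the proposed (not standardized) sequences from this message.

```python
# Sketch only: parses lines in the "base selector; description; # NAME"
# layout quoted in this thread. parse_entry is a hypothetical helper.
def parse_entry(line: str):
    fields, _, name = line.partition("#")
    sequence, description = [f.strip() for f in fields.split(";")[:2]]
    base, selector = (int(cp, 16) for cp in sequence.split())
    # Return the sequence as an actual two-character string.
    return chr(base) + chr(selector), description, name.strip()


entries = [
    "2654 FE00; Chesspiece on white; # WHITE CHESS KING",
    "2654 FE01; Chesspiece on black; # WHITE CHESS KING",
]

# Lookup table from variation sequence to its described appearance.
table = {seq: desc for seq, desc, _ in map(parse_entry, entries)}
```

A bare U+2654 is simply absent from such a table, which matches the couplet design: the unqualified piece needs no entry of its own.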
Michael Everson

From verdy_p at wanadoo.fr Sun Apr 2 05:54:48 2017
From: verdy_p at wanadoo.fr (Philippe Verdy)
Date: Sun, 2 Apr 2017 12:54:48 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>
Message-ID:

2017-04-02 11:57 GMT+02:00 Michael Everson :

> I'm not using E0100. And I'm not using CJK character 芦. I did not propose
> sequences for unqualified chess pieces because I didn't see any reason why
> there should be a benefit for it. If there is some genuine benefit,
> obviously the sequences in my proposal could be altered from
>
> 2654 FE00; Chesspiece on white; # WHITE CHESS KING
> 2654 FE01; Chesspiece on black; # WHITE CHESS KING
>
> (that is:
>
> sub uni2654 uniFE00 by uni2654FE00 ;
> sub uni2654 uniFE01 by uni2654FE01 ;)
>
> to
>
> 2654 FE00; Unqualified chesspiece; # WHITE CHESS KING
> 2654 FE01; Chesspiece on white; # WHITE CHESS KING
> 2654 FE02; Chesspiece on black; # WHITE CHESS KING
>
> (that is:
>
> sub uni2654 uniFE00 by uni2654 ;
> sub uni2654 uniFE01 by uni2654FE02 ;
> sub uni2654 uniFE02 by uni2654FE01 ;)
>
> But I didn't see any need for that, since 2654 is already the unqualified
> chesspiece. If there's a formal need for triplets rather than couplets
> here, I'll conform to it, but that seems to be incidental to the robustness
> of the proposal.
>
> > 2) is not catered for. If one needs to be sure of
> > having its distinctive form, another font must be used.
> >
> > If I have understood the intended use correctly, then we need another
> > variation sequence to explicitly specify a glyph of U+2656 suitable for use
> > in plain-looking running text, analogous to for a text-style
> > '2'. A renderer can then ask whether a font supports plain text white
> > rooks, as opposed to providing one dimensioned for assembling chess boards.
>
> If a font doesn't support a glyph or a sequence, then operating systems
> substitute other glyphs or the .notdef glyph or whatever, no?
>
>

Semantically, using variation selectors for this usage seems a bit strange to me: you are adding a semantic for the "on a cell" which also affects the metrics and placement of the piece (to center it within the checkerboard cell). What is represented is then BOTH a chess piece (such as 2654), AND a checkerboard cell (in your example you took 25A1 WHITE SQUARE, but if its metrics are appropriate for use in plain text, its margins are inappropriate for use in a checkerboard where cells should be touching without any margin).

There's still no reliable way to represent the empty cells except by adding a variation selector on the 25A1 WHITE SQUARE to transform it into a true cell. Then how to add the chess piece in it? In Unicode we traditionally use joiner controls to suggest a ligature. This would then produce something like: <25A1, VS-1, ZWJ, 2654>, the first part before ZWJ for the cell itself.

You are promoting a simpler encoding using pairs by encoding separate variants of the pieces themselves (two variants for the "on white cell" and "on black cell") but this is still not consistent for the empty cells: do you accept 00A0 NBSP to represent the absence of a piece so that <00A0 FE00> and <00A0 FE01> will correctly represent the colored cells?

From richard.wordingham at ntlworld.com Sun Apr 2 11:27:10 2017
From: richard.wordingham at ntlworld.com (Richard Wordingham)
Date: Sun, 2 Apr 2017 17:27:10 +0100
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>
Message-ID: <20170402172710.54c37ad2@JRWUBU2>

On Sun, 2 Apr 2017 11:57:49 +0200 Michael Everson wrote:

> On 2 Apr 2017, at 10:53, Richard Wordingham
> wrote:

> > while would be the white bishop on a black square.
> > Perhaps someone can show me evidence from mathematical symbols or
> > Japanese kanji that such semantic modifications are perfectly
> > acceptable.

> There isn't a semantic modification. It's a graphic modification,
> just like Even the emoji VSes regulate display, not semantics. Thus:
>
> 2194 ↔ LEFT RIGHT ARROW
> = z notation relation
> ~ 2194 FE0E text style
> ~ 2194 FE0F emoji style
>
> This is a matter of display, of font glyph selection.

We seem to agree that it should be a graphic modification, rather than a semantic modification. The question I pose is, "Is it just a graphic modification in this case?". I'm not convinced that it is. A player starts with two non-interchangeable bishops. could only refer to the white bishop that is restricted to black squares. That's a semantic difference.

> > If these variation sequences are accepted, I hope that the
> > intention that they contribute to producing presentable populated
> > chess boards in plain text will be captured in at least the Unicode
> > Standard.

> My intention would be to re-format the proposal document as a UTN for
> guidance to implementors, if that's what you mean.

> > I can see issues with line-spacing, which I believe is formally out
> > of control in true plain text.
>
> So is the font rendering.

The immediate parallel that comes to mind is the ideographic square. A sequence of CJK ideographs should be a monospace sequence - and that is the major point of most of the ASCII clones with 'IDEOGRAPHIC' or 'FULLWIDTH' in their names. The uniform width is a key part of the semantic of the sequences being discussed.
> We could add some *COMBINING WHITE GAME SQUARE FILTER and *COMBINING
> BLACK GAME SQUARE FILTER, but this does not simplify matters. First,
> you would have to decide what base character to use for the squares
> on which no characters stand. I think that the proposed 25A1 WHITE
> SQUARE and 25A8 SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL make
> better sense because in environments where the OpenType features
> cannot be supplied the plain text is still legible, if not beautiful.

U+00A0 makes a lot of sense as the base character. Also having variants of U+25A1 and U+25A8 that match the game square filter modifiers seems quite legitimate. Possible lack of OpenType support is supposed not to be an admissible justification.

> Your suggestion is not going to alter the burden on the font with
> regard to display.

My suggestion actually increases it. I suggested it because it seems to be the proper thing to do. Variation sequences seem to be the easier solution - provided they are supported in the first place.

> to
>
> 2654 FE00; Unqualified chesspiece; # WHITE CHESS KING
> 2654 FE01; Chesspiece on white; # WHITE CHESS KING
> 2654 FE02; Chesspiece on black; # WHITE CHESS KING
>
> (that is:
>
> sub uni2654 uniFE00 by uni2654 ;
> sub uni2654 uniFE01 by uni2654FE02 ;
> sub uni2654 uniFE02 by uni2654FE01 ;)
>
> But I didn't see any need for that, since 2654 is already the
> unqualified chesspiece. If there's a formal need for triplets rather
> than couplets here, I'll conform to it, but that seems to be
> incidental to the robustness of the proposal.

It's an incidental detail, but if needed someone will have to attend to it. U+2654 is simply the chesspiece; a font that only had variants for white and 'black' backgrounds could nominate either as the glyph for U+2654 on its own.

> > 2) is not catered for. If one needs to be sure
> > of having its distinctive form, another font must be used.
> >
> > If I have understood the intended use correctly, then we need
> > another variation sequence to explicitly specify a glyph of U+2656
> > suitable for use in plain-looking running text, analogous to
> > for a text-style '2'. A renderer can then ask
> > whether a font supports plain text white rooks, as opposed to
> > providing one dimensioned for assembling chess boards.
>
> If a font doesn't support a glyph or a sequence, then operating
> systems substitute other glyphs or the .notdef glyph or whatever, no?

No.

First of all, the substitution mechanism is usually above the operating system layer, with varying degrees of application control.

Secondly, the mechanism can only look for a substitute if it knows that the glyph is missing. If it's looking for an OpenType font for a glyph of the family , the obvious mechanism is to consult the cmap format 14 subtable. The font gives no indication of what glyph families the font's default rendering of U+82A6 is supposed to belong to.

Richard.

From asmusf at ix.netcom.com Sun Apr 2 12:43:39 2017
From: asmusf at ix.netcom.com (Asmus Freytag)
Date: Sun, 2 Apr 2017 10:43:39 -0700
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <20170402172710.54c37ad2@JRWUBU2>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> Message-ID: <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> An HTML attachment was scrubbed... URL: From richard.wordingham at ntlworld.com Sun Apr 2 12:52:51 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Sun, 2 Apr 2017 18:52:51 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com>
Message-ID: <20170402185251.58b95878@JRWUBU2>

On Sun, 2 Apr 2017 11:57:49 +0200 Michael Everson wrote:

> That's why the proposal says things like "set in Ludus in
> 24 points with 26-point leading" where relevant.

You forgot the most important setting though - that the higher-order protocols allow symbols to be displayed left-to-right. If the direction should happen to be right-to-left, not only is the game mirrored, but the board edges don't work properly, as the glyphs are not mirrored. One needs each bidi-paragraph to be forced to the correct order, e.g. by use of LRM before and after, or, if the board is recorded right-to-left, RLM or ALM before and after.

Richard.

From christoph.paeper at crissov.de Sun Apr 2 18:21:04 2017
From: christoph.paeper at crissov.de (Christoph Päper)
Date: Mon, 3 Apr 2017 01:21:04 +0200 (CEST)
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com> <1766481648.24541.1491076627544.JavaMail.open-xchange@app08.ox.hosteurope.de>
Message-ID: <247062319.25007.1491175265004.JavaMail.open-xchange@app06.ox.hosteurope.de>

Michael Everson :
> On 1 Apr 2017, at 21:57, Christoph Päper wrote:
>
> > □ U+25A1 and, especially, ▨ U+25A8 for empty fields on a board make no
> > sense.
>
> Not so. Think about the data.

I do, but I'm thinking about the character, too.

> Please distinguish characters from glyphs.

I do. To draw board diagrams, you need a character for "field with no game piece on it". You *do not* need two characters, "white field" and "black field"! It's just a (very strong) convention to draw every other field in a different color; they are also distinguished by coordinates A1 through H8. That is why I agree with your proposal to use variation sequences for chess and checkers pieces.
The background color poses no semantic difference, except for the bishops perhaps. I'm only suggesting you apply the same logic to empty fields. I evidently don't know which existing character serves that role best, but I strongly believe it should be a single one, not a pair chosen for their glyphs.

FE00; White chessboard square; #
FE01; Black chessboard square; #

instead of

FE00; White chessboard square; #
FE00; Black chessboard square; #

> The white square on a chessboard is not a separating nothingness. It's a white
> square.

Unless there are Fairy Chess boards that have adjacent squares of the same color or three different colors with arbitrary distribution, white and black are just optional visual cues for alternating fields.

You might argue that using separate whitish and blackish square characters for empty fields provides for better fallback rendering, but the pieces will have no background and possibly render proportionally, too.

> The point is to have the underlying board data as plain text, and that means
> just using the chess pieces and conventional white and black squares.

No, this approach would properly require alternate code points for all chess pieces, just with a different background, like legacy fonts provide.

> > 20DE FE00; Combining white chessboard square; # COMBINING ENCLOSING SQUARE
> > 20DE FE01; Combining black chessboard square; # COMBINING ENCLOSING SQUARE
>
> That's not remotely tempting. It would offer no advantage and would needlessly
> complicate the system.

I meant the proposal should explain why this approach would be worse.

> > this would require only two new entries in StandardizedVariants.txt and be
> > more flexible regarding alternate (Fairy Chess) game pieces

See? I provided two advantages.

> > - including emojis.
>
> This proposal has nothing to do with emojis.

Maybe I should have included a winking smiley here.
> Should fairy chess characters be added to the standard, some additional
> entries would be added to StandardizedVariants.txt, yes.
> This is a finite set, however, and this should not be problematic to anybody.

With a Combining character, fellow Fairy Chess inventors would not be limited to the characters added specifically for this purpose. If they wanted to introduce a Dragon piece, for instance, they could use U+1F432-FE0E-20DE-FE00/1 to represent it.

From duerst at it.aoyama.ac.jp Mon Apr 3 05:31:07 2017
From: duerst at it.aoyama.ac.jp (Martin J. Dürst)
Date: Mon, 3 Apr 2017 19:31:07 +0900
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <20170402172710.54c37ad2@JRWUBU2>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2>
Message-ID:

On 2017/04/03 01:27, Richard Wordingham wrote:

> We seem to agree that it should be a graphic modification, rather than
> a semantic modification. The question I pose is, "Is it just a
> graphic modification in this case?". I'm not convinced that it is. A
> player starts with two non-interchangeable bishops.
> could only refer to the white bishop that is restricted to black squares.
> That's a semantic difference.

That applies only to the bishop, and only in standard chess and those chess variants that keep the same restrictions. It's easily possible to imagine or invent variants where bishops can move differently, and it would be weird to use a semantic difference (e.g. different characters) for bishops, but a variation selector for other pieces. Also it would be weird to try e.g. to "semantically" distinguish the two rooks, even if they are two different actual chess pieces on an actual board.

> The immediate parallel that comes to mind is the ideographic square. A
> sequence of CJK ideographs should be a monospace sequence - and that is
> the major point of most of the ASCII clones with 'IDEOGRAPHIC' or
> 'FULLWIDTH' in their names. The uniform width is a key part of the
> semantic of the sequences being discussed.

The full width/half width distinction mostly is a legacy (roundtrip) issue.

Regards, Martin.

From verdy_p at wanadoo.fr Mon Apr 3 06:11:50 2017
From: verdy_p at wanadoo.fr (Philippe Verdy)
Date: Mon, 3 Apr 2017 13:11:50 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To:
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2>
Message-ID:

2017-04-03 12:31 GMT+02:00 Martin J. Dürst :

> Also it would be weird to try e.g. to "semantically" distinguish the two
> rooks, even if they are two different actual chess pieces on an actual
> board.
>

However it is perfectly possible to have pseudo-variants using pieces and an annotation on them such as numbers/letters/symbols added on top of them: with combining marks? Empty checkerboard cells may likewise contain some marks, not just pieces.

From everson at evertype.com Mon Apr 3 07:12:52 2017
From: everson at evertype.com (Michael Everson)
Date: Mon, 3 Apr 2017 14:12:52 +0200
Subject: Proposal to add standardized variation sequences for chess notation
In-Reply-To: <20170402172710.54c37ad2@JRWUBU2>
References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2>
Message-ID:

On 2 Apr 2017, at 18:27, Richard Wordingham wrote:

> We seem to agree that it should be a graphic modification, rather than a semantic modification.

Yes, we do.

> The question I pose is, "Is it just a graphic modification in this case?".

Yes, it is.

> I'm not convinced that it is. A player starts with two non-interchangeable bishops. could only refer to the white bishop that is restricted to black squares. That's a semantic difference.

Surely not. If it were, we would encode WHITE BISHOP THAT STAYS ON THE WHITE SQUARES and WHITE BISHOP THAT STAYS ON BLACK SQUARES and we would encode WHITE KNIGHT THAT MOVES FROM WHITE SQUARES TO BLACK SQUARES and WHITE KNIGHT THAT MOVES FROM BLACK SQUARES TO WHITE SQUARES.

> The immediate parallel that comes to mind is the ideographic square. A sequence of CJK ideographs should be a monospace sequence - and that is the major point of most of the ASCII clones with 'IDEOGRAPHIC' or 'FULLWIDTH' in their names. The uniform width is a key part of the semantic of the sequences being discussed.

I think you are seriously going the wrong way with this thinking. The immediate parallel that comes to mind are things like:

1000 က MYANMAR LETTER KA
    ~ 1000 FE00 dotted form

where the character can still be read if the variation selector's glyph can't be shown. Uniform width is a feature of CJK, sure, but that's the nature of the writing system. Chess pieces for setting within ordinary text do NOT have to be an em wide, and in fonts they are not. Chess pieces on a white square or on a black square do have to have a uniform width in order to produce the board matrix.

> U+00A0 makes a lot of sense as the base character.

What? NBSP and SP are whitespace characters, with complex behaviours, and chessboards, whether set in lead type or digitally, are sets of simple symbol glyphs. NBSP glues two things together.
SP separates things. Chessboards are not collections of black squares glued together by white spaces with white spaces at the alternating ends of lines. I reject this analysis. > Also having variants of U+25A1 and U+25A8 that match the game square filter modifiers seems quite legitimate. Um, wait? What are you proposing NBSP for? I'm confused now. If you like these two characters (and I am glad you do) there?s no need for U+00A0 at all. > Possible lack of OpenType support is supposed not to be an admissible justification. Well, I addressed this in the proposal. OpenType support for the symbol + VS sequences gives the desired result. A board prepared using this encoding proposal is legible even if not beautiful, but is nevertheless parseable, and in my view is a robust and convenient higher-level protocol which is certainly superior to the chaos that currently besets the chess community, who can?t even reliably interchange chessboard data using their ASCII fonts due to the plethora of encodings still in use. (None of the chess fonts I have examined use the Unicode chess characters at all.) >> Your suggestion is not going to alter the burden on the font with regard to display. > > My suggestion actually increases it. I suggested it because it seems to be the proper thing to do. I can?t agree. > Variation sequences seem to be the easier solution - provided they are supported in the first place. It is understood that not all environments may display such ligatures, but that?s true for every character that uses a variation sequence. >> 2654 FE00; Unqualified chesspiece; # WHITE CHESS KING >> 2654 FE01; Chesspiece on white; # WHITE CHESS KING >> 2654 FE02; Chesspiece on black; # WHITE CHESS KING >> >> (that is: >> >> sub uni2654 uniFE00 by uni2654 ; >> sub uni2654 uniFE01 by uni2654FE02 ; >> sub uni2654 uniFE02 by uni2654FE01 ;) >> >> But I didn?t see any need for that, since 2654 is already the >> unqualified chesspiece. 
If there’s a formal need for triplets rather >> than couplets here, I’ll conform to it, but that seems to be >> incidental to the robustness of the proposal. > > It's an incidental detail, but if needed someone will have to attend to it. U+2654 is simply the chesspiece; a font that only had variants for white and 'black' backgrounds could nominate either as the glyph for U+2654 on its own. No, again, it’s not right to say that chess pieces on their own have to be the width of an em square, and this would disrupt their use in ordinary text. Here are the metrics for the pieces in Ludus: >> If a font doesn’t support a glyph or a sequence, then operating systems substitute other glyphs or the .notdef glyph or whatever, no? > > No. > > First of all, the substitution mechanism is usually above the operating system layer, with varying degrees of application control. Well, yes, OpenType is handled by the font and by the app knowing that the OpenType tables are there. > Secondly, the mechanism can only look for a substitute if it knows that the glyph is missing. The macOS does this quite reliably. If Baskerville has no chess piece, but Ludus does, then a text in Baskerville will usually display the Ludus glyph. You can override this by selecting the Ludus glyph and forcing it back to Baskerville and then you get a box or other substitution glyph. > If it's looking for an OpenType font for a glyph of the family , Or any OpenType substitution string. > the obvious mechanism is to consult the cmap format 14 subtable. The font gives no indication of what glyph families the font's default rendering of U+82A6 is supposed to belong to. I don’t really find us in disagreement…
Michael Everson From everson at evertype.com Mon Apr 3 07:42:43 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 13:42:43 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> Message-ID: <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> On 2 Apr 2017, at 19:43, Asmus Freytag wrote: > It's a matter of perspective. > > Higher-level semantic constructs are encoded in writing (or graphic notation), and you can see the individual marks, signs, letters and symbols as the element of this encoding. However, how strongly any of these marks, signs, letters and symbols are associated with a specific semantic, and how fixed that association is, depends on convention. > Asmus, I don?t follow this abstraction of yours. The proposal is simple. The proposal works when OpenType substitutions of ?piece? plus ?VS? are in the font and when an app can display such a substitution. > For example, "left arrow" has a very loose associating with a broad range of concepts that somehow relate to direction. > In contrast, "integral sign" is rarely associated with any concept outside calculus. And chess piece characters are symbols which mean chess pieces. > It's tempting then, to assume that the character for "integral sign" somehow directly represents the semantic of "integration" --- except it doesn't. > > The same indirection is at play here. This is pure rhetoric, Asmus. It addresses the problem in no way. > My dislike for using variation sequences in the way Micheal appear to advocate is based on a different reason: This is almost funny. Ordinarily I dislike variation sequences because I consider them pseudo-encoding. > the oft-stated fact that variation selectors may be ignored. I?m aware of this. I may be wrong, but I believe you advocated for the encoding of variation sequences for mathematics purposes. > If they are, any plain text that depends on the contrasting use of white and black chess background will become meaningless gibberish. This is untrue. Did you not read the proposal? 
Look again at Figure 3. In the left-hand column, the top example is only one of the several ASCII-based ways that chess fonts represent chessboards today (without any Unicode chess characters at all). It is legible only if that particular font is loaded. The middle example in the same column is not very good looking. But it is stable, parseable, exchangeable data which gives unique tokens for the empty squares in two colours and which contains the chess characters. It’s not “meaningless gibberish” and it’s not even very difficult to read. Same for the bottom example, which has been force-justified to facilitate legibility; while that font has visible glyphs for the variation selectors, it needn’t. > In these cases, explicit encoding would better cover what is desired: a reliable way to mark a distinction between different symbols (the two bishops are separate symbols, that also happen to express distinct, though related concepts -- it is not a single symbol with some ignorable attributes). Well, Asmus, if by “explicit encoding” you mean “add more chess characters”, this would require the trebling of the number of basic chess characters from 12 to 36. You couldn’t get away with adding just six chesspieces-on-black because then fonts would be forced to draw all the chesspieces-on-white with the same em-square metrics needed to produce chessboards. But that would mean that nobody could use the ordinary chess pieces as just symbols in plain text (as seen in Figures 6 and 8). I do not believe that burdening chess users with having to use different fonts for in-text characters on the one hand and board-layout on the other is a good idea, particularly when both forms of presentation are the norm in chess-problem publishing. Further, it would delay implementation of a chessboard solution till the summer of 2019 for no benefit, since the proposal here is simple to implement with nothing more than care on the part of the font designer.
And when in the past encoding pieces-on-black has been suggested, the answer has been: no, use a higher-level protocol. This proposal is a robust and simple higher-level protocol. It enables the preparation of parseable chessboards without having to add characters, or without the problem of having pieces-for-use-in-text looking nearly identical to pieces-for-use-on-white-squares. > Now, for the case of suggesting the chess-board cell dimensions, I do not have the same objection to the use of variation selectors. If the variation selectors get stripped, the text may require manual formatting to look correct, but it will still contain the correct symbols (and applying the chosen convention, you will be able to know which bishop is meant). > > That's much closer to the way variation selectors are intended to be used. What? You are very unclear here. Are you saying that the empty white and black squares should use VS but the chess pieces should not? That makes no sense to me at all. Michael Everson From everson at evertype.com Mon Apr 3 07:50:06 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 13:50:06 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170402185251.58b95878@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402185251.58b95878@JRWUBU2> Message-ID: <46C3463F-ECCB-4F85-BA78-F73BD7F70A66@evertype.com> On 2 Apr 2017, at 18:52, Richard Wordingham wrote: > > You forgot the most important setting though - that the higher-order protocols allow symbols to be displayed left-to-right. If the direction should happen to be right-to-left, not only is the game mirrored, but the board edges don't work properly, as the glyphs are not mirrored. One needs each bidi-paragraph to be forced to the correct order, e.g. by use of LRM before and after, or, if the board is recorded right-to-left, RLM or ALM before and after. None of the characters listed in §3 has a mirroring property. Michael Everson From kent.karlsson14 at telia.com Mon Apr 3 09:41:58 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Mon, 03 Apr 2017 16:41:58 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <46C3463F-ECCB-4F85-BA78-F73BD7F70A66@evertype.com> Message-ID: On 2017-04-03 14:50, "Michael Everson" wrote: > On 2 Apr 2017, at 18:52, Richard Wordingham > wrote: >> >> You forgot the most important setting though - that the higher-order >> protocols allow symbols to be displayed left-to-right. If the direction >> should happen to be right-to-left, not only is the game mirrored, but the >> board edges don't work properly, as the glyphs are not mirrored. One needs >> each bidi-paragraph to be forced to the correct order, e.g. by use of LRM >> before and after, or, if the board is recorded right-to-left, RLM or ALM >> before and after. > > None of the characters listed in §3 has a mirroring property. Right, but most of them have bidi property ON (other neutral), so in a right-to-left context, the chess board characters will be reversed (on each line, but the VSs (which are NSM) still go with their base).
This would 1) mirror the chess *board* display (but not the chess *piece* glyphs) 2) mess up the corner glyphs, which are not mirrored; and also the RIGHT/LEFT ONE EIGHTH BLOCK glyphs, which aren't mirrored either. Issue 2 will result in ugly display. Issue 1 will confuse the reader, mirroring the entire chess board (if one disregards the ugly display of the corner and left/right borders). Hence the chess board lines should be displayed in a strong left-to-right context (either via bidi markup characters, or via some higher order bidi markup mechanism, such as the "bidi" attribute in HTML). Though in most cases (not Arabic/Hebrew/... document), the bidi context will default to left-to right... For cut-and-paste to work well also when pasting to a right-to-left context document, bidi markup characters are probably better than using a higher-level attribute. I think that is why Richard argues for using bidi characters to make the lines strong left-to-right (without having to surround each chess board line with visible strong l-t-r characters). You might argue for making the board corner and board left/right border characters strong l-t-r. Not sure if that would sit well with the UTC... /Kent K From gerrietm at icloud.com Mon Apr 3 02:12:51 2017 From: gerrietm at icloud.com (Gerriet M. Denkmann) Date: Mon, 3 Apr 2017 14:12:51 +0700 Subject: Combining Class of Thai Nonspacing_Marks Message-ID: <1D352E5A-C506-4DC4-8F91-4E0100522384@icloud.com> The Combining Class is used for normalisation of strings. Normalisation of strings is important for filenames in filesystems. As far as I know, a Thai consonant (Lo, Other_Letter) can have several Nonspacing_Marks. This cluster of nonspacing marks can contain at most one top/bottom vowel and at most one tone/other mark. There is no syntactically meaning in the order of these nonspacing marks. So: All top/bottom vowels should have Combining Class 103, all tone/other marks have Combining Class 107. 
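For reference, the combining classes that the Unicode Character Database actually assigns to the Thai nonspacing marks can be listed programmatically. The following is a minimal sketch using Python's standard unicodedata module; the helper name thai_mark_classes is hypothetical, and the result reflects whatever Unicode version the local Python build ships.

```python
import unicodedata


def thai_mark_classes():
    """Map each Thai nonspacing mark (General_Category Mn) to its
    Canonical_Combining_Class, keyed by code point."""
    classes = {}
    # The Thai block is U+0E00..U+0E7F; unassigned code points are not Mn.
    for cp in range(0x0E00, 0x0E80):
        ch = chr(cp)
        if unicodedata.category(ch) == "Mn":
            classes[cp] = unicodedata.combining(ch)
    return classes


if __name__ == "__main__":
    for cp, ccc in sorted(thai_mark_classes().items()):
        print(f"U+{cp:04X} {unicodedata.name(chr(cp))}: ccc={ccc}")
```

Running it prints one line per mark, which makes it easy to compare the assigned values against the 103/107 scheme proposed above.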
Is there a reason for having top vowels or other-marks with Combining Class 0, Not_Reordered? With the current choice of Combining Class both consonant + mark + top vowel and consonant + top vowel + mark are normalised, so that one can have two files with these (identically looking, but different) names, which is rather confusing. Here a list of all nonspacing marks in the Thai script: top vowels (Combining Class 0, Not_Reordered): ? this seems to be wrong; should be 103 THAI CHARACTER MAI HAN-AKAT ? THAI CHARACTER SARA I ? THAI CHARACTER SARA II ? THAI CHARACTER SARA UE ? THAI CHARACTER SARA UEE ? bottom vowels (Combining Class 103): THAI CHARACTER SARA U ? THAI CHARACTER SARA UU ? tone-marks (Combining Class 107): THAI CHARACTER MAI EK ? THAI CHARACTER MAI THO ? THAI CHARACTER MAI TRI ? THAI CHARACTER MAI CHATTAWA ? other-marks (Combining Class 0, Not_Reordered): ? this seems to be wrong, should be 107 THAI CHARACTER MAITAIKHU ? THAI CHARACTER THANTHAKHAT ? THAI CHARACTER NIKHAHIT ? THAI CHARACTER YAMAKKAN ? other-marks (Combining Class 9, Virama) THAI CHARACTER PHINTHU ? Gerriet. From asmusf at ix.netcom.com Mon Apr 3 11:16:11 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 09:16:11 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> Message-ID: <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> On 4/3/2017 5:42 AM, Michael Everson wrote: Read to the end. > On 2 Apr 2017, at 19:43, Asmus Freytag wrote: > >> It's a matter of perspective. >> >> Higher-level semantic constructs are encoded in writing (or graphic notation), and you can see the individual marks, signs, letters and symbols as the element of this encoding. However, how strongly any of these marks, signs, letters and symbols are associated with a specific semantic, and how fixed that association is, depends on convention. >> Asmus, I don?t follow this abstraction of yours. The proposal is simple. The proposal works when OpenType substitutions of ?piece? plus ?VS? are in the font and when an app can display such a substitution. >> For example, "left arrow" has a very loose associating with a broad range of concepts that somehow relate to direction. >> In contrast, "integral sign" is rarely associated with any concept outside calculus. > And chess piece characters are symbols which mean chess pieces. > >> It's tempting then, to assume that the character for "integral sign" somehow directly represents the semantic of "integration" --- except it doesn't. >> >> The same indirection is at play here. > This is pure rhetoric, Asmus. It addresses the problem in no way. Actually it does. I'm amazed that you don't see the connection. > >> My dislike for using variation sequences in the way Micheal appear to advocate is based on a different reason: > This is almost funny. Ordinarily I dislike variation sequences because I consider them pseudo-encoding. > >> the oft-stated fact that variation selectors may be ignored. > I?m aware of this. 
I may be wrong, but I believe you advocated for the encoding of variation sequences for mathematics purposes. Yes, for those cases where the differences are known to not carry meaning, but where duplicating all fonts or duplicating the characters would have been the wrong solution to allow support for both conventions (e.g. upright vs. slanted integral signs, details of relational operator design, etc.). > >> If they are, any plain text that depends on the contrasting use of white and black chess background will become meaningless gibberish. > This is untrue. Did you not read the proposal? Look again at Figure 3. In the left hand column, the top example, which is only one of the several AsCII-based ways that chess fonts represent chessboards today (without any Unicode chess characters at all). It is legible only if that particular font is loaded. The middle example in the same column is not very good looking. But it is stable, parseable, exchangeable data which gives unique tokens for the empty squares in two colours and which contains the chess characters. It?s not ?meaningless gibberish? and it?s not even very difficult to read. Same for the bottom example, which has been force-justified to facilitate legibility; while that font has visible glyphs for the variation selectors, it needn?t. > >> In these cases, explicit encoding would better cover what is desired: a reliable way to mark a distinction between different symbols (the two bishops are separate symbols, that also happen to express distinct, though related concepts -- it is not a single symbol with some ignorable attributes). > Well, Asmus, if by "explicit encoding? you mean ?add more chess characters? this would require the trebling of the number of basic chess characters from 12 to 36. You couldn?t get away with adding just six chesspieces-on-black because then fonts would be forced to draw all the chesspieces-on-white with the same em-square metrics needed to produce chessboards. 
But that would mean that nobody could use the ordinary chess pieces as just symbols in plain text (as seen in Figures 6 and 8). I do not believe that burdening chess users with having to use different fonts for in-text characters on the one hand and board-layout on the other is a good idea, particularly when both forms of presentation are the norm in chess-problem publishing. > > Further, it would delay implementation of a chessboard solution till the summer of 2019 for no benefit, since the proposal here is simple to implement with nothing more than care on the part of the font designer. > > And when in the past encoding pieces-on-black has been suggested, the answer has been: no, use a higher-level protocol. > > This proposal is a robust and simple higher-level protocol. It enables the preparation of parseable chessboards without having to add characters, or without the problem of having pieces-for-use-in-text looking nearly identical to pieces-for-use-on-white-squares. > >> Now, for the case of suggesting the chess-board cell dimensions, I do not have the same objection to the use of variation selectors. If the variation selectors get stripped, the text may require manual formatting to look correct, but it will still contain the correct symbols (and applying the chosen convention, you will be able to know which bishop is meant). >> >> That's much closer to the way variation selectors are intended to be used. > What? You are very unclear here. Are you saying that the empty white and black squares should use VS but the chess pieces are not? That makes no sense to me at all. I'm saying that perhaps it would be appropriate to select M-square glyph variants via a variation selector. That seems a clear-cut glyph *variation* to me. (If this variation is ignored, then the text looks bad, but in a way that is similar to selecting the wrong font - which is a rule-of-thumb way of evaluating whether variation selectors are appropriate). 
The distinction between white/black background might be of a different nature. If you have arranged everything in a grid with the correct matrix, then the color of the background is perhaps redundant, given that there is a uniform convention for it. If you assume the characters will ever be used outside a full grid, then that assumption fails and it will not be possible to restore the intended meaning if the variation selectors are missing. That's a warning flag, that they may not be appropriate for that use. That's all. A./ > > Michael Everson > From asmusf at ix.netcom.com Mon Apr 3 11:18:28 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 09:18:28 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> Message-ID: <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> An HTML attachment was scrubbed... URL: From asmusf at ix.netcom.com Mon Apr 3 11:40:05 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 09:40:05 -0700 Subject: Combining Class of Thai Nonspacing_Marks In-Reply-To: <1D352E5A-C506-4DC4-8F91-4E0100522384@icloud.com> References: <1D352E5A-C506-4DC4-8F91-4E0100522384@icloud.com> Message-ID: <6f999b27-654d-1220-e710-cc170de82a37@ix.netcom.com> On 4/3/2017 12:12 AM, Gerriet M. Denkmann wrote: > The Combining Class is used for normalisation of strings. > Normalisation of strings is important for filenames in filesystems. The same issues apply to network identifiers. > > As far as I know, a Thai consonant (Lo, Other_Letter) can have several Nonspacing_Marks. > This cluster of nonspacing marks can contain at most one top/bottom vowel and at most one tone/other mark. > There is no syntactically meaning in the order of these nonspacing marks. > > So: All top/bottom vowels should have Combining Class 103, all tone/other marks have Combining Class 107. > > Is there a reason for having top vowels or other-marks with Combining Class 0, Not_Reordered? > > With the current choice of Combining Class both consonant + mark + top vowel and consonant + top vowel + mark are normalised, so that one can have two files with these (identically looking, but different) names, which is rather confusing. It is not possible to construct a set of secure network identifiers based on simply a) ensuring the string is in NFC b) otherwise allowing all of the Thai characters (insofar as the they are PVALID in IDNA 2008 [RFC5892]). Considerable attention to allowable contexts is required. There is a group in Thailand working on this, but their results have not yet been made public. 
Similar work for Khmer and Lao can be found here: https://www.icann.org/en/system/files/files/proposal-khmer-lgr-15aug16-en.pdf https://www.icann.org/en/system/files/files/proposal-lao-lgr-31jan17-en.pdf A./ > > Here a list of all nonspacing marks in the Thai script: > > top vowels (Combining Class 0, Not_Reordered): ? this seems to be wrong; should be 103 > THAI CHARACTER MAI HAN-AKAT ? > THAI CHARACTER SARA I ? > THAI CHARACTER SARA II ? > THAI CHARACTER SARA UE ? > THAI CHARACTER SARA UEE ? > > bottom vowels (Combining Class 103): > THAI CHARACTER SARA U ? > THAI CHARACTER SARA UU ? > > tone-marks (Combining Class 107): > THAI CHARACTER MAI EK ? > THAI CHARACTER MAI THO ? > THAI CHARACTER MAI TRI ? > THAI CHARACTER MAI CHATTAWA ? > > other-marks (Combining Class 0, Not_Reordered): ? this seems to be wrong, should be 107 > THAI CHARACTER MAITAIKHU ? > THAI CHARACTER THANTHAKHAT ? > THAI CHARACTER NIKHAHIT ? > THAI CHARACTER YAMAKKAN ? > > other-marks (Combining Class 9, Virama) > THAI CHARACTER PHINTHU ? > > Gerriet. > > > From markus.icu at gmail.com Mon Apr 3 12:51:15 2017 From: markus.icu at gmail.com (Markus Scherer) Date: Mon, 3 Apr 2017 10:51:15 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> Message-ID: It seems to me that higher-level layout (e.g, HTML+CSS) is appropriate for the board layout (e.g., via a table), board frame style, and cell/field shading. In each field, the existing characters should suffice. markus -------------- next part -------------- An HTML attachment was scrubbed... URL: From wjgo_10009 at btinternet.com Mon Apr 3 13:13:01 2017 From: wjgo_10009 at btinternet.com (William_J_G Overington) Date: Mon, 3 Apr 2017 19:13:01 +0100 (BST) Subject: Tags and custom vector glyph emoji (from Re: Tailoring the Marketplace (is: Re: Unicode Emoji 5.0 characters now final)) In-Reply-To: <11364706.56745.1491240615392.JavaMail.root@webmail12.bt.ext.cpcloud.co.uk> References: <11364706.56745.1491240615392.JavaMail.root@webmail12.bt.ext.cpcloud.co.uk> Message-ID: <7794797.61788.1491243181509.JavaMail.defaultUser@defaultHost> Peter Constable wrote: > William, you completely miss the point: As long as Unicode is the way to provide emoji to consumers, their needs and desires will not be best or fully met. Unicode as an AND gate is too many AND gates. Ah, I understand what you mean now. In my feedback of 7 March 2017 to PRI #348 on the Length of Tag Sequences I included the following. quote .... for example, a vector glyph in a platform-independent colour-font-style contour format could be expressed using tag characters. end quote Following your post and my now understanding your meaning I have written some notes about the above possibility. Previously I have made some colour fonts using the High-Logic FontCreator program. 
I do not claim to be expert on the OpenType colour font format, yet I know about the idea of having several glyphs with each such glyph being of one colour and then combining them to produce a colourful glyph and I also know about the option to include a default monochrome glyph. I enjoy trying to devise encoding systems, so I have tried to produce a way to send the information for a colourful glyph within a tag sequence. I am thinking that a future email or text message reception system could decode the tag sequence and add a colourful glyph to the font being used to display the message. This method, if people can get it to work satisfactorily, would allow custom vector glyph emoji within an interoperable plain text system. Here is a transcript of what I have produced so far. Readers of this thread are invited to have a look at the idea and are welcome to try to implement it if they so choose. If any additions are needed, or indeed if any changes are needed, please say. There needs to be a way so that the tag sequence for the glyph for a particular character is only sent once in a message even though the character may be used more than once in the message. Tags and custom vector glyph emoji Some notes as at Monday 3 April 2017 19:04 pm British Summer Time A tag sequence for this purpose starts with a capital letter V standing for vector format. At the start of the sequence a:=255; b:=0; g:=0; m:=1; p:=0; r:=0; x:=0; y:=0; w:=1000; At the start of the sequence the points buffer is empty, the contours buffer is empty and the glyphs buffer is empty. The system uses a special-purpose virtual computing engine within a software sandbox. The special-purpose virtual computing engine has no commands for loops and is a single pass interpretative system. ---- Letters that are each used both as a command and also as the name of a register in the special-purpose virtual computing engine. 
a means {a:=p; p:=0; m:=0;} b means {b:=p; p:=0; m:=0;} g means {g:=p; p:=0; m:=0;} m means {m:=1;} p means {p:=0;} r means {r:=p; p:=0; m:=0;} x means {x:=p; p:=0;} y means {y:=p; p:=0;} w means {w:=p; p:=0;} ---- Letters that are used as a command but not as the name of a register in the special-purpose virtual computing engine. c means {define a closed contour from the points in the points buffer; clear the points buffer ready for the next point; x:=0; y:=0; p:=0;} d means {define a glyph from the contour or contours in the contours buffer, if m=1 then the the glyph is the first glyph and is the monochrome glyph, else the glyph is of colour (r, g, b, a) and is not the first glyph; clear the contours buffer ready for the next glyph;clear the points buffer ready for the next point; a:=255; b:=0; g:=0; r:=0; x:=0; y:=0; p:=0; m:=0;} The use of the m register is so that a default monochrome glyph may optionally be included as the first glyph defined. If any component of the colour or opacity is defined before a d command is used, then the monochrome component is left empty. f means {define an off curve point using x and y; x:=0; y:=0; p:=0;} h means {define a complete glyph of advance width w from the glyph or glyphs in the glyphs buffer and have it ready for access by the main program; halt;} n means {define an on curve point using x and y; x:=0; y:=0; p:=0;} ---- Digits Digits 0 .. 9 each mean p:=10*p + (digit); The system is designed to be notionally for an emoji glyph within a virtual space of (x from 0 .. 1000 and y from 0 .. 1000). These values may be scaled to fit with the metrics of a real world font with which a glyph communicated using this system is applied. ---- A tag sequence for this purpose ends with a cancel tag. ---- Some basic examples of parts of a tag sequence to provide an idea of how the system would be used. The following part of a tag sequence would set the x register to have the value 250. 
250x The following part of a tag sequence would define an on-curve point at (x,y) = (250, 900) 250x900yn The following part of a tag sequence would define a contour. 250x900yn800x500yf250x100ync The following part of a tag sequence would define a colour glyph that has one contour. 250x900yn800x500yf250x100ync255b128gd ---- Conclusion It seems that it would be possible for such a system to work, though the tag sequences would be quite long, yet the system could allow a colourful glyph to be expressed in an interoperable plain text format without needing any file attachment to the plain text sequence. William Overington Monday 3 April 2017 From kent.karlsson14 at telia.com Mon Apr 3 13:46:17 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Mon, 03 Apr 2017 20:46:17 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: Den 2017-04-03 19:51, skrev "markus.icu at gmail.com" : > It seems to me that higher-level layout (e.g, HTML+CSS) is appropriate for the > board layout (e.g., via a table), board frame style, and cell/field shading. > In each field, the existing characters should suffice. > > markus True, and one can easily find an example online. Slightly modified from http://stackoverflow.com/questions/18505921/chess-using-tables
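The attachment with that example was scrubbed from the archive and is not reproduced here. As a stand-in, the following is a minimal sketch of the table approach Markus describes: layout and shading come from HTML and CSS, and each cell holds only an existing chess character. It is not the Stack Overflow code; the function name board_to_html, the START_POSITION constant and the colour values are all assumptions made for illustration.

```python
from html import escape

# Ranks 8 down to 1, written with the existing chess symbols U+2654..U+265F.
# A space marks an empty square.
START_POSITION = [
    "\u265C\u265E\u265D\u265B\u265A\u265D\u265E\u265C",  # black back rank
    "\u265F" * 8,                                        # black pawns
    " " * 8,
    " " * 8,
    " " * 8,
    " " * 8,
    "\u2659" * 8,                                        # white pawns
    "\u2656\u2658\u2657\u2655\u2654\u2657\u2658\u2656",  # white back rank
]


def board_to_html(rows, light="#ffffff", dark="#999999"):
    """Render eight 8-character rows as an HTML table with shaded cells."""
    # dir="ltr" keeps the board from being reversed in a right-to-left page.
    lines = ['<table dir="ltr" style="border-collapse:collapse">']
    for r, row in enumerate(rows):
        cells = []
        for c, piece in enumerate(row):
            # The top-left square (a8) is light; colours alternate from there.
            shade = light if (r + c) % 2 == 0 else dark
            content = "&nbsp;" if piece == " " else escape(piece)
            cells.append(
                f'<td style="background:{shade};width:2em;height:2em;'
                f'text-align:center">{content}</td>'
            )
        lines.append("<tr>" + "".join(cells) + "</tr>")
    lines.append("</table>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(board_to_html(START_POSITION))
```

In this sketch the square colour is derived from the grid position alone, so nothing about the background needs to be carried in the text itself.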


-------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.wordingham at ntlworld.com Mon Apr 3 14:19:43 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 20:19:43 +0100 Subject: Combining Class of Thai Nonspacing_Marks In-Reply-To: <1D352E5A-C506-4DC4-8F91-4E0100522384@icloud.com> References: <1D352E5A-C506-4DC4-8F91-4E0100522384@icloud.com> Message-ID: <20170403201943.28d8e721@JRWUBU2> On Mon, 3 Apr 2017 14:12:51 +0700 "Gerriet M. Denkmann" wrote: > The Combining Class is used for normalisation of strings. > Normalisation of strings is important for filenames in filesystems. > > As far as I know, a Thai consonant (Lo, Other_Letter) can have > several Nonspacing_Marks. This cluster of nonspacing marks can > contain at most one top/bottom vowel and at most one tone/other mark. > There is no syntactically meaning in the order of these nonspacing > marks. You're confusing the modern Thai language with the Thai script. It seems that the Lao-style usage of NIKHAHIT as a vowel is known from older Thai writing, and when used this way it could of course take a tone mark. It also seems that the pressure to have both MAITAIKHU and a tone mark on a consonant has been accepted for at least one minority language. > So: All top/bottom vowels should have Combining Class 103, all > tone/other marks have Combining Class 107. > Is there a reason for having top vowels or other-marks with Combining > Class 0, Not_Reordered? It does make one wonder if someone hated Thais. It would have been a lot simpler, and have worked better, if the combining classes for Latin diacritics had been used. As it is, one common combination of vowel below and mark above was catered for - SARA U/UU with tone mark. The system doesn't even cater for SARA U + THANTHAKHAT, as in พันธุ์ทิพย์ 'Phanthip'.
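The difference between the two cases just mentioned can be checked directly. The following is a minimal sketch in Python using the standard unicodedata module; the helper name canonically_equivalent and the choice of KO KAI as the base consonant are assumptions made for illustration.

```python
import unicodedata

KO_KAI = "\u0E01"       # THAI CHARACTER KO KAI, used here only as a base
SARA_U = "\u0E38"       # vowel below, combining class 103
MAI_EK = "\u0E48"       # tone mark, combining class 107
THANTHAKHAT = "\u0E4C"  # combining class 0, so it is never reordered


def canonically_equivalent(a, b):
    """Two strings are canonically equivalent if their NFD forms match."""
    return unicodedata.normalize("NFD", a) == unicodedata.normalize("NFD", b)


# SARA U with a tone mark: both typing orders normalise to the same string.
tone_pair = canonically_equivalent(KO_KAI + SARA_U + MAI_EK,
                                   KO_KAI + MAI_EK + SARA_U)

# SARA U with THANTHAKHAT: the two orders remain distinct strings.
killer_pair = canonically_equivalent(KO_KAI + SARA_U + THANTHAKHAT,
                                     KO_KAI + THANTHAKHAT + SARA_U)

if __name__ == "__main__":
    print("SARA U + MAI EK orders equivalent:", tone_pair)
    print("SARA U + THANTHAKHAT orders equivalent:", killer_pair)
```

A mark with combining class 0 blocks canonical reordering, which is why only the first pair collapses to a single normalised form.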
The use of values peculiar to Thai (103 and 107) does not help when minority languages use Latin diacritics, such as U+0331 COMBINING MACRON BELOW and U+0303 COMBINING TILDE for Pattani Malay. The viramas that were recognised were given combining class 9; YAMAKKAN and THANTHAKHAT were overlooked. One of the looming problems is that several languages use a combination of PHINTHU and SARA I - both orders are used, though they are not canonically equivalent. Richard. From richard.wordingham at ntlworld.com Mon Apr 3 14:33:55 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 20:33:55 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> Message-ID: <20170403203355.6cbfc184@JRWUBU2> On Sun, 2 Apr 2017 10:43:39 -0700 Asmus Freytag wrote: > In these cases, explicit encoding would better cover what is desired: > a reliable way to mark a distinction between different symbols (the > two bishops are separate symbols, that also happen to express > distinct, though related concepts -- it is not a single symbol with > some ignorable attributes). There was no intention to encode the bishops separately. It just happens that the rules of chess allow one to distinguish the bishops simply by recording the colour of the square they are currently on. The basic text elements in the scheme other than boundary markers will be: empty white square empty black square white square with specific piece on it black square with specific piece on it. If the variation selectors are ignored, these simplify to: white square hatched square specific piece This preserves all the information; the pattern of squares is known in advance and therefore redundant. Richard. From asmusf at ix.netcom.com Mon Apr 3 14:58:46 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 12:58:46 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403203355.6cbfc184@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> Message-ID: An HTML attachment was scrubbed... URL: From olopierpa at gmail.com Mon Apr 3 15:04:54 2017 From: olopierpa at gmail.com (Pierpaolo Bernardi) Date: Mon, 3 Apr 2017 22:04:54 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403203355.6cbfc184@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> Message-ID: On Mon, Apr 3, 2017 at 9:33 PM, Richard Wordingham wrote: > On Sun, 2 Apr 2017 10:43:39 -0700 > Asmus Freytag wrote: > >> In these cases, explicit encoding would better cover what is desired: >> a reliable way to mark a distinction between different symbols (the >> two bishops are separate symbols, that also happen to express >> distinct, though related concepts -- it is not a single symbol with >> some ignorable attributes). > > There was no intention to encode the bishops separately. It just > happens that the rules of chess allow one to distinguish the bishops > simply by recording the colour of the square they are currently on. The rules of chess don't allow this. While at the start of a game there are two bishops per player with this property, there are ways to obtain more bishops. One player, say, can have four bishops all of them on light squares. This does not happen (usually :) in chess *games*, but it may happen in problems, puzzles, and retroanalysis. Even in standard games, it's not forbidden by the rules, so it's wrong to assume it can't happen. From verdy_p at wanadoo.fr Mon Apr 3 15:42:43 2017 From: verdy_p at wanadoo.fr (Philippe Verdy) Date: Mon, 3 Apr 2017 22:42:43 +0200 Subject: Tags and custom vector glyph emoji (from Re: Tailoring the Marketplace (is: Re: Unicode Emoji 5.0 characters now final)) In-Reply-To: <7794797.61788.1491243181509.JavaMail.defaultUser@defaultHost> References: <11364706.56745.1491240615392.JavaMail.root@webmail12.bt.ext.cpcloud.co.uk> <7794797.61788.1491243181509.JavaMail.defaultUser@defaultHost> Message-ID: 2017-04-03 20:13 GMT+02:00 William_J_G Overington : > > A tag sequence for this purpose starts with a capital letter V standing > for vector format. 
> > At the start of the sequence a:=255; b:=0; g:=0; m:=1; p:=0; r:=0; x:=0; > y:=0; w:=1000; > > At the start of the sequence the points buffer is empty, the contours > buffer is empty and the glyphs buffer is empty. > > The system uses a special-purpose virtual computing engine within a > software sandbox. The special-purpose virtual computing engine has no > commands for loops and is a single pass interpretative system. > > [...] What you are describing is reinventing the wheel, basically what SVG paths already define. But an emoji is not just a path: it also has colors to fill the paths, stroke styles (which may be converted to fill-only paths by computing the inferred geometries), and smoothing effects (color shades when they are not necessarily uniform). There are attempts to create a superset of SVG paths to represent it in more compact form with additional instructions; they are used to create "subroutines" or shared forms, affine transforms, geometric derivations (line width, dashes, bevel or rounded join types), and color masks (possibly with repeated patterns, and alpha transparencies). Every attempt to extend this has become a nightmare because there were too many objectives to follow. In the end everyone uses SVG directly, even if this is currently XML encoded. More successful representations use JSON instead of XML, without breaking the extensibility. Font encoding technologies define their own system using multiple tables and a compact dictionary of tables with binary encoding, not suitable for inclusion in plain text. Note also that emojis could be animated when rendered on screen (that's what we already see in many implementations using GIF icons for their emojis, even if they are not easily resizable). Animated SVG for now is still in beta but is starting to be used on some sites and rendered by web browsers. SVG images may also be scripted and may include accessibility features (e.g. with sound played or hint bubbles displayed when hovering over them). 
You only cover a part of what is needed but hope that someone will invest time to implement it in a renderer: developers prefer investing time in SVG renderers or existing font technologies for OpenType (SVG fonts will come later when it will be capable of doing the same things as OpenType, for now it does not cover all the existing needs). -------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.wordingham at ntlworld.com Mon Apr 3 16:03:48 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 22:03:48 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> Message-ID: <20170403220348.3efb4d1a@JRWUBU2> On Mon, 3 Apr 2017 14:12:52 +0200 Michael Everson wrote: > On 2 Apr 2017, at 18:27, Richard Wordingham > wrote: > I think you are seriously going the wrong way with this thinking. The > immediate parallel that comes to mind are things like: > > 1000 ? MYANMAR LETTER KA > ? 1000 FE00 ? dotted form > > where the character can still be read if the variation selector's > glyph can't be shown. Uniform width is a feature of CJK, sure, but > that's the nature of the writing system. Chess pieces for setting > within ordinary text do NOT have to be an em wide, and they don't > in fonts. Chess pieces on a white square or on a black square do have > to have a uniform width in order to produce the board matrix. Nobody said the glyphs for use in ordinary text had to be a fixed width. What I am saying is that the glyphs for the two new variants you are proposing need to harmonise with the block elements such as U+2581 LOWER ONE EIGHTH BLOCK. That requires uniform width *for those variants*. That is a key part of the glyph family's essence. There is no such requirement on the glyphs for normal text use as at present. > > U+00A0 makes a lot of sense as the base character. > > What? NBSP and SP are whitespace characters, with complex behaviours, > and chessboards, whether set in lead type or digitally, are sets of > simple symbol glyphs. NBSP glues two things together. SP separates > things. Chessboards are not collections of black squares glued > together by white spaces with white spaces at the alternating ends of > lines. I reject this analysis. If one had a row of squares in flowing text, one would want the row to act like a word. One might have to resort to gluing it together using CGJ or WJ. > > Also having variants of U+25A1 and U+25A8 that match the game > > square filter modifiers seems quite legitimate. 
> > Um, wait? What are you proposing NBSP for? I'm confused now. If you > like these two characters (and I am glad you do) there's no need for > U+00A0 at all. To be pedantic, I said that the proposed variants were legitimate, not that I liked them. > > Secondly, the mechanism can only look for a substitute if it knows > > that the glyph is missing. > The macOS does this quite reliably. If Baskerville has no chess > piece, but Ludus does, then a text in Baskerville will usually > display the Ludus glyph. You can override this by selecting the Ludus > glyph and forcing it back to Baskerville and then you get a box or > other substitution glyph. I'm talking about looking for a U+2654 glyph for ordinary text when all the first font tried has is: 2654 FE01; Chesspiece on white; # WHITE CHESS KING 2654 FE02; Chesspiece on black; # WHITE CHESS KING I must confess I am now wondering what the format 4 cmap should say about U+2654. Should it give a glyph for U+2654 or not? I'm also wondering about Windows behaviour. There was a time when Windows 7 only supported variation sequences if they appeared in the cmap 14 subtable. > > If it's looking for an OpenType font for a glyph of the family > > , > > Or any OpenType substitution string. Most won't be recognised as needed. If the first font lacks a ligature for , fallback won't be used for it. Grapheme clusters and variation sequences get special treatment. Richard. From richard.wordingham at ntlworld.com Mon Apr 3 16:12:59 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 22:12:59 +0100 Subject: Unicode 10.0 Legitimacy of 0031 FE0E 20E3 Message-ID: <20170403221259.40b3a1cb@JRWUBU2> Where in the draft databases for Unicode 10.0 is Unicode 9.0 variation sequence declared legitimate? Without such a declaration, a font that had a special glyph for or a substitution specific to would not be Unicode compliant. 
I hope this reflects my ignorance of the definition system rather than an error in the databases. Richard. From everson at evertype.com Mon Apr 3 16:15:16 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 22:15:16 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> Message-ID: <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> On 3 Apr 2017, at 17:16, Asmus Freytag wrote: >>> The same indirection is at play here. >> This is pure rhetoric, Asmus. It addresses the problem in no way. > Actually it does. I'm amazed that you don't see the connection. I've never understood you when you back up into that particular kind of abstract rhetoric. >>> the oft-stated fact that variation selectors may be ignored. >> I'm aware of this. I may be wrong, but I believe you advocated for the encoding of variation sequences for mathematics purposes. > > Yes, for those cases where the differences are known to not carry meaning, but where duplicating all fonts or duplicating the characters would have been the wrong solution to allow support for both conventions (e.g. upright vs. slanted integral signs, details of relational operator design, etc.). The "meaning" of a chess-problem matrix is the whole 8 × 8 board, not the empty dark square at b4 or the white pawn on The "problem" the higher-level protocol is supposed to solve is the one where a chess piece of one colour sits in an em-squared zone whether light or dark. In lead type this was a glyph issue. Lead type had just exactly what my proposal has: A piece with in-line text metrics, spaced harmoniously with digits and letters, and square sorts with and without hatching. Standardized variation sequences are the best way to achieve this simply and without needless duplication. :-) >> Are you saying that the empty white and black squares should use VS but the chess pieces are not? That makes no sense to me at all. > > I'm saying that perhaps it would be appropriate to select M-square glyph variants via a variation selector. 
That seems a clear-cut glyph *variation* to me. (If this variation is ignored, then the text looks bad, but in a way that is similar to selecting the wrong font - which is a rule-of-thumb way of evaluating whether variation selectors are appropriate). OK, then you support the part of the proposal that applies VS1 and VS2 to the chess pieces. > The distinction between white/black background might be of a different nature. If you have arranged everything in a grid with the correct matrix, then the color of the background is perhaps redundant, given that there is a uniform convention for it. Yes but you still want it to be reasonably legible when the OpenType ligatures fail. I think that this: ?????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????? is far better than this: ?????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ??????????????????<< Is it the pawn or the queen that's on the black square? ?????????????????? ?????????? See? To parse this one you have to remember which of the white squares are the alternating black ones. I don't consider that as legible as using both 25A1 and 25A8. The colour of the matrix is NOT redundant for a human reader. > If you assume the characters will ever be used outside a full grid, then that assumption fails and it will not be possible to restore the intended meaning if the variation selectors are missing. That's a warning flag, that they may not be appropriate for that use. You can't assume that they wouldn't be. All of my examples in §2 of the proposal are in fact outside of a full grid. I think the proposal as it stands ticks the most boxes. (I have changed "black square" and "white square" to "dark square" and "light square", however.) 
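The fallback behaviour argued over in this exchange can be sketched in a few lines. This is only an illustration under stated assumptions: that VS1 (U+FE00) requests the light-square form and VS2 (U+FE01) the dark-square form, and that empty cells use U+25A1 and U+25A8 as bases. The function names and the light/dark assignment are hypothetical; the thread does not fix them.

```python
VS1 = "\ufe00"  # assumption: selects the "on a light square" glyph
VS2 = "\ufe01"  # assumption: selects the "on a dark square" glyph
LIGHT_EMPTY = "\u25a1"  # WHITE SQUARE
DARK_EMPTY = "\u25a8"   # SQUARE WITH UPPER RIGHT TO LOWER LEFT FILL


def encode_rank(pieces, rank_index):
    """Encode one rank as base characters followed by variation selectors.

    pieces: eight entries, each a chess piece character or None.
    rank_index: 0 for the top rank (rank 8), 7 for the bottom rank (rank 1).
    """
    out = []
    for file_index, piece in enumerate(pieces):
        dark = (file_index + rank_index) % 2 == 1  # a8 is a light square
        base = piece if piece else (DARK_EMPTY if dark else LIGHT_EMPTY)
        out.append(base + (VS2 if dark else VS1))
    return "".join(out)


def strip_variation_selectors(text):
    """Simulate a renderer that ignores U+FE00..U+FE0F."""
    return "".join(ch for ch in text if not "\ufe00" <= ch <= "\ufe0f")


if __name__ == "__main__":
    rank = encode_rank([None, "\u2659", None, None, "\u2655", None, None, None], 6)
    print(strip_variation_selectors(rank))
```

With the selectors stripped, empty cells still alternate between the plain and the hatched square, but a piece no longer shows which kind of square it stands on; that loss is the legibility concern raised in the message.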
Michael Everson From everson at evertype.com Mon Apr 3 16:16:04 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 22:16:04 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> Message-ID: <3BD3C1C7-9294-4FE9-9BFB-7FEF81706CD4@evertype.com> > On 3 Apr 2017, at 17:18, Asmus Freytag wrote: > > On 4/3/2017 5:12 AM, Michael Everson wrote: >>> I'm not convinced that it is. A player starts with two non-interchangeable bishops. could only refer the white bishop that is restricted to black squares. That's a semantic difference. >>> >> Surely not. If it were, we would encode WHITE BISHOP THAT STAYS ON THE WHITE SQUARES and WHITE BISHOP THAT STAYS ON BLACK SQUARES and we would encode WHITE KNIGHT THAT MOVES FROM WHITE SQUARES TO BLACK SQUARES and WHITE KNIGHT THAT MOVES FROM BLACK SQUARES TO WHITE SQUARES. >> > The non-interchangeability of bishops is a fact about chess rules. We agree. :-) > It has no business being "encoded" on the character level. We agree. :-) Michael Everson From kent.karlsson14 at telia.com Mon Apr 3 16:24:59 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Mon, 03 Apr 2017 23:24:59 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: Den 2017-04-03 20:46, skrev "Kent Karlsson" : > > Den 2017-04-03 19:51, skrev "markus.icu at gmail.com" : > >> > It seems to me that higher-level layout (e.g, HTML+CSS) is appropriate for >> the >> > board layout (e.g., via a table), board frame style, and cell/field >> shading. >> > In each field, the existing characters should suffice. >> > >> > markus > > True, and one can easily find an example online. > > Slightly modified from > http://stackoverflow.com/questions/18505921/chess-using-tables > > [...] > A bit more modification: more colourful, even with /// striped backgrounds. One disadvantage is that the "white" pieces interior get the background colour rather than being actually white. 
To get them actually white (not just the interiors, but the entire pieces), use the "black"(!) pieces, and (via CSS) colour them white (need to be set on a non-white background to be visible...). I know, the latter trick will make parsing even more tricky (needing to interpret not only the HTML tag markup and chess characters, but also (say) HTML class attribute to distinguish "white" from "black" pieces). And, parsing (for other things than display in a browser), will be quite sensitive to the exact way of expressing this in HTML. There are many quite different ways of expressing this in HTML (+CSS). But... with a bit of JavaScript savviness, you can program moving the pieces around... ;-) And substitute the chess characters to more emoji style images of chess pieces... Still in ;-) mode.

♜	♞	♝	♛	♚	♝	♞	♜
♟	♟	♟	♟	♟	♟	♟	♟




♙	♙	♙	♙	♙	♙	♙	♙
♖	♘	♗	♕	♔	♗	♘	♖
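The table approach described above can be generated mechanically. The following is a rough sketch, not the code from the Stack Overflow page referenced in the message: the function name, the inline styles and the two colour values are assumptions chosen for illustration.

```python
from html import escape

# Starting position, one string per rank from rank 8 down to rank 1;
# a space marks an empty cell.
START = [
    "♜♞♝♛♚♝♞♜",
    "♟♟♟♟♟♟♟♟",
    "        ",
    "        ",
    "        ",
    "        ",
    "♙♙♙♙♙♙♙♙",
    "♖♘♗♕♔♗♘♖",
]


def board_to_html(rows, light="#f0d9b5", dark="#b58863"):
    """Return an HTML table whose cell backgrounds alternate light and dark."""
    lines = ['<table style="border-collapse:collapse">']
    for r, row in enumerate(rows):
        cells = []
        for f, ch in enumerate(row):
            colour = dark if (r + f) % 2 else light  # top-left cell is light
            content = "&nbsp;" if ch == " " else escape(ch)
            cells.append(
                f'<td style="background:{colour};width:1.5em;'
                f'height:1.5em;text-align:center">{content}</td>'
            )
        lines.append("<tr>" + "".join(cells) + "</tr>")
    lines.append("</table>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(board_to_html(START))
```

The shading lives entirely in the markup, so the text content of the cells is just the existing chess characters; this is also why the result is not plain text.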

-------------- next part -------------- An HTML attachment was scrubbed... URL: From everson at evertype.com Mon Apr 3 16:33:53 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 22:33:53 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> Message-ID: <8A987C4A-CF9E-4EA0-A3C4-99B0DD8CEFDD@evertype.com> On 3 Apr 2017, at 18:51, Markus Scherer wrote: > > It seems to me that higher-level layout (e.g, HTML+CSS) is appropriate for the board layout (e.g., via a table), board frame style, and cell/field shading. In each field, the existing characters should suffice. That isn't plain text. This is plain text: ?????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????? I can read this in my plain-text e-mail. I can copy it from the plain-text e-mail and paste it into Quark XPress as in the proposal, or into Microsoft Word v. 15 for Mac as shown below (the first one is just as-is pasted into Word; the second formatted itself when I selected the Ludus font). None of these examples uses HTML. None uses some external folder with hard-to-format css rules. None needs to be constructed by some HTML or XML
matrix. It's all just a font with normal OpenType features, and normal use of variation sequences. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: ms-word-lg.png Type: image/png Size: 101582 bytes Desc: not available URL: From liancu at microsoft.com Mon Apr 3 16:37:10 2017 From: liancu at microsoft.com (Laurentiu Iancu) Date: Mon, 3 Apr 2017 21:37:10 +0000 Subject: Unicode 10.0 Legitimacy of 0031 FE0E 20E3 In-Reply-To: <20170403221259.40b3a1cb@JRWUBU2> References: <20170403221259.40b3a1cb@JRWUBU2> Message-ID: Richard, The emoji and text presentation sequences were moved to the UTS #51 data file emoji-variation-sequences.txt, which is new in Version 5.0 of the UTS. Please see http://www.unicode.org/Public/emoji/5.0/emoji-variation-sequences.txt The move is documented on the Beta Unicode 10.0 page, http://www.unicode.org/versions/beta-10.0.0.html in the "Standardized Variation Sequences" section. Regards, L. -----Original Message----- From: Unicode [mailto:unicode-bounces at unicode.org] On Behalf Of Richard Wordingham Sent: Monday, April 3, 2017 2:13 PM To: unicode at unicode.org Subject: Unicode 10.0 Legitimacy of 0031 FE0E 20E3 Where in the draft databases for Unicode 10.0 is Unicode 9.0 variation sequence declared legitimate? Without such a declaration, a font that had a special glyph for or a substitution specific to would not be Unicode compliant. I hope this reflects my ignorance of the definition system rather than an error in the databases. Richard. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From markus.icu at gmail.com Mon Apr 3 16:44:28 2017 From: markus.icu at gmail.com (Markus Scherer) Date: Mon, 3 Apr 2017 14:44:28 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <8A987C4A-CF9E-4EA0-A3C4-99B0DD8CEFDD@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> <8A987C4A-CF9E-4EA0-A3C4-99B0DD8CEFDD@evertype.com> Message-ID: On Mon, Apr 3, 2017 at 2:33 PM, Michael Everson wrote: > On 3 Apr 2017, at 18:51, Markus Scherer wrote: > > > It seems to me that higher-level layout (e.g, HTML+CSS) is appropriate for > the board layout (e.g., via a table), board frame style, and cell/field > shading. In each field, the existing characters should suffice. > > > That isn't plain text. > A lot of stuff needed for printing books and laying out PDFs and web pages goes beyond plain text. Whose requirement is it to represent an entire chess or checkers board in plain text? Other than a sort of puzzle of "what would it take to do so?" markus -------------- next part -------------- An HTML attachment was scrubbed... URL: From everson at evertype.com Mon Apr 3 16:48:31 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 22:48:31 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403203355.6cbfc184@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> Message-ID: <9B849393-FEA0-43CE-A4E2-CA63DBAAF425@evertype.com> On 3 Apr 2017, at 20:33, Richard Wordingham wrote: > There was no intention to encode the bishops separately. It just happens that the rules of chess allow one to distinguish the bishops > simply by recording the colour of the square they are currently on. That only works for them though. > The basic text elements in the scheme other than boundary markers will be: > > empty white square > empty black square > white square with specific piece on it > black square with specific piece on it. Or rather, in terms of the font glyphs: light square dark square specific piece surrounded by light square specific piece surrounded by dark square > If the variation selectors are ignored, these simplify to: > > white square > hatched square > specific piece > > This preserves all the information; the pattern of squares is known in advance and therefore redundant. Yes, this is what I've proposed. Michael Everson From everson at evertype.com Mon Apr 3 16:52:55 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 22:52:55 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> Message-ID: On 3 Apr 2017, at 20:58, Asmus Freytag wrote: > > On 4/3/2017 12:33 PM, Richard Wordingham wrote: >> If the variation selectors are ignored, these simplify to: >> >> white square >> hatched square >> specific piece >> >> This preserves all the information; the pattern of squares is known in advance and therefore redundant. >> > This assumes that you always show the full board. True, and see the two short examples at the top of page 5 of the proposal, and see the 12×12 board in Figure 5. > Under that assumption, you are correct. > > The variation selectors would then not be needed, even, in the text: style markup could supply them in all cases where the data isn't raw text. What style markup? Nothing is defined; nothing is portable. If we use VS and put the burden on the font, we actually do the same thing that traditional lead-type setters did. > They would essentially only live in the data stream to the rendering engine, to force glyph selection, but not need to be part of the text. > > Interesting, Not entirely sure I follow, but... well, I like the proposal best because it is robust, easy to learn, and easy to use. Michael Everson From asmusf at ix.netcom.com Mon Apr 3 17:07:38 2017 From: asmusf at ix.netcom.com (Asmus Freytag (c)) Date: Mon, 3 Apr 2017 15:07:38 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> Message-ID: On 4/3/2017 2:15 PM, Michael Everson wrote: > On 3 Apr 2017, at 17:16, Asmus Freytag wrote: > >>>> The same indirection is at play here. >>> This is pure rhetoric, Asmus. It addresses the problem in no way. >> Actually it does. I'm amazed that you don't see the connection. > I've never understood you when you back up into that particular kind of abstract rhetoric. Sometimes thinking through something in abstract terms actually clarifies the situation. > >>>> the oft-stated fact that variation selectors may be ignored. >>> I'm aware of this. I may be wrong, but I believe you advocated for the encoding of variation sequences for mathematics purposes. >> Yes, for those cases where the differences are known to not carry meaning, but where duplicating all fonts or duplicating the characters would have been the wrong solution to allow support for both conventions (e.g. upright vs. slanted integral signs, details of relational operator design, etc.). > The "meaning" of a chess-problem matrix is the whole 8 × 8 board, not the empty dark square at b4 or the white pawn on In other words, you assert that partial boards never need to be displayed. (Let's take that as read, then). > > The "problem" the higher-level protocol is supposed to solve is the one where a chess piece of one colour sits in an em-squared zone whether light or dark. In lead type this was a glyph issue. Lead type had just exactly what my proposal has: A piece with in-line text metrics, spaced harmoniously with digits and letters, and square sorts with and without hatching. 
Leaving aside the abstract question whether modeling lead type is ipso facto the best solution in all cases... > > Standardized variation sequences are the best way to achieve this simply and without needless duplication. :-) > >>> Are you saying that the empty white and black squares should use VS but the chess pieces are not? That makes no sense to me at all. >> I'm saying that perhaps it would be appropriate to select M-square glyph variants via a variation selector. That seems a clear-cut glyph *variation* to me. (If this variation is ignored, then the text looks bad, but in a way that is similar to selecting the wrong font - which is a rule-of-thumb way of evaluating whether variation selectors are appropriate). > OK, then you support the part of the proposal that applies VS1 and VS2 to the chess pieces. My statement just was that a proposal where piece + VS should be M-square, piece w/o VS should be generic, might make some sense (and same for a suitable "empty" cell). The next question would be whether the alternation in background is best expressed in variation sequences or by some other means. If you never need to show just a single field, then I concede that the main drawback of variation selectors for the background style is absent; however, reading ahead in your message, the partial grid appears to be common, therefore the reason to choose an alternate solution to the background style is a strong one. > >> The distinction between white/black background might be of a different nature. If you have arranged everything in a grid with the correct matrix, then the color of the background is perhaps redundant, given that there is a uniform convention for it. > Yes but you still want it to be reasonably legible when the OpenType ligatures fail. I think that this: > > ?????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????? 
> is far better than this: > ?????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ??????????????????<< Is it the pawn or the queen that's on the black square? > ?????????????????? > ?????????? > See? To parse this one you have to remember which of the white squares are the alternating black ones. I don't consider that as legible as using both 25A1 and 25A8. > > The colour of the matrix is NOT redundant for a human reader. OK -- in that case you've actually made an argument for *duplicating* the codes for *ALL* the pieces (as well as the empties). That way, you are guaranteed that (if the font supports the glyphs) you get what you want. With variation selectors for background color, you do not get what you want for the pieces. Having the system use specific character codes for the empties and variation selectors for the pieces is a needless complication; just duplicate the few pieces with a hatched background. (The precise style of hatching should be left to the font - that's not something that you specify in plain text). Leave the question of requesting M-square metrics to a (single) variation selector and you are done. (the convention would be that 25A8 + VS results in an M-square glyph using some hatching that matches that of the hatched code points for chess pieces, not necessarily matching the hatching style that you get for 25A8 w/o the VS). (Alternatively, you could add a code for "dark cell" so that the hatching can be anything whether or not there's VS). Now, this model is much closer to the way VSs are used for math operators (but the reasoning may be a bit abstract, so I won't bother you with it here). > >> If you assume the characters will ever be used outside a full grid, then that assumption fails and it will not be possible to restore the intended meaning if the variation selectors are missing. That's a warning flag, that they may not be appropriate for that use. 
> You can't assume that they wouldn't be. All of my examples in §2 of the proposal are in fact outside of a full grid. I think the proposal as it stands ticks the most boxes. (I have changed "black square" and "white square" to "dark square" and "light square" however.) If the proposal duplicates the pieces that are on dark squares and does not use any VS sequences to select the color of the square (but only to select the M-square metrics) it would be more robust and less complex to implement. (A chess font would not need to do anything but provide the right glyphs and ignore the VS, because they would be in M-square metrics anyway). A./ From richard.wordingham at ntlworld.com Mon Apr 3 17:28:05 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 23:28:05 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <9B849393-FEA0-43CE-A4E2-CA63DBAAF425@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> <9B849393-FEA0-43CE-A4E2-CA63DBAAF425@evertype.com> Message-ID: <20170403232805.6e90972a@JRWUBU2> On Mon, 3 Apr 2017 22:48:31 +0100 Michael Everson wrote: > Yes, this is what I've proposed. I was explaining it to Asmus and others with similar misunderstandings. Richard. From richard.wordingham at ntlworld.com Mon Apr 3 17:34:31 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Mon, 3 Apr 2017 23:34:31 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> Message-ID: <20170403233431.5e5a4858@JRWUBU2> On Mon, 3 Apr 2017 15:07:38 -0700 "Asmus Freytag (c)" wrote: > Having the system use specific character codes for the empties and > variation selectors for the pieces is a needless complication; just > duplicate the few pieces with a hatched background. (The precise > style of hatching should be left to the font - that's not something > that you specify in plain text). > Leave the question of requesting M-square metrics to a (single) > variation selector and you are done. This solution quiets my qualms. Richard. From everson at evertype.com Mon Apr 3 17:35:52 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 23:35:52 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403220348.3efb4d1a@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <20170403220348.3efb4d1a@JRWUBU2> Message-ID: <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> On 3 Apr 2017, at 22:03, Richard Wordingham wrote: > Nobody said the glyphs for use in ordinary text had to be a fixed width. That's why there's a non-variant state and then two "on-square" variant states. If you want to construct a chessboard using a font, whether it is using an ASCII font or using Unicode characters with variation selectors, the glyphs in that context have to be fixed width (if you want, you know, a square chess board). > What I am saying is that the glyphs for the two new variants you are proposing need to harmonise with the block elements such as U+2581 LOWER ONE EIGHTH BLOCK. No... in a chess font the font designer has to draw those block-element characters differently, to harmonize with the > That requires uniform width *for those variants*. That is a key part of the glyph family's essence. In their original usage in graphic terminals, sure. And some people still emulate those, and when they use those characters they draw them for that purpose. In current ASCII-based chess-fonts, a set of characters is used to draw a line (of one kind or another) around the board, and when I looked for Unicode characters to map to these, the block elements were the ones that had the right structure, since they were high and low and left and right in the em square. > There is no such requirement on the glyphs for normal text use as at present. There is **in a chess font** if you want to be able to draw a box around the chessboard. All the ASCII-based chess fonts have glyphs for this. In the Danish Skak font (see Figure 3) the eight ASCII characters 9, _, ), |, \, 0, -, and = are used. In my proposal, I use eight Block Element characters. It works, and is flexible enough even to cater to ornate frames.
> If one had a row of squares in flowing text, one would want the row to act like a word. One might have to resort to gluing it together using CGJ or WJ. What are you on about? I'm talking about making 8×8 tables, not flowing rows of chessboards within a paragraph. I mean, sure, if you wanted to do that, you'd run into line-breaking weirdness, but nobody would do that, and so that weird situation just doesn't matter. All I was saying is that SPACE and NBSP aren't the right characters to use for the light squares on a game board. >>> Also having variants of U+25A1 and U+25A8 that match the game square filter modifiers seems quite legitimate. >> >> Um, wait... What are you proposing NBSP for? I'm confused now. If you like these two characters (and I am glad you do) there's no need for U+00A0 at all. > > To be pedantic, I said that the proposed variants were legitimate, not that I liked them. Um, ok. I don't see that's helpful in terms of improving or modifying the proposal. I stand by my proposal, which I have implemented successfully, even quickly as with William's Quest font. > I'm talking about looking for a U+2654 glyph for ordinary text when all the first font tried has is: > > 2654 FE01; Chesspiece on white; # WHITE CHESS KING > 2654 FE02; Chesspiece on black; # WHITE CHESS KING > > I must confess I am now wondering what the format 4 cmap should say about U+2654. I really don't know about the "format 4 cmap" text. I copied it from a successful VS proposal by Ken Lunde of Adobe. What I used was the liga and rlig tables. I didn't edit any cmap table per se and don't know how to do it. Without any VS character, 2654 just renders like an ordinary white king as drawn in the font. It only goes to a light or dark board-square glyph with the VS. > Should it give a glyph for U+2654 or not? Of course. Why wouldn't it? It's a graphic character. > I'm also wondering about Windows behaviour.
There was a time when Windows 7 only supported variation sequences if they appeared in the cmap 14 subtable. I don't know. Older software often doesn't support this. Quark XPress, which has become a completely awesome typesetting program, used to be terrible at it. Maybe people typesetting chessboards would have to use something other than some apps on Windows 7, or maybe something other than Windows 7 entirely. I can't use Unicode at all really on Mac OS 9, which I use from time to time. >>> If it's looking for an OpenType font for a glyph of the family , >> >> Or any OpenType substitution string. > > Most won't be recognised as needed. If the first font lacks a ligature for , fallback won't be used for it. Grapheme clusters and variation sequences get special treatment. I don't see how anything you're saying either identifies or tries to solve any actual problem with the proposal. The proposal says "put some substitution tables into your chess font to display a particular glyph" and some apps do that and some don't. You can't use VS with apps that don't. Michael Everson From everson at evertype.com Mon Apr 3 17:46:57 2017 From: everson at evertype.com (Michael Everson) Date: Mon, 3 Apr 2017 23:46:57 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <88b9a791-4d5c-534f-160a-ec0f521135d1@ix.netcom.com> <8A987C4A-CF9E-4EA0-A3C4-99B0DD8CEFDD@evertype.com> Message-ID: Markus, that there exist dozens of fonts designed for chessboard typesetting should suggest that people wish to use computers to do so. There are many, many volumes published on chess problems and there are some people who are passionately interested in that very specific intellectual pursuit. I really can't see any reason to second-guess or oppose a desire to have a simple and standardized way of representing that kind of data in legible plain text whose legibility can be optimized via a standardized font mechanism. Michael Everson > On 3 Apr 2017, at 22:44, Markus Scherer wrote: > > On Mon, Apr 3, 2017 at 2:33 PM, Michael Everson wrote: > On 3 Apr 2017, at 18:51, Markus Scherer wrote: >> >> It seems to me that higher-level layout (e.g., HTML+CSS) is appropriate for the board layout (e.g., via a table), board frame style, and cell/field shading. In each field, the existing characters should suffice. > > That isn't plain text. > > A lot of stuff needed for printing books and laying out PDFs and web pages goes beyond plain text. > > Whose requirement is it to represent an entire chess or checkers board in plain text? > > Other than a sort of puzzle of "what would it take to do so?" > > markus From asmusf at ix.netcom.com Mon Apr 3 17:53:42 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 15:53:42 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403232805.6e90972a@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> <9B849393-FEA0-43CE-A4E2-CA63DBAAF425@evertype.com> <20170403232805.6e90972a@JRWUBU2> Message-ID: <76a2f9a0-be7a-47dd-5144-c2309a13a20d@ix.netcom.com> An HTML attachment was scrubbed... URL: From everson at evertype.com Mon Apr 3 18:30:30 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 00:30:30 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> Message-ID: <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> On 3 Apr 2017, at 23:07, Asmus Freytag (c) wrote: > > On 4/3/2017 2:15 PM, Michael Everson wrote: >> On 3 Apr 2017, at 17:16, Asmus Freytag wrote: >> >>>>> The same indirection is at play here. >>>>> >>>> This is pure rhetoric, Asmus. It addresses the problem in no way. >>>> >>> Actually it does. I'm amazed that you don't see the connection. >>> >> I've never understood you when you back up into that particular kind of abstract rhetoric. > > Sometimes thinking through something in abstract terms actually clarifies the situation. Of course I know that's your view. It's just never been an effective communication strategy between you and me generally. >>> The "meaning" of a chess-problem matrix is the whole 8 × 8 board, not the empty dark square at b4 or the white pawn on > > In other words, you assert that partial boards never need to be displayed. (Let's take that as read, then). No, I am sure that a variety of board shapes can be set in plain text with these conventions, though the principal concern is classical chess notation. >> The "problem" the higher-level protocol is supposed to solve is the one where a chess piece of one colour sits in an em-squared zone whether light or dark. In lead type this was a glyph issue. Lead type had just exactly what my proposal has: A piece with in-line text metrics, spaced harmoniously with digits and letters, and square sorts with and without hatching. > > Leaving aside the abstract question whether modeling lead type is ipso facto the best solution in all cases...
I think it was a good expedient solution in lead type and that this proposal offers a robust parseable digital version of that solution, and I assert people will make use of that data structure. >> OK, then you support the part of the proposal that applies VS1 and VS2 to the chess pieces. > > My statement just was that a proposal where piece + VS should be M-square, piece w/o VS should be generic, might make some sense (and same for a suitable "empty" cell). > > The next question would be whether the alternation in background is best expressed in variation sequences or by some other means. I think the value in the data structures I have described is best retained as text. Anything else just seems like it would be needlessly complex. > If you never need to show just a single field, then I concede that the main drawback of variation selectors for the background style is absent; however, reading ahead in your message, the partial grid appears to be common, therefore the reason to choose an alternate solution to the background style is a strong one. Well, it's text, Asmus, so you can delete all but one line of a board if you want: ?????????????????? There. So... what are you talking about? It's a text matrix. It's like a kind of poem. ?????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????????????? ?????????? It even looks like one. That's a meaningful pattern. A kind of writing system. >> The colour of the matrix is NOT redundant for a human reader. >> > OK -- in that case you've actually made an argument for duplicating the codes for ALL the pieces (as well as the empties). Why? It's text. It's spelling. These structures are read. There's no reason to encode two letter C's because one is pronounced [k] and one [s]. > That way, you are guaranteed that (if the font supports the glyphs) you get what you want.
Then you'd have to have three, because there are three kinds of things that need to be in a single font: by itself, on a light square, and on a dark square. > With variation selectors for background color, you do not get what you want for the pieces. I implemented it! It works! > Having the system use specific character codes for the empties and variation selectors for the pieces is a needless complication; just duplicate the few pieces with a hatched background. (The precise style of hatching should be left to the font - that's not something that you specify in plain text). Your idea really isn't better. > Leave the question of requesting M-square metrics to a (single) variation selector and you are done. (The convention would be that 25A8 + VS results in an M-square glyph using some hatching that matches that of the hatched code points for chess pieces, not necessarily matching the hatching style that you get for 25A8 w/o the VS). (Alternatively, you could add a code for "dark cell" so that the hatching can be anything whether or not there's VS). You want WHITE CHESS KNIGHT, and WHITE CHESS KNIGHT ON SQUARE, and use a VS that changes the colour of the square? That is less legible in plain text than my proposal. Not as good. Detrimental to the user indeed. > Now, this model is much closer to the way VSs are used for math operators (but the reasoning may be a bit abstract, so I won't bother you with it here). I don't agree that your model is better than mine. Interesting, but not better. > If the proposal duplicates the pieces that are on dark squares and does not use any VS sequences to select the color of the square (but only to select the M-square metrics) it would be more robust and less complex to implement. (A chess font would not need to do anything but provide the right glyphs and ignore the VS, because they would be in M-square metrics anyway). Then you're still stuck for a solution for non-em-square characters for inline text.
Michael Everson From everson at evertype.com Mon Apr 3 18:31:26 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 00:31:26 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170403233431.5e5a4858@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <20170403233431.5e5a4858@JRWUBU2> Message-ID: <87747D84-2238-47DF-AFF1-D181FC726CFA@evertype.com> On 3 Apr 2017, at 23:34, Richard Wordingham wrote: > >> Leave the question of requesting M-square metrics to a (single) variation selector and you are done. > > This solution quiets my qualms. It does not meet my requirement, and it solves no problem. Michael Everson From kent.karlsson14 at telia.com Mon Apr 3 18:45:01 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Tue, 04 Apr 2017 01:45:01 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: I can well imagine people deeply interested in chess, to want to exchange chess board layouts in plain text emails (or at least not use quite hard-to-handle HTML code), and even parse them (programmatically) for analysis by a program, not wanting to bother with quite complex HTML/CSS stuff. Including making input easy (keyboard, palette), just "typing" the chess board layout (with pieces). But for HTML pages on chess, HTML/CSS markup is certainly preferable; but it shouldn't be impossible to just paste in a "plain text" chess board to an HTML page (with minimal formatting effort). One can (fairly easily) make a program to convert the "plain text" chess board to an HTML one. Book formatting? Old style book formatting still cannot use as sophisticated layouts as HTML can... (AFAIK). 
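[Editorial illustration] The conversion described above, from a plain-text board to HTML, can be sketched roughly as follows. This is only a sketch under stated assumptions, not part of the proposal: it assumes VS1 (U+FE00) marks a light square and VS2 (U+FE01) a dark square, and that U+25A1 and U+25A8 stand for the empty squares. The function names and the CSS class names are hypothetical.

```python
from html import escape

# Assumptions for this sketch (not confirmed by the proposal text quoted here):
VS1 = "\uFE00"  # taken to mean "on a light square"
VS2 = "\uFE01"  # taken to mean "on a dark square"
EMPTY = {"\u25A1", "\u25A8"}  # WHITE SQUARE and its hatched counterpart used as empty squares


def parse_row(row):
    """Split one plain-text board row into (base_character, shade) pairs."""
    cells = []
    i = 0
    while i < len(row):
        base = row[i]
        selector = row[i + 1] if i + 1 < len(row) else ""
        if selector == VS1:
            cells.append((base, "light"))
            i += 2
        elif selector == VS2:
            cells.append((base, "dark"))
            i += 2
        else:
            # No variation selector: an ordinary inline character.
            cells.append((base, None))
            i += 1
    return cells


def board_to_html(rows):
    """Render parsed rows as a minimal HTML table (class names are hypothetical)."""
    lines = ['<table class="board">']
    for row in rows:
        tds = []
        for base, shade in parse_row(row):
            # Empty squares become blank cells; pieces are emitted as text.
            content = "" if base in EMPTY else escape(base)
            tds.append(f'<td class="{shade or "plain"}">{content}</td>')
        lines.append("<tr>" + "".join(tds) + "</tr>")
    lines.append("</table>")
    return "\n".join(lines)
```

Because the shade travels with each character, the parser does not need to know where on the board a row sits, which is the property argued for in this thread.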
/Kent K On 2017-04-03 23:44, "markus.icu at gmail.com" wrote: > On Mon, Apr 3, 2017 at 2:33 PM, Michael Everson wrote: >> On 3 Apr 2017, at 18:51, Markus Scherer wrote: >>> >>> It seems to me that higher-level layout (e.g., HTML+CSS) is appropriate for the board layout (e.g., via a table), board frame style, and cell/field shading. In each field, the existing characters should suffice. >> >> That isn't plain text. > > A lot of stuff needed for printing books and laying out PDFs and web pages goes beyond plain text. > > Whose requirement is it to represent an entire chess or checkers board in plain text? > > Other than a sort of puzzle of "what would it take to do so?" > > markus From richard.wordingham at ntlworld.com Mon Apr 3 18:47:01 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Tue, 4 Apr 2017 00:47:01 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> Message-ID: <20170404005942.4aa42021@JRWUBU2> On Tue, 4 Apr 2017 00:30:30 +0100 Michael Everson wrote: > On 3 Apr 2017, at 23:07, Asmus Freytag (c) wrote: > You want WHITE CHESS KNIGHT, and WHITE CHESS KNIGHT ON SQUARE, and use a VS that changes the colour of the square? That is less legible in plain text than my proposal. Not as good. Detrimental to the user indeed. No, he wants two characters WHITE CHESS KNIGHT and WHITE CHESS KNIGHT ON DARK BACKGROUND, and a variation selector, say VS2, that when applied to them yields a glyph that works with block elements. It might be simpler if WHITE CHESS KNIGHT ON DARK BACKGROUND was defined as a character that worked with block elements. > Then you're still stuck for a solution for non-em-square characters for inline text. No, WHITE CHESS KNIGHT should continue to fulfil that role. My only worry is that one might need a variation selector, say VS1, to force the choice of a suitable glyph. Richard. From everson at evertype.com Mon Apr 3 19:06:16 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 01:06:16 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <76a2f9a0-be7a-47dd-5144-c2309a13a20d@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> <9B849393-FEA0-43CE-A4E2-CA63DBAAF425@evertype.com> <20170403232805.6e90972a@JRWUBU2> <76a2f9a0-be7a-47dd-5144-c2309a13a20d@ix.netcom.com> Message-ID: <7E773027-B43E-458D-B234-7D8F0B6BE34E@evertype.com> On 3 Apr 2017, at 23:53, Asmus Freytag wrote: > Alternatively, a system that uses no Variation selectors and only relies on OpenType ligatures might work even better. > > This would require one Empty and one Filled board cell, to ligate with whatever piece is supposed to sit on top of it. The use of Empty / Filled board cell would result in the correct metrics and by encoding an empty cell that is different from a "white square", there's no need to overload the use of the latter. That would not be more legible - quite the opposite, in fact - because it would add variable length to lines for every character in the 8 × 8 matrix in any environment where the ligation failed. That's not chess-legible. Michael Everson From everson at evertype.com Mon Apr 3 19:10:15 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 01:10:15 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: Message-ID: On 4 Apr 2017, at 00:45, Kent Karlsson wrote: > > Book formatting? Old style book formatting still cannot use as sophisticated layouts as HTML can... (AFAIK). Yeah, but come on, the chief use of chess characters is to cite them inline in text like any other symbol @ ? % & and the other equally chief use of chess characters is to set 8 × 8 chessboards which float in space in the layout as figures. The layout requirement isn't all that demanding that HTML offers a major advantage.
Michael Everson From everson at evertype.com Mon Apr 3 19:15:19 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 01:15:19 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170404004701.19ad750c@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> <20170404005942.4aa42021@JRWUBU2> Message-ID: <40133EC9-7DF7-40D5-B243-02DB62693B7F@evertype.com> On 4 Apr 2017, at 00:59, Richard Wordingham wrote: > No, he wants two characters WHITE CHESS KNIGHT and WHITE CHESS KNIGHT ON DARK BACKGROUND, and a variation selector, say VS2, that when applied to them yields a glyph that works with block elements. > > It might be simpler if WHITE CHESS KNIGHT ON DARK BACKGROUND was defined as a character that worked with block elements. I can't fathom how you would configure a font to do whatever it is you think you're describing here. I don't follow it. ...worked with which block elements, to do what? If it's to draw a box around the board, I already said, the answer is to change the graphics terminal block elements because in a chess-font environment their positional function is used, not their graphics terminal glyph. >> Then you're still stuck for a solution for non-em-square characters for inline text. > > No, WHITE CHESS KNIGHT should continue to fulfil that role. My only worry is that one might need a variation selector, say VS1, to force the choice of a suitable glyph. I don't get what you're on about. I've already solved this problem, and whatever it is you're describing sure doesn't sound intuitive. I've shown my implementations which do what I need them to do. I don't know if you can do the same, but go ahead and make your font to prove it, and write it up clearly in a counter-proposal if you think it's the right way to .
Michael Everson From everson at evertype.com Mon Apr 3 19:30:05 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 01:30:05 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170404004701.19ad750c@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <20170403220348.3efb4d1a@JRWUBU2> <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> <20170404004701.19ad750c@JRWUBU2> Message-ID: > I'm trying to work out whether we need a variation sequence for "chesspiece in a sentence". Of course! Haven't you ever seen chess problem texts? Check out the Fairy Chess proposal for encoding additional characters. Plenty of examples there. Sorry, I meant "Of course **not**!"; that is, chesspiece in a sentence is extremely common, and should be the default (not stylized) form. We can't repurpose that to be "chesspiece on a white square" because it hasn't been previously and changing that would affect the layout of existing data. Michael Everson From kent.karlsson14 at telia.com Mon Apr 3 20:01:33 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Tue, 04 Apr 2017 03:01:33 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: On 2017-04-04 02:10, "Michael Everson" wrote: > On 4 Apr 2017, at 00:45, Kent Karlsson wrote: >> >> Book formatting? Old style book formatting still cannot use as sophisticated layouts as HTML can... (AFAIK). > > Yeah, but come on, the chief use of chess characters is to cite them inline in text like any other symbol @ ? % & and the other equally chief use of chess characters is to set 8 × 8 chessboards which float in space in the layout as figures. The layout requirement isn't all that demanding that HTML offers a major advantage. In case you missed it, the statement I made above was in *SUPPORT* of your proposal (in general, but not necessarily all details)...
/Kent K > Michael Everson From everson at evertype.com Mon Apr 3 20:12:16 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 02:12:16 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: Message-ID: > On 4 Apr 2017, at 02:01, Kent Karlsson wrote: > >>> Book formatting? Old style book formatting still cannot use as sophisticated layouts as HTML can... (AFAIK). >> >> Yeah, but come on, the chief use of chess characters is to cite them inline in text like any other symbol @ ? % & and the other equally chief use of chess characters is to set 8 × 8 chessboards which float in space in the layout as figures. The layout requirement isn't all that demanding that HTML offers a major advantage. > > In case you missed it, the statement I made above was in *SUPPORT* of your proposal (in general, but not necessarily all details)... It's not easy to tell because the counter-approaches suggested are not well specified and really don't seem to be practical. It *is* important that there be an even number of characters in every row of 8 squares for fallback display to be better rather than worse, I think. I don't think it's possible to ensure that the rendering engine of every app displays the fallback identically. (It seems that Word and LibreOffice and Pages and Quark display a little differently; this seems to be because they load glyphs from some fonts before glyphs from others.) I found while setting the tables that it was convenient to have to remember that every one of the 64 characters had to have VS1 or VS2 along with it. Constructing a table from scratch and modifying an existing one both felt easier with uniform encoding.
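[Editorial illustration] The uniform-encoding point above can be sketched in a few lines: if every one of the 64 squares is a base character plus VS1 or VS2, each row has exactly 16 code points, so fallback display stays aligned. This is a minimal sketch, assuming VS1 (U+FE00) means light square, VS2 (U+FE01) means dark square, and U+25A1 / U+25A8 are the empty squares; `encode_board` and its argument format are hypothetical.

```python
# Assumptions for this sketch (labels are illustrative, not taken from the proposal):
VS1, VS2 = "\uFE00", "\uFE01"              # light-square / dark-square selectors
LIGHT_EMPTY, DARK_EMPTY = "\u25A1", "\u25A8"  # empty light / empty dark square


def encode_board(pieces):
    """Return 8 rows (rank 8 first); `pieces` maps squares like 'e1' to a chess character."""
    rows = []
    for rank in range(8, 0, -1):
        row = []
        for file_index, file_letter in enumerate("abcdefgh"):
            # a1 is a dark square: file index 0 plus rank 1 is odd.
            dark = (file_index + rank) % 2 == 1
            empty = DARK_EMPTY if dark else LIGHT_EMPTY
            base = pieces.get(f"{file_letter}{rank}", empty)
            # Every square gets exactly one selector, so every row has 16 code points.
            row.append(base + (VS2 if dark else VS1))
        rows.append("".join(row))
    return rows


if __name__ == "__main__":
    board = encode_board({"e1": "\u2654", "d8": "\u265B"})
    print("\n".join(board))
```

Moving a piece then means swapping one base character while leaving the selector in place, which is one way to read the remark that modifying an existing table felt easier.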
Michael Everson From asmusf at ix.netcom.com Mon Apr 3 20:21:38 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Mon, 3 Apr 2017 18:21:38 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> Message-ID: An HTML attachment was scrubbed... URL: From kent.karlsson14 at telia.com Mon Apr 3 20:51:53 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Tue, 04 Apr 2017 03:51:53 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: On 2017-04-04 03:21, "Asmus Freytag" wrote: > would look like this, if you base your proposal on ligatures rather than variation selectors (minimal case A above): > > ?????????????????????? That line has a lot of VSs in it... (I see them, since they happen to be visible in the email app I use.) > The disadvantage is that the fallback rendering does not line up; but I would regard that as a minor issue. I think Michael regards that non-lineup as a show-stopper. /Kent K From kent.karlsson14 at telia.com Mon Apr 3 20:51:57 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Tue, 04 Apr 2017 03:51:57 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: On 2017-04-04 03:12, "Michael Everson" wrote: > It *is* important that there be an even number of characters in every row of 8 squares for fallback display to be better rather than worse, I think. I agree. (Though *at present*, I happen to get a visible display of the VSs in the email app, which does not look too good.) > I found while setting the tables that it was convenient to have to remember that every one of the 64 characters had to have VS1 or VS2 along with it.
> Constructing a table from scratch and modifying an existing one both felt easier with uniform encoding. Yes. BUT, I would hope that chess enthusiasts would not have to think much about the encoding. Either using a special keyboard layout (momentarily) or using a palette for picking board item by board item seem to be better options. I'm sure someone will make a browser-based chess editor, complete with suitable palette, and having an empty board pre-edited to start out (replacing the empty squares as pieces are laid out or moved). /Kent K From kent.karlsson14 at telia.com Mon Apr 3 21:00:45 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Tue, 04 Apr 2017 04:00:45 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> Message-ID: On 2017-04-04 00:35, "Michael Everson" wrote: >> What I am saying is that the glyphs for the two new variants you are proposing need to harmonise with the block elements such as U+2581 LOWER ONE EIGHTH BLOCK. > > No... in a chess font the font designer has to draw those block-element characters differently, to harmonize with the > >> That requires uniform width *for those variants*. That is a key part of the glyph family's essence. > > In their original usage in graphic terminals, sure. And some people still emulate those, and when they use those characters they draw them for that purpose. In current ASCII-based chess-fonts, a set of characters is used to draw a line (of one kind or another) around the board, and when I looked for Unicode characters to map to these, the block elements were the ones that had the right structure, since they were high and low and left and right in the em square. > >> There is no such requirement on the glyphs for normal text use as at present. > > There is **in a chess font** if you want to be able to draw a box around the chessboard. I'm not too happy about this.
Maybe have VSs applied also to the chess box drawing chars? /Kent K From gerrietm at icloud.com Mon Apr 3 21:39:57 2017 From: gerrietm at icloud.com (Gerriet M. Denkmann) Date: Tue, 4 Apr 2017 09:39:57 +0700 Subject: Combining Class of Thai Nonspacing_Marks In-Reply-To: References: Message-ID: > On Mon, 3 Apr 2017 14:12:51 +0700 > "Gerriet M. Denkmann" wrote: > >> The Combining Class is used for normalisation of strings. >> Normalisation of strings is important for filenames in filesystems. >> >> As far as I know, a Thai consonant (Lo, Other_Letter) can have >> several Nonspacing_Marks. This cluster of nonspacing marks can >> contain at most one top/bottom vowel and at most one tone/other mark. >> There is no syntactic meaning in the order of these nonspacing >> marks. > > You're confusing the modern Thai language with the Thai script. It > seems that the Lao-style usage of NIKHAHIT as a vowel is known from > older Thai writing, and when used this way it could of course take a > tone mark. It also seems that the pressure to have both MAITAIKHU and > a tone mark on a consonant has been accepted for at least one minority > language. I stand corrected. I know nothing about other languages written with Thai characters. So the rule should be: A consonant may have zero or one tone/other marks and also zero or one top/bottom vowels. Exceptions: NIKHAHIT + tone mark (no top/bottom vowel) MAITAIKHU + tone mark (no top/bottom vowel) The order of these has no semantic meaning. All top/bottom vowels should have Combining Class 103, other marks should have Combining Class x (with 103 < x < 107), tone marks should have Combining Class 107. Is anybody working on, or responsible for, these things? Kind regards, Gerriet. From richard.wordingham at ntlworld.com Tue Apr 4 02:55:24 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Tue, 4 Apr 2017 08:55:24 +0100 Subject: Combining Class of Thai Nonspacing_Marks In-Reply-To: References:

Message-ID: <20170404085524.7a9bcfeb@JRWUBU2> On Tue, 4 Apr 2017 09:39:57 +0700 "Gerriet M. Denkmann" wrote: > So the rule should be: > > A consonant may have zero or one tone/other marks and also zero or > one top/bottom vowels. Exceptions: > NIKHAHIT + tone mark (no top/bottom vowel) > MAITAIKHU + tone mark (no top/bottom vowel) This list is not exhaustive. The order of MAITAIKHU and tone mark is significant - it should affect rendering. Formally, the Unicode Standard makes the point that the order of vowel above and tone mark is significant. > The order of these has no semantical meaning. This is true for the combination of a mark above and a mark below. For marks below, contrasting orders may be prevented (to a first approximation) by the chaos of the canonical combining classes. > All top/bottom vowels should have Combining Class 103, > other marks should have Combining Class x (with 103 < x < 107), > tone marks should have Combining Class 107. > > Is anybody working on or is responsible for these things? Unicode combining classes cannot be changed. All that can be done is to enforce the order of characters in normalised text. Asmus Freytag has been working on an extreme version of that that disallows minority languages in certain parts of domain names, and there is some pressure to start using dotted circles in rendering so as to punish transgressors, counterbalanced by the feeling that one shouldn't be suppressing minority languages. Marshall Phibun's jackboots are getting some exercise. There is some input checking, loosely based on WTT (Wing Thuk Thi). This may be implemented in such a way as to support the prohibition of minority languages. Richard. 
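[Archive annotation, not part of the original thread.] The point about canonical combining classes in the exchange above can be checked with a short sketch. It assumes Python's standard unicodedata module and picks three representative marks (SARA U, MAITAIKHU, MAI EK) on the consonant KO KAI; the helper names ccc_of and same_after_nfc are made up for illustration. It only shows what normalisation does to these sequences and says nothing about how any of them should be rendered.

```python
import unicodedata

# Representative characters; the choice of these four is an assumption
# made for illustration, not something taken from the thread.
KO_KAI = "\u0e01"     # base consonant
SARA_U = "\u0e38"     # vowel written below the consonant
MAITAIKHU = "\u0e47"  # mark written above the consonant
MAI_EK = "\u0e48"     # tone mark


def ccc_of(chars):
    """Map each character's code point label to its canonical combining class."""
    return {f"U+{ord(c):04X}": unicodedata.combining(c) for c in chars}


def same_after_nfc(a, b):
    """True if the two strings become identical under NFC normalisation."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


classes = ccc_of([SARA_U, MAITAIKHU, MAI_EK])

# Below vowel and tone mark have different non-zero classes, so canonical
# reordering folds both typing orders into one normalised sequence.
below_and_tone_equivalent = same_after_nfc(
    KO_KAI + SARA_U + MAI_EK, KO_KAI + MAI_EK + SARA_U
)

# MAITAIKHU has class 0, so it is never reordered relative to a tone mark
# and the two orders remain distinct strings after normalisation.
maitaikhu_and_tone_equivalent = same_after_nfc(
    KO_KAI + MAITAIKHU + MAI_EK, KO_KAI + MAI_EK + MAITAIKHU
)

if __name__ == "__main__":
    print(classes)
    print(below_and_tone_equivalent, maitaikhu_and_tone_equivalent)
```

Because the classes are frozen by the stability policy mentioned above, the second pair can only be constrained by rules layered on top of normalisation (input checking, rendering), not by changing the character properties.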
From duerst at it.aoyama.ac.jp Tue Apr 4 03:28:19 2017 From: duerst at it.aoyama.ac.jp (=?UTF-8?Q?Martin_J._D=c3=bcrst?=) Date: Tue, 4 Apr 2017 17:28:19 +0900 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: Message-ID: On 2017/04/03 23:41, Kent Karlsson wrote: > Hence the chess board lines should be displayed in a strong left-to-right > context (either via bidi markup characters, or via some higher order > bidi markup mechanism, such as the "bidi" attribute in HTML). Though in > most cases (not Arabic/Hebrew/... document), the bidi context will default > to left-to right... There never was a "bidi" attribute in HTML. You probably mean the "dir" attribute. Regards, Martin. From verdy_p at wanadoo.fr Tue Apr 4 08:00:07 2017 From: verdy_p at wanadoo.fr (Philippe Verdy) Date: Tue, 4 Apr 2017 15:00:07 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> Message-ID: 2017-04-04 1:30 GMT+02:00 Michael Everson : > On 3 Apr 2017, at 23:07, Asmus Freytag (c) wrote: > > > > On 4/3/2017 2:15 PM, Michael Everson wrote: > >> On 3 Apr 2017, at 17:16, Asmus Freytag wrote: > >> > >>>>> The same indirection is at play here. > >>>>> > >>>> This is pure rhetoric, Asmus. It addresses the problem in no way. > >>>> > >>> Actually it does. I'm amazed that you don't see the connection. > >>> > >> I?ve never understood you when you back up into that particular kind of > abstract rhetoric. > > > > Sometimes thinking through something in abstract terms actually > clarifies the situation. > > Of course I know that?s your view. It?s just never been an effective > communication strategy between you and me generally. > > >>> The ?meaning? of a chess-problem matrix is the whole 8 ? 8 board, not > the empty dark square at b4 or the white pawn on > > > > In other words, you assert that partial boards never need to be > displayed. (Let's take that as read, then). > > No, I am sure that a variety of board shapes can be set in plain text with > these conventions, though the principle concern is classical chess notation. > > >> The ?problem? the higher-level protocol is supposed to solve is the one > where a chess piece of one colour sits in an em-squared zone whether light > or dark. In lead type this was a glyph issue. Lead type had just exactly > what my proposal has: A piece with in-line text metrics, spaced > harmoniously with digits and letters, and square sorts with and without > hatching. 
> > > > Leaving aside the abstract question whether modeling lead type is ipso > facto the best solution in all cases? > > I think it was a good expedient solution in lead type and that this > proposal offers a robust parseable digital version of that solution, and I > assert people will make use of that data structure. > > >> OK, then you support the part of the proposal that applies VS1 and VS2 > to the chess pieces. > > > > My statement just was that a proposal where piece + VS should be > M-square, piece w/o VS should be generic, might make some sense (and same > for a suitable "empty" cell). > > > > The next question would be whether the alternation in background is best > expressed in variation sequences or by some other means. > > I think the value in the data structures I have described is best retained > as text. Anything else just seems it would be simply needlessly complex, > > > If you never need to show just a single field, then I concede that the > main drawback of variation selectors for the background style is absent; > however, reading ahead in your message, the partial grid appears to be > common, therefore the reason to choose an alternate solution to the > background style is a strong one. > > Well, it?s text, Asmus, so you can delete all but one line of a board if > you want: > > ?????????????????? > > There. So? what are you talking about? It?s a text matrix. It?s like a > kind of poem. > > ?????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????????????? > ?????????? > > It even looks like one. That?s a meaningful pattern. A kind of writing > system. > For me it looks like ASCII art, a hack mixing various characters intended for different uses and ignoring all semantics, only working because it reuses similar-looking glyphs instead of being an actual encoding. That represetnation is absultely not semantically coherent. 
If we want to have true checkerboard cells, we need characters specifically for them, and in them we'll place (or not) chess pieces or any other suitable symbol or letter. This means creating clusters (cell+ZWJ+piece). This will be coherent. If we want to have borders for boards, we need coherent characters for them (we do not expect them to be combined with pieces, just that they will properly glue with cells in the middle of the board, and that their metrics match them in suitable fonts). The fact that legacy renderers or fonts won't display that correctly is definitely not an argument. Many scripts still have problems being represented with legacy renderers or fonts. But the encoding is made to be coherent semantically. Fonts and renderers will adapt their properties to render what is semantically wanted and that will also be pleasing to read, and they still will be able to use various variants (e.g. emoji styles for pieces, possibly with 3D effects and colors, possibly animated pieces, or alternate decorative patterns in board cells, possibly photograph-based, such as wood, marble, grass, sand, glass, iron...) -------------- next part -------------- An HTML attachment was scrubbed... URL: From verdy_p at wanadoo.fr Tue Apr 4 08:11:17 2017 From: verdy_p at wanadoo.fr (Philippe Verdy) Date: Tue, 4 Apr 2017 15:11:17 +0200 Subject: Tags and custom vector glyph emoji (from Re: Tailoring the Marketplace (is: Re: Unicode Emoji 5.0 characters now final)) In-Reply-To: <12034663.23788.1491301116912.JavaMail.defaultUser@defaultHost> References: <11364706.56745.1491240615392.JavaMail.root@webmail12.bt.ext.cpcloud.co.uk> <7794797.61788.1491243181509.JavaMail.defaultUser@defaultHost> <12034663.23788.1491301116912.JavaMail.defaultUser@defaultHost> Message-ID: 2017-04-04 12:18 GMT+02:00 William_J_G Overington : > > ...
developers prefer investing time in SVG renderers or existing font > technologies for OpenType (SVG fonts will come later when it will be > capable of doing the same things as OpenType, for now it does not cover all > the existing needs). > > Well, I do not know what developers prefer. There seems to be a need to > send custom emoji in interoperable Unicode plain text and I have put > forward an idea for how to do it. > You only know what you yourself prefer: can't you see that what you propose is even less powerful than a **STANDARD** SVG path? It already has everything you "propose", except that it is already widely implemented and developers will prefer reusing it directly. An SVG path looks like "M100,100h800v800h-800z" to draw an 800-unit square centered in a 1000-unit square; there's no need for "x" or "y", there are shortcuts already defined for horizontal or vertical strokes (using relative or absolute coordinates) and path closure, and it supports straight segments, cubic and quadratic splines and elliptic arcs. Its internal "machine" is very well documented (with extensive conformance tests for renderers, including for all supported geometric transforms and conversion of paths for creating stroke styles instead of filling them directly). -------------- next part -------------- An HTML attachment was scrubbed... URL: From otto.stolz at uni-konstanz.de Tue Apr 4 08:21:02 2017 From: otto.stolz at uni-konstanz.de (Otto Stolz) Date: Tue, 4 Apr 2017 15:21:02 +0200 Subject: Encoding of old compatibility characters In-Reply-To: <83fuht6fqg.fsf@gnu.org> References: <92ba6970-86e1-5d80-e3c9-239283a384b0@gmail.com> <41b2170a-6efb-518d-8c02-3881fbb09bae@kli.org> <2ba990ce-9d57-4e8b-b4dd-e9f1a821cd3b@gmail.com> <4q7f39oed2.fsf@chem.ox.ac.uk> <2d2b2a87-f4d8-7f28-59de-f6cf7437c9c5@ix.netcom.com> <7e7af7d6-dfc4-159a-832f-e60f24136b0f@gmail.com>

<83fuht6fqg.fsf@gnu.org> Message-ID: Am 31.03.2017 um 09:57 schrieb Eli Zaretskii: > Arial Unicode MS supports that character [U+23E8], FWIW. Not on my good ole Wndows XP SP3 system. Best wishes, Otto From eliz at gnu.org Tue Apr 4 09:58:33 2017 From: eliz at gnu.org (Eli Zaretskii) Date: Tue, 04 Apr 2017 17:58:33 +0300 Subject: Encoding of old compatibility characters In-Reply-To: (message from Otto Stolz on Tue, 4 Apr 2017 15:21:02 +0200) References: <92ba6970-86e1-5d80-e3c9-239283a384b0@gmail.com> <41b2170a-6efb-518d-8c02-3881fbb09bae@kli.org> <2ba990ce-9d57-4e8b-b4dd-e9f1a821cd3b@gmail.com> <4q7f39oed2.fsf@chem.ox.ac.uk> <2d2b2a87-f4d8-7f28-59de-f6cf7437c9c5@ix.netcom.com> <7e7af7d6-dfc4-159a-832f-e60f24136b0f@gmail.com>

<83fuht6fqg.fsf@gnu.org> Message-ID: <838tngp6cm.fsf@gnu.org> > From: Otto Stolz > Date: Tue, 4 Apr 2017 15:21:02 +0200 > > Am 31.03.2017 um 09:57 schrieb Eli Zaretskii: > > Arial Unicode MS supports that character [U+23E8], FWIW. > > Not on my good ole Wndows XP SP3 system. This here is also XP SP3. Maybe some package I have installed updated the font? From wjgo_10009 at btinternet.com Tue Apr 4 05:18:36 2017 From: wjgo_10009 at btinternet.com (William_J_G Overington) Date: Tue, 4 Apr 2017 11:18:36 +0100 (BST) Subject: Tags and custom vector glyph emoji (from Re: Tailoring the Marketplace (is: Re: Unicode Emoji 5.0 characters now final)) In-Reply-To: References: <11364706.56745.1491240615392.JavaMail.root@webmail12.bt.ext.cpcloud.co.uk> <7794797.61788.1491243181509.JavaMail.defaultUser@defaultHost> Message-ID: <12034663.23788.1491301116912.JavaMail.defaultUser@defaultHost> Philippe Verdy wrote: > What you are describing is reinventing the wheel, notably basically what SVG paths already define. Well, I am trying to express, within a tag sequence that could be included in an interoperable Unicode plain text message, the glyph information for one emoji glyph of an OpenType colour font. I have not included anything about SVG. > Font encoding technologies define their own system using multiple tables and a compact dictionnary of tables with binary encoding, not suitable for inclusion in plain-text. Yes, that is why I have devised this format, so that the glyph information for one emoji glyph of an OpenType colour font could be included in a Unicode plain text message. > Note also that Emojis could be animated when rendered on screen (that's what we already see in many implementations using GIF icons for their emojis, even if they are not easily resizable). Animated SVG for now is still in beta but starts being used on some sites and rendered by web browsers. SVG images may also be scripted and may include accessbility feature (e.g. 
with sound played or hint bubbles displayed when hovering them). The format that I suggested could be extended if desired. For example, h is for an unanimated glyph. There could be added q and e if desired, so that instead of h one uses q for completing the glyph for each frame, and then e to export the complete animated glyph. For example, as follows. q means {define a complete glyph of advance width w from the glyph or glyphs in the glyphs buffer and place it in the animation buffer; reset everything except the animation buffer ready to define the next glyph in the animation;} e means {produce an animated glyph from the contents of the animation buffer ready for access by the main program; halt;} Yes, accessibility features are important and I will try to think about including them. Readers are welcome to make suggestions as to what is needed. > You only cover a part of what is needed .... Well, yes, I suppose so, yet what I have published could get something started and anything else that is needed could be added, either by me or by the Unicode Technical Committee and the Emoji Subcommittee if people are interested in implementing the idea. > .... but hope that someone will invest time to implet it in a renderer: Well, yes eventually. I am hoping that the idea will be discussed in the mailing list and then go forward to the Emoji Subcommittee and then go to the Unicode Technical Committee and then become part of The Unicode Standard and then be used by people. Many people think of new encoding ideas and put them forward to the Unicode Technical Committee, sometimes starting with a post in this mailing list before a formal submission in the hope that the discussion will be helpful. Such discussion often improves the formal submission. That is the process, the way that Unicode progresses. > ... 
developers prefer investing time in SVG renderers or existing font technologies for OpenType (SVG fonts will come later when it will be capable of doing the same things as OpenType, for now it does not cover all the existing needs). Well, I do not know what developers prefer. There seems to be a need to send custom emoji in interoperable Unicode plain text and I have put forward an idea for how to do it. William Overington Tuesday 4 April 2017 From asmusf at ix.netcom.com Tue Apr 4 11:51:45 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Tue, 4 Apr 2017 09:51:45 -0700 Subject: Encoding of old compatibility characters In-Reply-To: <838tngp6cm.fsf@gnu.org> References: <92ba6970-86e1-5d80-e3c9-239283a384b0@gmail.com> <41b2170a-6efb-518d-8c02-3881fbb09bae@kli.org> <2ba990ce-9d57-4e8b-b4dd-e9f1a821cd3b@gmail.com> <4q7f39oed2.fsf@chem.ox.ac.uk> <2d2b2a87-f4d8-7f28-59de-f6cf7437c9c5@ix.netcom.com> <7e7af7d6-dfc4-159a-832f-e60f24136b0f@gmail.com>

<83fuht6fqg.fsf@gnu.org> <838tngp6cm.fsf@gnu.org> Message-ID: <3cf59c63-ee7e-a805-d8d3-84b1597b20e7@ix.netcom.com> An HTML attachment was scrubbed... URL: From asmusf at ix.netcom.com Tue Apr 4 11:55:04 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Tue, 4 Apr 2017 09:55:04 -0700 Subject: Combining Class of Thai Nonspacing_Marks In-Reply-To: References:

Message-ID: An HTML attachment was scrubbed... URL: From mark at macchiato.com Tue Apr 4 11:58:18 2017 From: mark at macchiato.com (=?UTF-8?B?TWFyayBEYXZpcyDimJXvuI8=?=) Date: Tue, 4 Apr 2017 18:58:18 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> Message-ID: Amusing at this is, hard to believe that people are spending this much time on an April Fool's posting. I'm looking forward to similar postings on checkers and go pieces. As a matter of fact, one that proposes adding new characters for every possible configuration of a go board would be imaginative. And I'm looking also forward to the ?+ZWJ+?? (etc) proposal. Mark Mark On Tue, Apr 4, 2017 at 3:00 PM, Philippe Verdy wrote: > > > 2017-04-04 1:30 GMT+02:00 Michael Everson : > >> On 3 Apr 2017, at 23:07, Asmus Freytag (c) wrote: >> > >> > On 4/3/2017 2:15 PM, Michael Everson wrote: >> >> On 3 Apr 2017, at 17:16, Asmus Freytag wrote: >> >> >> >>>>> The same indirection is at play here. >> >>>>> >> >>>> This is pure rhetoric, Asmus. It addresses the problem in no way. >> >>>> >> >>> Actually it does. I'm amazed that you don't see the connection. >> >>> >> >> I?ve never understood you when you back up into that particular kind >> of abstract rhetoric. >> > >> > Sometimes thinking through something in abstract terms actually >> clarifies the situation. >> >> Of course I know that?s your view. It?s just never been an effective >> communication strategy between you and me generally. >> >> >>> The ?meaning? of a chess-problem matrix is the whole 8 ? 8 board, not >> the empty dark square at b4 or the white pawn on >> > >> > In other words, you assert that partial boards never need to be >> displayed. (Let's take that as read, then). 
>> >> No, I am sure that a variety of board shapes can be set in plain text >> with these conventions, though the principle concern is classical chess >> notation. >> >> >> The ?problem? the higher-level protocol is supposed to solve is the >> one where a chess piece of one colour sits in an em-squared zone whether >> light or dark. In lead type this was a glyph issue. Lead type had just >> exactly what my proposal has: A piece with in-line text metrics, spaced >> harmoniously with digits and letters, and square sorts with and without >> hatching. >> > >> > Leaving aside the abstract question whether modeling lead type is ipso >> facto the best solution in all cases? >> >> I think it was a good expedient solution in lead type and that this >> proposal offers a robust parseable digital version of that solution, and I >> assert people will make use of that data structure. >> >> >> OK, then you support the part of the proposal that applies VS1 and VS2 >> to the chess pieces. >> > >> > My statement just was that a proposal where piece + VS should be >> M-square, piece w/o VS should be generic, might make some sense (and same >> for a suitable "empty" cell). >> > >> > The next question would be whether the alternation in background is >> best expressed in variation sequences or by some other means. >> >> I think the value in the data structures I have described is best >> retained as text. Anything else just seems it would be simply needlessly >> complex, >> >> > If you never need to show just a single field, then I concede that the >> main drawback of variation selectors for the background style is absent; >> however, reading ahead in your message, the partial grid appears to be >> common, therefore the reason to choose an alternate solution to the >> background style is a strong one. >> >> Well, it?s text, Asmus, so you can delete all but one line of a board if >> you want: >> >> ?????????????????? >> >> There. So? what are you talking about? It?s a text matrix. 
It?s like a >> kind of poem. >> >> ?????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????????????? >> ?????????? >> >> It even looks like one. That?s a meaningful pattern. A kind of writing >> system. >> > > For me it looks like ASCII art, a hack mixing various characters intended > for different uses and ignoring all semantics, only working because it > reuses similar-looking glyphs instead of being an actual encoding. > That represetnation is absultely not semantically coherent. > > If we want to have true checkboard cells, we need characters specifically > for them, and in them we'll place (or not) chess pieces or any other > suitable symbol or letter. This means creating clusters (cell+ZWJ+piece). > This will be coherent. > > If we want to have borders for boards, we need coherent characters for > them (we do not expct them to be combined with pieces, just that they will > properly glue with cells in the middle of the board, and that their metric > match them in suitable fonts). > > The fact that legacy renderers or fonts won't display that correctly is > definitely not an argument. Many scripts still have problems being > represented with legacy renderers or fonts. But the encoding is made to be > coherent semantically. Fonts and rederers will adapt their properties to > render what is semantically wanted and that will be also pleasing to read, > and they still will be able to use various variants (e.g. emoji styles for > pieces, possibly with 3D effects and colors, possibly animated pieces, or > alternate decorative patterns in board cells, possibly photographic-based, > such as wood, marble, grass, sand, glass, iron...) > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From everson at evertype.com Tue Apr 4 12:47:06 2017 From: everson at evertype.com (Michael Everson) Date: Tue, 4 Apr 2017 18:47:06 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <7A9A7F35-3F4E-4C38-AA36-136399111271@evertype.com> <742647d6-75f8-2f59-4b60-75a67ea73572@ix.netcom.com> <5062E7FE-57DA-49A7-89C1-776D6CDE2E61@evertype.com> <915358C1-319D-4494-A915-2FAA557F8840@evertype.com> Message-ID: On 4 Apr 2017, at 17:58, Mark Davis ?? wrote: > Amusing at this is, hard to believe that people are spending this much time on an April Fool's posting. I wondered how long it would take for someone to be taken in. The joke, of course, was hidden not inside the proposal, but inside the date. > I'm looking forward to similar postings on checkers You haven?t bothered to read the proposal, have you? > and go pieces. G? notation is rather different and this kind of solution might not be appropriate for it. That, however, is a different problem unrelated to this proposal. > As a matter of fact, one that proposes adding new characters for every possible configuration of a go board would be imaginative. You really haven?t bothered to read the proposal, have you? > And I'm looking also forward to the ?+ZWJ+?? (etc) proposal. I recommend that you read the proposal before attempting to dismiss it. Michael Everson PS. Interested readers may wish to review some other proposals by myself and others. N4014 2011-04-01 was successful N4012 2011-04-01 was successful N4011 2011-04-01 was not successful* N3412 2008-04-01 was not successful N3066 2006-04-01 was successful N2935 2005-04-01 was successful N258A 2003-04-01 was not successful N2338 2001-04-01 was successful N2326 2001-04-01 was not successful *Though given recent symbol work by some it might be prudent to revive some part of this one. PSS: While games like chess, draughts, g?, and xi?ngq? are pastimes, they are also complex intellectual pursuits which have amassed a sizeable literature over many centuries. 
Chess notation and chess diagrams is a good example. Kifu notation for g? is another. The UCS encodes characters which represent the pieces of many games. It is reasonable to expect that people may wish to use these characters to represent game data. Asmus? idea that the 12 chess characters be duplicated or triplicated in order to set chess diagrams is wasteful of encoding space and not extensible either. We have seen that some 84 additional chess characters have been proposed; it would be a very bad idea to expand that to 168 or 252 characters. The appropriate way to respond to the great many differences in the ASCII-encoded existing chess fonts is to simply make use of existing characters in the standard to alter, in a systematic and standardized way, the glyph representation of the 12 already-encoded characters with 2 other already-encoded characters, as described in the proposal. Years ago a proposal similar to Asmus? was made, in discussion if not in a formal document. The answer was ?a higher level protocol would be best for chessboard notation?. Well, the simplest higher-level protocol for this is to use variation selectors to alter the font display, just as we use them for DIGIT ZERO, 16 Myanmar letters, INTERSECTION, UNION, SUBSET OF WITH NOT EQUAL TO, a bunch of other mathematical characters and more than 300 pictographs. Michael Everson From irgendeinbenutzername at gmail.com Tue Apr 4 12:53:29 2017 From: irgendeinbenutzername at gmail.com (Charlotte Buff) Date: Tue, 4 Apr 2017 19:53:29 +0200 Subject: Emoji Compatibility Symbols Message-ID: I am trying to reconstruct what the 66 emoji compatibility symbols that were included in some old drafts originally mapped to, but useful information on the web seems a bit sparse. It was fairly easy to figure out that compatibility symbols 1 through 16 eventually became proper characters (or sequences) and turned into ??, ??, ??, ??, ??, ????, ????, ????, ????, ????, ????, ????, ????, ????, ????, and ?. 
However, that still leaves 50 symbols that don't correspond to any Unicode characters. I did find this project that assigned names to private-use codepoints, and the related mappings from those codepoints to the different carrier sets . Unfortunately, I still don?t know what images or meanings were associated with those numbers. Searching for SoftBank emoji gave me a neatly organized list of 404 errors and KDDI was equally fruitless. Documents on the Unicode website itself regularly mention that EMS 17 through 66 are needed for round-trip mappings but never what these mappings actually were as far as I could find. Does anybody have this information available? -------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.wordingham at ntlworld.com Tue Apr 4 12:54:31 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Tue, 4 Apr 2017 18:54:31 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <20170403220348.3efb4d1a@JRWUBU2> <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> <20170404004701.19ad750c@JRWUBU2> <20170404185431.07dbe483@JRWUBU2> Message-ID: <07DD2CE0-5510-49A3-883A-EF7A1A34C80E@evertype.com> On 4 Apr 2017, at 18:54, Richard Wordingham wrote: > > On Tue, 4 Apr 2017 01:30:05 +0100 > Michael Everson wrote: > >>> I'm trying to work out whether we need a variation sequence for "chesspiece in a sentence?. >> >> Of course! Haven?t you ever seen chess problem texts? Check out the Fairy Chess proposal for encoding additional characters. Plenty of examples there. > > Your examples did not have to contend with the possibility of fonts that only support the variants for drawing chessboards. Um, what? Why would anyone make a font that supports the variants for drawing chessboards (which require the encoded characters 2654..265F) not put in glyphs for those? FontLab is the program I use to add OpenType features to my fonts, and if I try to add a sequence like 2654 + FE00 and the font doesn?t have a 2654, if flags it as an error and insists that the character appear in the font. OK, someone could be perverse and not add glyphs to those code positions, but? But nobody making a chess font with actual support for chess would do that. So this is another red herring. As far as I can see, your worries are groundless, and nothing has suggested that there?s something wrong with the proposal. Also, having implemented it in three or four different fonts now, I find that it works. It does the job, and it?s easy to use to edit. >> Sorry, I meant ?Of course **not**!? that is, chesspiece in a sentence is extremely common, and should be the default (not stylized) form. We can?t repurpose that to be ?chesspiece on a white square? because it hasn?t been previously and changing that would affect the layout of existing data. 
> > But would not your proposal make it legitimate for a font to supply only chess pieces on dark backgrounds for the chess piece characters? What does "legitimate" mean? Nothing prevents someone from drawing the 16 Myanmar base characters with rings at the ends of their glyphs even though now VS are being recommended for that presentation. Is it legitimate to do that? Of course it is. It's legitimate to make Myanmar fonts with square glyphs rather than circular ones. This proposal provides a stable encoding model for drawing chessboards simply, with fonts. Currently there are other fonts which do this, but they do not share encodings, and so sharing chessboard data is dependent on whether you have set up your board in the same font encoding that somebody else is using. Otherwise it doesn't work, and your text is corrupt and you have to re-key various elements in order to use the glyphs of the other font. This problem is described in detail at the beginning of the proposal. It is the same problem we had with ISO/IEC 8859-1, -2, -3, -4, etc. before we had the UCS. So: we have unstable non-Unicode encodings for chessboards now; this proposal provides stable Unicode encodings. This can only benefit the community of users of chess fonts. Anybody who isn't setting chessboards is unaffected, just as I am unaffected by variation selectors used for glyph variation in mathematical fonts. (I might add the slashed zero glyph to Everson Mono, though.) This proposal does this while leaving the base characters alone so they can be used as chesspieces in text (as they have been since Unicode 1.1) and by adding a mechanism to construct the glyphs necessary for presenting chessboard data. This proposal uses a mechanism which has already been used for dozens of regular characters and 310 times for some popular pictographs. No new characters need to be added. Just a list of items in a text file. Can you identify an actual problem?
Michael Everson From gwalla at gmail.com Tue Apr 4 21:41:25 2017 From: gwalla at gmail.com (Garth Wallace) Date: Tue, 4 Apr 2017 19:41:25 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170404004701.19ad750c@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

-------------- next part -------------- An HTML attachment was scrubbed... URL: From everson at evertype.com Wed Apr 5 09:49:41 2017 From: everson at evertype.com (Michael Everson) Date: Wed, 5 Apr 2017 15:49:41 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <20170405091030.116883a5@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<83fuht6fqg.fsf@gnu.org> <838tngp6cm.fsf@gnu.org> <3cf59c63-ee7e-a805-d8d3-84b1597b20e7@ix.netcom.com> Message-ID: <6152360a-6df1-d8b2-786a-aa54c7a843f4@uni-konstanz.de> Hello, On 31.03.2017 at 09:57, Eli Zaretskii wrote: > Arial Unicode MS supports that character [U+23E8], FWIW. From: Otto Stolz Date: Tue, 4 Apr 2017 15:21:02 +0200 > Not on my good ole Windows XP SP3 system. On 4/4/2017 7:58 AM, Eli Zaretskii wrote: > This here is also XP SP3. Maybe some package I have installed updated > the font? On 04.04.2017 at 18:51, Asmus Freytag wrote: > AFAIK, this font is / was installed by MS Office. I have got MS Word 2002 and MS Excel 2000. Maybe later versions bring an amended version of Arial Unicode MS. Cheers, otto From everson at evertype.com Wed Apr 5 10:21:30 2017 From: everson at evertype.com (Michael Everson) Date: Wed, 5 Apr 2017 16:21:30 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <20170403220348.3efb4d1a@JRWUBU2> <421BC3D3-DF71-4D76-93E7-CAFDEDFBFFCB@evertype.com> <20170404004701.19ad750c@JRWUBU2> <175BB7CB-08BA-4E80-8337-7EBCCB90B141@evertype.com> Message-ID: <7E2DEDE2-A1D1-4937-85F6-DD0AB6432431@evertype.com> On 5 Apr 2017, at 15:52, Garth Wallace wrote: > [...] I'm just saying that if having symbols without VS not match either of the VSes is a sticking point, it's not hard to work around. Oh, I see. Well, yes, I agree with you in part. But here's the thing. It is *permissible* for proportional-inline-chesspieces to be identical to emsquare-chessboard-chesspiece if a designer *wants* to do it that way. But it is *just* as permissible for proportional-inline-chesspieces to be truly proportional and unsuitable for chessboard typesetting (and that's how it has been since Unicode 1.1). Look, here is a choice:
U+2654 - WHITE CHESS KING whose width might or might not be
U+2654 FE00 - WHITE CHESS KING whose glyph is a white/light em-square for chessboards
U+2654 FE01 - WHITE CHESS KING whose glyph is a black/dark em-square for chessboards
I think this is enough. Or it could be:
U+2654 - WHITE CHESS KING whose width might or might not be
U+2654 FE00 - WHITE CHESS KING whose glyph is the same as the unmodified U+2654, whatever it is
U+2654 FE01 - WHITE CHESS KING whose glyph is a white/light em-square for chessboards
U+2654 FE02 - WHITE CHESS KING whose glyph is a black/dark em-square for chessboards
There's some precedent for this, where some symbols have one VS for "text glyph" and a different VS for "emoji glyph" and of course the unmodified symbol can be used and will display as the font has it. I don't think the second is necessary. It's not necessary for this, for example:
U+0030 - DIGIT ZERO
U+0030 FE00 - short diagonal stroke form
U+0030 FE0E - text style
U+0030 FE0F - emoji style
OK, "text style" is identical to unmodified U+0030, but the only reason that attribute exists is in distinction to "emoji style". Compare also:
U+1000 - MYANMAR LETTER KA
U+1000 FE00 - dotted form
>>> Currently, chess fonts can be (roughly) divided into "diagram fonts" and "notation fonts". >> >> That's not true. There are some which do all three. > > There are, sure. I said roughly: many don't do both & rely on font-switching. But even more of them can't rely on font-switching because the encoding of the piece on light and dark chessboard varies from supplier to supplier. All current chess fonts are ASCII hacks. >>> None of the features required for a diagram font are unacceptable in figurine notation: >> >> The white ones may be too wide for use in text. > > Not visually ideal, but legible. Yes but if we were to unify unmodified chesspieces with the pieces on white squares it could invalidate the metrics of text like http://evertype.com/standards/unicode-list/34-variantim.png As I say, it's *permissible* to have the unmodified chesspiece glyph be the same as the white-square chesspiece glyph, but it's not obligatory, and we must preserve font designer choice here. Michael Everson From asmusf at ix.netcom.com Wed Apr 5 10:21:51 2017 From: asmusf at ix.netcom.com (Asmus Freytag (c)) Date: Wed, 5 Apr 2017 08:21:51 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> Message-ID: On 4/5/2017 5:22 AM, William_J_G Overington wrote: > Asmus Freytag wrote: > >> .... - relying solely on ligatures has the benefit of not involving the UTC at all, therefore it could be implemented today without delay). > I am wondering whether that is correct.
> > Where one implements a ligature using a ZWJ without the Unicode Technical Committee having agreed then that is fine where the meaning of the text is unchanged: for example, if one chooses to include, say, a pp ligature in a font. > > Yet to implement a ligature using a ZWJ where the meaning is changed, then I am wondering whether that needs the agreement of the Unicode Technical Committee. > > There have been some recent encodings where ZWJ has been used with two or more emoji characters to produce a new emoji character where the meaning of the result is different from the combined meanings of the ingredients, the meaning of that new character not always or maybe never being congruently obvious unless one already knows the meaning. > > If a ZWJ encoding for producing chess diagrams were to be introduced, then if it is not UTC that decides the detail, then who does decide? Would a non-UTC decision be interoperable, would it be supported? There's no need to use a ZWJ, because there's no existing other use of a square before a chess piece that needs to be preserved. A./ PS: I assume it's safe to ignore the rest of your message, being based on a wrong premise? -------------- next part -------------- An HTML attachment was scrubbed... URL: From asmusf at ix.netcom.com Wed Apr 5 10:25:33 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Wed, 5 Apr 2017 08:25:33 -0700 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<83fuht6fqg.fsf@gnu.org> <838tngp6cm.fsf@gnu.org> <3cf59c63-ee7e-a805-d8d3-84b1597b20e7@ix.netcom.com> <6152360a-6df1-d8b2-786a-aa54c7a843f4@uni-konstanz.de> Message-ID: <2e0e609c-5516-f055-cd85-05c5fbe65963@ix.netcom.com> > I have got MS Word 2002 and MS Excel 2000. > Maybe, later versions bring an amended version of Arial Unicode MS. Maybe. A./ > > > From everson at evertype.com Wed Apr 5 10:37:38 2017 From: everson at evertype.com (Michael Everson) Date: Wed, 5 Apr 2017 16:37:38 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <57079982-d48c-69aa-6195-20fe08b332e3@ix.netcom.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

<20170402095322.17526d87@JRWUBU2> <16625622-2E1B-4053-A428-CAF97F7916F3@evertype.com> <20170402172710.54c37ad2@JRWUBU2> <2e5750ee-c110-2b15-7e7e-cfc166167ba8@ix.netcom.com> <20170403203355.6cbfc184@JRWUBU2> <20170405091030.116883a5@JRWUBU2> Message-ID: <762B29A1-3161-41DD-82C6-3CB07B86EC91@evertype.com> On 5 Apr 2017, at 11:05, Asmus Freytag wrote: > Actually, I'm now leaning towards a preference for any scheme that does not use VS, but relies on ligatures. This would make editing the text more difficult and would yield less legible results in environments where the ligatures aren't supported. > Such a scheme would need > a) no matching spacing for the bare pieces (the ligature with the empty square would result in the correct spacing) Well, that's no different at all than my scheme except you ligate pawn and empty square as I ligate pawn and VS. But your scheme has the disadvantage of being similar to the emoji sequences, which would appear to require ZWJ between the pawn and the empty square. That means you have more characters to deal with and in fact you end up with variable length chessboard lines, which yields the worst possible results in fallback. > b) no pieces with built-in dark background (pieces simply ligate with the empty "black" square). Or as I have it, pawn and VS. >> Now, what happens to the two schemes if rendered with yellow text ('foreground') on a blue background? > > According to Michael, the effect should be that of lead typography. Well that's not really what I was talking about with lead typography. (That's more the ASCII-art argument.) > This would mean that the entire ligature has the same ink color, and all parts that are not "ink" are the background color (paper color). Yes, paper and ink. As in http://evertype.com/standards/unicode-list/looking-glass-yellow-blue.png > Unlike lead typography, the ink can be perfectly opaque, allowing a lighter color to show on a dark background.
Or the opacity of the foreground can be set to an intermediate level, allowing the ink to look greenish in your example. In any case this is a red herring. > (The results with a VS based system are not really different, because I imagine, the actual glyph repertoire is identical in all alternatives discussed so far - relying solely on ligatures has the benefit of not involving the UTC at all, therefore it could be implemented today without delay). Except that ligation is problematic for actually making chessboards. The risk that fallback becomes illegible is hugely magnified. Here: http://evertype.com/standards/unicode-list/ligation-vs-VS.png On the left we have your scheme, shown in a mono-width font; on the right, mine. Ligation, in fallback, will lead to variable-width text on each of the eight lines, which will differ depending on how many chess pieces, if any, appear. With the VS solution, *all* chess data will have the same number of characters in each line. In fact, parsers could identify misplaced VS characters (VS1 where VS2 would have to be there) or missing ones. Moreover, reverse-parsers (or whatever the term could be) could take narrative text data as in: http://evertype.com/standards/unicode-list/34-variantim.png and generate tables from it (if the narrative data were well-formed). All the UTC has to do is approve the set of VS sequences as a *standardized* way of doing this. Ad-hoc ligation is just going to lead to continued chaos, as well as continued dependence on differently-encoded ASCII fonts. Michael Everson From beckiergb at gmail.com Wed Apr 5 11:42:25 2017 From: beckiergb at gmail.com (Rebecca Bettencourt) Date: Wed, 5 Apr 2017 09:42:25 -0700 Subject: PETSCII mapping? In-Reply-To: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com> References: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com> Message-ID: On Wed, Apr 5, 2017 at 3:18 AM, Asmus Freytag wrote: > Unicode is not an archive of anything ever used on computers. > Why not?
Isn't one of Unicode's goals to support the conversion of documents using legacy character sets into Unicode? I do not understand why, say, the entire IBM PC character set is eligible for encoding, but not the entire Commodore 64 character set. Were there word processors on the Commodore 64 that allowed the input of PETSCII characters? Could documents written using that software demonstrate a need to encode those characters? What about instruction manuals, magazine articles, and program listings that used PETSCII characters in running text? Surely there must be more than enough examples for a computer as popular as the Commodore 64. -------------- next part -------------- An HTML attachment was scrubbed... URL: From wjgo_10009 at btinternet.com Wed Apr 5 11:28:04 2017 From: wjgo_10009 at btinternet.com (William_J_G Overington) Date: Wed, 5 Apr 2017 17:28:04 +0100 (BST) Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> Message-ID: <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> Asmus Freytag wrote: > There's no need to use a ZWJ, because there's no existing other use of a square before a chess piece that needs to be preserved. Well, whether there is a need to use a ZWJ or no need to use a ZWJ is not here the issue. Asmus wrote before: > > > .... - relying solely on ligatures has the benefit of not involving the UTC at all, therefore it could be implemented today without delay). I then asked, the question worded differently from how it is worded here, about whether UTC needs to be involved where a character sequence that contains one or more ZWJ characters generates a glyph with a meaning different from the meaning of the original sequence that did not have the one or more ZWJ characters included. 
For example, p ZWJ p produces a pp ligature with no change of meaning. For example, where WOMAN ZWJ ROCKET produces a glyph for a LADY ASTRONAUT, thus a change of meaning and I think that it went to UTC as there was a change of meaning but I am not congruently sure of that. SQUARE ZWJ CHESSPIECE or CHESSPIECE ZWJ SQUARE produces a CHESSPIECE ON A SQUARE, thus a change of meaning. So the question is not about the chess encoding but about the original comment that claimed " - relying solely on ligatures has the benefit of not involving the UTC at all, therefore it could be implemented today without delay).". > PS: I assume it's safe to ignore the rest of your message, being based on a wrong premise? Well, not a wrong premise. Actually the rest of the post was about other aspects as well as that question, including some text about my experience with a metal chess fount and a puzzle that I hope that you will enjoy. William Overington Wednesday 5 April 2017 From everson at evertype.com Wed Apr 5 12:29:33 2017 From: everson at evertype.com (Michael Everson) Date: Wed, 5 Apr 2017 18:29:33 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> Message-ID: <0FA171D1-97FD-4628-AAF5-40351B6034A7@evertype.com> On 5 Apr 2017, at 17:28, William_J_G Overington wrote: > Well, whether there is a need to use a ZWJ or no need to use a ZWJ is not here the issue. There isn't. We should use VS just as we do with maths and Myanmar characters.
> I then asked, the question worded differently from how it is worded here, about whether UTC needs to be involved where a character sequence that contains one or more ZWJ characters generates a glyph with a meaning different from the meaning of the original sequence that did not have the one or more ZWJ characters included. The proposal has been made for Standardized Variation Sequences. > For example, p ZWJ p produces a pp ligature with no change of meaning. A ZWJ is not necessary to produce a pp ligature. > For example, where WOMAN ZWJ ROCKET produces a glyph for a LADY ASTRONAUT, thus a change of meaning and I think that it went to UTC as there was a change of meaning but I am not congruently sure of that. That is a matter of emoji which is not "normal" symbol usage and is not really analogous to what we are discussing here. > SQUARE ZWJ CHESSPIECE or CHESSPIECE ZWJ SQUARE produces a CHESSPIECE ON A SQUARE, thus a change of meaning. No, it's not. CHESSPIECE is still CHESSPIECE. The glyph for CHESSPIECE needs to be altered in order to make it suitable to use the characters in a way which will permit the presentation and interchange of chessboard matrices. Michael Everson From verdy_p at wanadoo.fr Wed Apr 5 14:13:57 2017 From: verdy_p at wanadoo.fr (Philippe Verdy) Date: Wed, 5 Apr 2017 21:13:57 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> Message-ID: 2017-04-05 18:28 GMT+02:00 William_J_G Overington : > For example, where WOMAN ZWJ ROCKET produces a glyph for a LADY ASTRONAUT, > thus a change of meaning and I think that it went to UTC as there was a > change of meaning but I am not congruently sure of that.
> > SQUARE ZWJ CHESSPIECE or CHESSPIECE ZWJ SQUARE produces a CHESSPIECE ON A > SQUARE, thus a change of meaning. > You're right here. The absence of ZWJ clearly means separate symbols side by side (whether they will align vertically or match their metrics is not relevant here), but we already see that this is a problem for displaying actual boards with the "method" proposed by Michael Everson for use in plain text, which just looks to me like only a hack (not a serious encoding proposal), just as if we were replacing all German sharp s letters by Greek beta letters, only because they more or less "look the same". You can perfectly have a board displayed beside normal text which may contain some chess pieces, not intended to combine with the surrounding board, even if both symbols may also appear side by side (with independent metrics) in text paragraphs. Given what has been encoded for other Emojis, ZWJ should be used between symbols that are supposed to combine visually (such as MAN+WOMAN). The encoding should still respect the logic, just like we do in normal scripts (independently of the fact they may have different visual ordering/layout, or could have similar glyphs properly disunified because of their needed distinct semantic properties). Note also that these "chess pieces" are not intended to be used only with chess, and various board types may be used (not only with square cells, for example there are rectangular ones or triangular for Shogi pieces in Japan, the cell colors also have their own meanings, and special boards may have their own cells changing colors to add other rules). Note that Shogi has other pieces with distinct semantics. The pieces are generally flat and can be turned to the other side to show their promotion.
Traditional pieces use cursive Kanjis, but there are modernised **variants** using linear glyph shapes, or westernized shapes with Latin letters or geometric symbols, or even reusing the chess pieces (including the Queen for the Gold General; or the King for the Jewel/Jade General/Master and for its "White" Challenger), but making distinctions between horses (horses-dragoons) and cavalry. When promoting using chess pieces, the promotion may be shown by placing the chess piece on top of a draught piece or coin/token. Coins/tokens are used to promote pawns (just stack two pieces as in the game of draughts). -------------- next part -------------- An HTML attachment was scrubbed... URL: From everson at evertype.com Wed Apr 5 14:32:44 2017 From: everson at evertype.com (Michael Everson) Date: Wed, 5 Apr 2017 20:32:44 +0100 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> Message-ID: <5F9BCF6C-D351-4DFF-A972-2B251B4282CF@evertype.com> It's wonderful that Mr Verdy opposes my proposal. I must be doing something right. On 5 Apr 2017, at 20:13, Philippe Verdy wrote: > 2017-04-05 18:28 GMT+02:00 William_J_G Overington : > For example, where WOMAN ZWJ ROCKET produces a glyph for a LADY ASTRONAUT, thus a change of meaning and I think that it went to UTC as there was a change of meaning but I am not congruently sure of that. > > SQUARE ZWJ CHESSPIECE or CHESSPIECE ZWJ SQUARE produces a CHESSPIECE ON A SQUARE, thus a change of meaning. > > You're right here. The absence of ZWJ clearly means separate symbols side by side Wrong. ZWJ has no particular directional semantics.
> (whether they will align vertically or match their metrics is not relevant here), but we already see that this is a problem for displaying actual boards with the "method" proposed by Michael Everson for use in plain text, I have no trouble whatsoever making use of the three prototype fonts which make use of variation selectors to set chessboards of various sizes and with pieces anywhere I need them to be. The proposal document clearly shows examples of the boards, set with the fonts using the substitutions I specify. What, then, is the problem for display? > which just looks to me like only a hack (not a serious encoding proposal), It is quite serious. It solves a long-standing problem which everyone has ignored. > just as if we were replacing all German sharp s letters by Greek beta letters, only because they more or less "look the same". Lovely! A completely random analogy that has nothing whatsoever to do with this proposal. > You can perfectly have a board displayed beside normal text which may contain some chess pieces, not intended to combine with the surrounding board, even if both symbols may also appear side by side (with independent metrics) in text paragraphs. Yes, Mr Verdy. That's just exactly what my proposal says. You can use one font, with some extra glyphs attained by use of VS, to set chesspieces in text and to set chessboards alongside them. All using Unicode characters, not competing ASCII encodings which prevent harmonization of chessboard data now. There's even an example of this in my proposal. Perhaps you didn't read it. Can you find the Figure I refer to? > Given what has been encoded for other Emojis, ZWJ should be used between symbols that are supposed to combine visually (such as MAN+WOMAN). Chess characters aren't emojis. > The encoding should still respect the logic, The logic of the use of VS in this proposal is no different from the logic used with them in maths, or in Myanmar, or even in some emoji.
> just like we do in normal scripts (independently of the fact they may have different visual ordering/layout, or could have similar glyphs properly disunified because of their needed distinct semantic properties). A pawn is a pawn is a pawn. Sometimes I need the glyph for a pawn to appear in a certain way in order to do something nice like set a chessboard. > Note also that these "chess pieces" are not intended to be used only with chess, If there are other uses which can be made of chess pieces, then those uses can be investigated in due course by someone interested in that. > and various board types may be used (not only with square cells, for example there are rectangular ones or triangular for Shogi pieces in Japan, Shogi is not chess. Shogi notation is not like chess notation, either. Try to focus on the actual proposal. > the cell colors also have their own meanings, and special boards may have their own cells changing colors to add other rules). Red herring. This has nothing to do with the PRIMARY USE of chess characters, which is inline in text to describe chess problems in various notations, and also to set chessboard diagrams. > Note that Shogi has other pieces with distinct semantics. Shogi isn't chess. > The pieces are generally flat and can be turned to the other side to show their promotion. Traditional pieces use cursive Kanjis, but there are modernised **variants** using linear glyph shapes, or westernized shapes with Latin letters or geometric symbols, or even reusing the chess pieces (including the Queen for the Gold General; or the King for the Jewel/Jade General/Master and for its "White" Challenger), but making distinctions between horses (horses-dragoons) and cavalry. When promoting using chess pieces, the promotion may be shown by placing the chess piece on top of a draught piece or coin/token. Coins/tokens are used to promote pawns (just stack two pieces as in the game of draughts). Shogi isn't chess.
I thank Mr Verdy for his defence of my proposal. Michael Everson From wjgo_10009 at btinternet.com Wed Apr 5 14:45:57 2017 From: wjgo_10009 at btinternet.com (William_J_G Overington) Date: Wed, 5 Apr 2017 20:45:57 +0100 (BST) Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <11942252.59673.1491420041683.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> References: <11942252.59673.1491420041683.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> Message-ID: <19941195.63023.1491421557775.JavaMail.defaultUser@defaultHost> >> As it happens, Quest text also has eight glyphs for producing a border, all eight being in the Private Use Area. They are rather ornate. They are at U+E5B0 through to U+E5B7. Michael Everson wrote: > They are there. I had to figure out how they should be used. They are put together in a very different way than the borders of any other font I have seen are. I am not sure, but I think he's intended to use them thus: > [Pic of the Looking-Glass board in William's font] > http://evertype.com/standards/unicode-list/overington-board.png Yes, Michael has set out the border as I intended it to be used. Thank you. > William's design is decidedly non-traditional, and not (to my eye) particularly easy to read, but it doesn't matter. The picture here shows his glyphs configured in exactly the same way as specified in my proposal. IT WORKS. (There are some hairline gaps in the border and the top left corner piece is a little less well aligned than one would want if one were preparing to ship the font.) Thank you for producing the picture. Yes, there are some hairline gaps in the border. It happens in some places when using the font in PagePlus X7 here: they appear to be rounding errors in the rendering system. Maybe I can try to make the glyphs for the two left side border corners and the upper and lower border horizontals each a bit wider than the advance width.
Line spacing a little less than it should be for the font size in the application program might stop any vertical hairlines without altering the font, if indeed altering the font vertically would work anyway and I am unsure at present whether it would or not. However, the issue with the top left corner piece is not a font issue and that issue does not occur when using the font with PagePlus X7. If I do alter the font, or make a variant version, then I will need to check what happens if glyphs overlap when producing a PDF document before finalizing anything. > Thank you for sharing your font, William. I'll send you the ttf of this one so you can tinker with glyph placement as you wish, if the proposal is accepted and the standardized variation sequences accepted. Thank you. William Overington Wednesday 5 April 2017 From verdy_p at wanadoo.fr Wed Apr 5 15:26:01 2017 From: verdy_p at wanadoo.fr (Philippe Verdy) Date: Wed, 5 Apr 2017 22:26:01 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: <5F9BCF6C-D351-4DFF-A972-2B251B4282CF@evertype.com> References: <4919039.28328.1491394049217.JavaMail.root@webmail43.bt.ext.cpcloud.co.uk> <30044668.29483.1491394966847.JavaMail.defaultUser@defaultHost> <10737369.47241.1491409684891.JavaMail.defaultUser@defaultHost> <5F9BCF6C-D351-4DFF-A972-2B251B4282CF@evertype.com> Message-ID: 2017-04-05 21:32 GMT+02:00 Michael Everson : > It's wonderful that Mr Verdy opposes my proposal. I must be doing > something right. > > On 5 Apr 2017, at 20:13, Philippe Verdy wrote: > > > 2017-04-05 18:28 GMT+02:00 William_J_G Overington < > wjgo_10009 at btinternet.com>: > > For example, where WOMAN ZWJ ROCKET produces a glyph for a LADY > ASTRONAUT, thus a change of meaning and I think that it went to UTC as > there was a change of meaning but I am not congruently sure of that. > > > > SQUARE ZWJ CHESSPIECE or CHESSPIECE ZWJ SQUARE produces a CHESSPIECE ON > A SQUARE, thus a change of meaning.
> > > > You're right here. The absence of ZWJ clearly means separate symbols > side by side > > Wrong. ZWJ has no particular directional semantics. > NO! I did not give any direction. Direction is a separate issue (if you mean the Bidi algorithm there). "Side by side" does not prohibit ligatures, but this is just like with letters "side by side", where ordering is defined independently. So I maintain what I replied. The **absence** of ZWJ clearly means separate symbols side by side (minus typographic/styling effects such as joining and **partial** overlays or kerning: this excludes overlays and complex ligatures that are in Unicode treated with separate encodings; partial overlays include kerning, or simple syllabic composition in 2D layouts for Hangul, Kana/Romaji squares, but excludes complex compositions for Hanzi/Kanji which are encoded specifically). -------------- next part -------------- An HTML attachment was scrubbed... URL: From beckiergb at gmail.com Wed Apr 5 15:35:45 2017 From: beckiergb at gmail.com (Rebecca Bettencourt) Date: Wed, 5 Apr 2017 13:35:45 -0700 Subject: PETSCII mapping? In-Reply-To: References: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com> Message-ID: You can find charts of complete PETSCII character sets here: http://www.kreativekorp.com/software/fonts/c64.shtml The missing characters are a handful of block elements: upper fractional blocks (Unicode only has lower), halves of MEDIUM SHADE, checkerboards and diagonals. I can put together a unified chart, with mappings to Unicode where they exist. In fact I think I'll do that. :) I'm all willing to help put together a proposal for encoding missing block element characters, but I would need other people to a) gather evidence of use in plain text and b) write up the proposal in Unicode's formal language since I've never proposed characters to Unicode before. (Additionally, I wonder if we could find evidence of the Apple II's or TRS-80's characters in use in plain text as well.
Not necessarily saying those should be encoded as well, just that we should investigate.) -- Rebecca Bettencourt On Wed, Apr 5, 2017 at 12:47 PM, Murray Sargent < murrays at exchange.microsoft.com> wrote: > What PETSCII characters aren't already in Unicode? A couple geometric > symbols? Looks mostly like a simple codepage translation. > > > > Murray > > > > *From:* Unicode [mailto:unicode-bounces at unicode.org] * On Behalf Of *Rebecca > Bettencourt > *Sent:* Wednesday, April 5, 2017 9:42 AM > *To:* Asmus Freytag > *Cc:* unicode > *Subject:* Re: PETSCII mapping? > > > > On Wed, Apr 5, 2017 at 3:18 AM, Asmus Freytag > wrote: > > Unicode is not an archive of anything ever used on computers. > > > > Why not? Isn't one of Unicode's goals to support the conversion of > documents using legacy character sets into Unicode? I do not understand > why, say, the entire IBM PC character set is eligible for encoding, but not > the entire Commodore 64 character set. > > > > > > Were there word processors on the Commodore 64 that allowed the input of > PETSCII characters? Could documents written using that software demonstrate > a need to encode those characters? What about instruction manuals, magazine > articles, and program listings that used PETSCII characters in running > text? Surely there must be more than enough examples for a computer as > popular as the Commodore 64. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From kent.karlsson14 at telia.com Wed Apr 5 16:13:44 2017 From: kent.karlsson14 at telia.com (Kent Karlsson) Date: Wed, 05 Apr 2017 23:13:44 +0200 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: Message-ID: On 2017-04-05 16:48, "Michael Everson" wrote: Kent, I can't read this in a plain-text e-mail. Well, it was SUPPOSED to be explicit HTML code in the email. It was NOT the intent that the given example was to be rendered directly in the email (even if you have HTML emails enabled).
Further, I would write the code a bit differently, in order to easily be able to map your proposed encoding for (parts of) chessboards to HTML. But at this point I did not want to change the referenced example (written by someone posting to stackoverflow.com) in any significant way. So yes, if you want to see the result of the HTML code, paste the HTML code to a plain text editor, name the file you save it to "chess.html", and view that file in a browser. That display in turn may be cut and pasted to another document, depending on the capabilities of the app used to edit that other document. The paste may, admittedly, result in an awful and uneditable result. I agree that the HTML code is a bit of a mouthful (and I would also do it a bit differently), and also has the problem mentioned in the previous paragraph. Which is why I support your proposal, but with these modifications: - with the extra requirement to have VSs also for the border line drawing characters (to make them fit for drawing chess board borders, in a general purpose font), and - some bidi fix [preferably making the box/border drawing characters bidi "L", if possible; otherwise a caveat that if there is an expectation to paste in such a board into an RTL document, bidi controls need be used to LTR the board]. Nit: You sometimes seem to have made the line spacing slightly larger (like 2 points) than the character width. Should they not be exactly the same, to get the best (square) display of the chess boards? (Not that it is very visible, but a bit.) /Kent K PS I think the "ligatures" approach is a dead end. - As you mention, the fallback will have very different line lengths for the lines of a board display, and thus basically unreadable. - If ZWJ is not needed, one will need two *new* characters that (in some fonts) ligate with chess pieces. No existing character should ever ligate with chess pieces. - If ZWJ is needed, then one can use some existing characters as board squares.
- In either case, it is not clear (or obvious) which should come first, a chess piece or a board square. There will surely be mistakes, giving them in the wrong order (not a problem in your proposal). - My personal guesstimate is that there will be far fewer fonts that would implement the ligation (if that approach was to be chosen), than would implement the VS approach you are suggesting. Thus I support your proposal, since that gives: - Good fallback (readable, though ugly). - Fairly good display when the VS sequences are interpreted (and the font is otherwise reasonable), and "good" context (line height setting, not too short lines so that auto line breaking is avoided, ...). - Easier to machine parse than the ligatures approach; and MUCH easier to parse than an HTML version. - Easy to convert to (say) HTML for even better display in (say) HTML pages (CAN look much better, and NO dependence on line height setting or line width setting (or bidi direction derivations), but just that the table (for the board) is reasonably done). Den 2017-04-05 16:48, skrev "Michael Everson" : > Kent, I can't read this in a plain-text e-mail. I can't paste it into an > ordinary word-processor like Word as in my previous response to Markus, or in > Pages (left) or LibreOffice (right) as shown here. (I simply pasted in the > text from Word to each of those. It's odd to see that there is some variation > in displaying the text without selecting it and applying the correctly-configured > font to it, but when that's done, the correct display is given (modulo some > leading issues which I didn't focus on in either). > > The workaround you give is just that. It works. It's not usefully portable or > user-friendly, and as higher-level protocols go, it hasn't swept away all > competition for presenting chessboards. People use ASCII or MS Symbol-based > fonts not even with any Unicode characters in them.
From 637275 at gmail.com Wed Apr 5 16:25:52 2017 From: 637275 at gmail.com (Rebecca T) Date: Wed, 5 Apr 2017 17:25:52 -0400 Subject: PETSCII mapping? In-Reply-To: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com> References: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com> Message-ID: > If there's a credible need to convert files between Unicode-based systems and > those using PETSCII There is! It's called "sharing textual information" and it's how our society functions. Can we afford to blithely abandon data from the best selling computer in history [1] because nobody cared to standardize its? > A similar scenario might exist if C64 emulators run on Unicode-based systems > were a widespread phenomenon They do! Even last month, there was a PETSCII directory-art contest. [2] A bit off-topic, but: As time goes on, "not in widespread use" will become a flimsier and flimsier argument against inclusion. Why isn't there a larger community of PETSCII enthusiasts? Partially because the only way to share PETSCII is through images! The consortium (passively or actively) prevents communication through exclusion and then uses the lack of communication as a justification against inclusion; it's a poor, tautological argument, and it won't serve the consortium long-term. Simply put, we need new criteria for inclusion: as the vast majority of the world's systems (from written communication in text messages to the manuscripts of all new books) are already Unicode-based, we can no longer rely on a character's existing presence outside of Unicode as a signal to warrant inclusion; we must weigh a character's merits and usability on its own. (Does it fill a gap in communication? Will it be used?) [1]: http://www.cnn.com/2011/TECH/gaming.gadgets/05/09/commodore.64.reborn/ [2]: http://csdb.dk/event/?id=2558
From richard.wordingham at ntlworld.com Wed Apr 5 16:48:17 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Wed, 5 Apr 2017 22:48:17 +0100 Subject: Coloured Punctuation and Annotation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID: <0A60DF73-289F-40E6-A232-C4CB2593B794@evertype.com> On 6 Apr 2017, at 00:12, James Kass wrote: > > Kent Karlsson wrote, > >> - with the extra requirement to have VSs also for the border line drawing characters (to make them fit for drawing chess board borders, in a general purpose font), and > > This doesn't seem necessary. A general purpose font modified to display the chess board in plain text in accordance with Michael Everson's proposal would be expected to use the same metrics as the box drawing glyphs for all of the VS-produced glyphs. A general purpose font *not* so modified would not be expected to display the chessboard in a perfect square, anyway. (Yet the display would still be legible.) Well. 1) A general purpose font that wanted to support chessboards as well as legacy graphic terminals would make use of VS for the border characters in order to be able to do both. 2) If we decided to standardize on that, we would have to burden chess-font designers with either a) learning how to draw graphic terminal characters correctly in their chess fonts along with the characters + VS for actual use, or b) ignoring graphic terminal character shapes and just pasting in the chess shapes to those code positions. Michael Everson From asmusf at ix.netcom.com Wed Apr 5 19:29:57 2017 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Wed, 5 Apr 2017 17:29:57 -0700 Subject: Coloured Punctuation and Annotation In-Reply-To: <0D5792F5-0D5E-47F4-BA9B-6FB4BC555BAA@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID: Rebecca Bettencourt wrote, > I can put together a unified chart, with mappings to Unicode where > they exist. In fact I think I'll do that. :) I hope you do. That would be a good starting point. > I'm all willing to help put together a proposal for encoding missing > block element characters, but I would need other people to a) gather > evidence of use in plain text and b) write up the proposal in Unicode's > formal language since I've never proposed characters to Unicode before. Even the most prolific of our proposers had to start someplace... > As time goes on, "not in widespread use" will become a flimsier > and flimsier argument against inclusion... Agreed. As arguments go, that one was never very robust. Best regards, James Kass From lokedhs at gmail.com Wed Apr 5 21:40:19 2017 From: lokedhs at gmail.com (=?UTF-8?Q?Elias_M=C3=A5rtenson?=) Date: Thu, 6 Apr 2017 10:40:19 +0800 Subject: PETSCII mapping? In-Reply-To: References: <38d70a68-aabe-a6d1-50cf-cbdf2f92b88f@ix.netcom.com>

Message-ID: On 6 April 2017 at 09:44, James Kass wrote: > Rebecca Bettencourt wrote, > > > I can put together a unified chart, with mappings to Unicode where > > they exist. In fact I think I'll do that. :) > > I hope you do. That would be a good starting point. > The Wikipedia page on PETSCII has a character map where the missing characters are highlighted. Based on my count, there are 31 missing symbols. Those should be reasonably simple to document and highlight. Do we also have to create an example font that includes these symbols? That seems to be what Michael Everson did for his chess notation proposal that I read recently. Then there is the issue of what to do with the text colour and style selectors. PETSCII has characters that indicate a colour change as well as reverse video. At least the reverse video one is important, as it's being used to construct new characters. For example, PETSCII only has a single character "half block" (top part filled). The way you represent a half block with the bottom part filled is to use the reverse video together with the former. It would probably make more sense to represent the reversed symbols as separate code points? Regards, Elias -------------- next part -------------- An HTML attachment was scrubbed... URL: From christoph.paeper at crissov.de Wed Apr 5 22:01:23 2017 From: christoph.paeper at crissov.de (=?UTF-8?Q?Christoph_P=C3=A4per?=) Date: Thu, 6 Apr 2017 05:01:23 +0200 (CEST) Subject: Emoji Compatibility Symbols In-Reply-To: References: Message-ID: <1947780010.43276.1491447683535.JavaMail.open-xchange@app06.ox.hosteurope.de> Charlotte Buff : > > That document was very helpful, but unfortunately many of the images are > missing. would fix that. 
From duerst at it.aoyama.ac.jp Wed Apr 5 22:24:21 2017 From: duerst at it.aoyama.ac.jp (=?UTF-8?Q?Martin_J._D=c3=bcrst?=) Date: Thu, 6 Apr 2017 12:24:21 +0900 Subject: Proposal to add standardized variation sequences for chess notation In-Reply-To: References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID: On 6 April 2017 at 09:44, James Kass wrote: > Rebecca Bettencourt wrote, > > > I can put together a unified chart, with mappings to Unicode where > > they exist. In fact I think I'll do that. :) > > I hope you do. That would be a good starting point. > I'm working on it! On Wed, Apr 5, 2017 at 7:40 PM, Elias Mårtenson wrote: > Do we also have to create an example font that includes these symbols? > That seems to be what Michael Everson did for his chess notation proposal > that I read recently. > We do have to provide Unicode with fonts, I believe. We can use an existing C64 font, such as Pet Me. Or, we can create a new font with vectorized versions of the characters. > Then there is the issue of what to do with the text colour and style > selectors. PETSCII has characters that indicate a colour change as well as > reverse video. At least the reverse video one is important, as it's being > used to construct new characters. For example, PETSCII only has a single > character "half block" (top part filled). The way you represent a half > block with the bottom part filled is to use the reverse video together with > the former. > > It would probably make more sense to represent the reversed symbols as > separate code points? > I would actually leave the color-change and reverse-video characters to a higher-level protocol. > > Regards, > Elias > From richard.wordingham at ntlworld.com Wed Apr 5 23:41:07 2017 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Thu, 6 Apr 2017 05:41:07 +0100 Subject: Coloured Punctuation and Annotation In-Reply-To: <9FCA9B1F-00D7-459E-8567-59589609A708@evertype.com> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

Message-ID: The Wikipedia page for PETSCII [1] only marks 20 characters as not having Unicode equivalents; 2px (light) and 3px (heavy) horizontal and vertical bars at various non-center positions, diagonal shading characters, and corner characters. I?ve done some processing to the table on [1] to filter out the missing characters ? their exact codepoints and descriptions can be found in [2]. These characters are highlighted in red in the attached image (green characters are also missing but are duplicates of other characters in the chart), and marked by U+FFFD ? in the compact table [3]. The box-drawing characters seem to semantically represent lines (boxes) and the block elements seem to represent shapes and shades; this makes $7c, $7f, $a7, $a8, $a9, $b6, $b7, and $b8 block elements and the rest box-drawing characters. [1]: https://en.m.wikipedia.org/wiki/PETSCII [2]: https://github.com/9999years/Unicode-PETSCII/blob/master/new.txt [3]: https://github.com/9999years/Unicode-PETSCII/blob/master/graphic-table.txt [image: Inline image 1] On Wed, Apr 5, 2017 at 11:32 PM, Rebecca Bettencourt wrote: > On 6 April 2017 at 09:44, James Kass wrote: > >> Rebecca Bettencourt wrote, >> >> > I can put together a unified chart, with mappings to Unicode where >> > they exist. In fact I think I'll do that. :) >> >> I hope you do. That would be a good starting point. >> > > I'm working on it! > > On Wed, Apr 5, 2017 at 7:40 PM, Elias M?rtenson wrote: > >> Do we also have to create an example font that includes these symbols? >> That seems to be what Michael Everson did for his chess notation proposal >> that I read recently. >> > > We do have to provide Unicode with fonts, I believe. We can use an > existing C64 font, such as Pet Me. Or, we can create a new font with > vectorized versions of the characters. > > >> Then there is the issue of what to do with the text colour and style >> selectors. PETSCII has characters that indicate a colour change as well as >> reverse video. 
At least the reverse video one is important, as it's being >> used to construct new characters. For example, PETSCII only has a single >> character "half block" (top part filled). The way you represent a half >> block with the bottom part filled is to use the reverse video together with >> the former. >> >> It would probably make more sense to represent the reversed symbols as >> separate code points? >> > > I would actually leave the color-change and reverse-video characters to a > higher-level protocol. > > >> >> Regards, >> Elias >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: new-characters.png Type: image/png Size: 21904 bytes Desc: not available URL: From irgendeinbenutzername at gmail.com Thu Apr 6 00:14:49 2017 From: irgendeinbenutzername at gmail.com (Charlotte Buff) Date: Thu, 6 Apr 2017 07:14:49 +0200 Subject: PETSCII mapping? Message-ID: Rebecca Bettencourt wrote: > I'm all willing to help put together a proposal for encoding missing block > element characters, but I would need other people to a) gather evidence of > use in plain text and b) write up the proposal in Unicode's formal language > since I've never proposed characters to Unicode before. I'm in the process of preparing a proposal for several old character sets, one of them PETSCII. At the moment I am still mostly concerned with analyzing the sets and determining which characters can sensibly be unified with existing ones and how to best structure the included repertoire. Currently I am quite busy with university stuff so things progress rather slowly, though. -------------- next part -------------- An HTML attachment was scrubbed... 
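The reverse-video question quoted above (Elias Mårtenson's point about building a bottom-filled half block from the top-filled one) can be sketched in terms of characters Unicode already has. This is only an illustration under stated assumptions: the name `apply_reverse` and the table `REVERSED` are hypothetical, no actual PETSCII code values are used, and only a few block elements whose complement is itself an encoded character are listed.

```python
from typing import Optional

# Pairs of existing Unicode characters that are each other's reverse-video
# image. The selection is illustrative, not a complete PETSCII mapping.
_PAIRS = [
    ("\u2580", "\u2584"),  # UPPER HALF BLOCK <-> LOWER HALF BLOCK
    ("\u258C", "\u2590"),  # LEFT HALF BLOCK  <-> RIGHT HALF BLOCK
    ("\u2588", " "),       # FULL BLOCK       <-> SPACE
]

# Build a symmetric lookup table so each glyph maps to its complement.
REVERSED = {}
for a, b in _PAIRS:
    REVERSED[a] = b
    REVERSED[b] = a


def apply_reverse(ch: str) -> Optional[str]:
    """Return the plain-text complement of ch, or None if there is none.

    None signals that the reversed form cannot be expressed as a single
    listed character and would need a higher-level protocol (or a new
    code point) instead.
    """
    return REVERSED.get(ch)
```

With this table, a reversed upper half block resolves to the lower half block, while a reversed letter returns `None`, which is the case the thread proposes leaving to a higher-level protocol.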
URL: From 637275 at gmail.com Thu Apr 6 00:19:42 2017 From: 637275 at gmail.com (Rebecca T) Date: Thu, 6 Apr 2017 01:19:42 -0400 Subject: Coloured Punctuation and Annotation In-Reply-To: <20170406054107.20e40bd5@JRWUBU2> References: <4783FDDC-4F0B-4FE2-ABCA-09A09884C011@evertype.com>

♜	♞	♝	♛	♚	♝	♞	♜
♟	♟	♟	♟	♟	♟	♟	♟




♙	♙	♙	♙	♙	♙	♙	♙
♖	♘	♗	♕	♔	♗	♘	♖
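Kent Karlsson's remark that a plain-text board is "easy to convert to (say) HTML" can be illustrated with a small sketch. It assumes the board arrives as in the listing above: one rank per line, pieces separated by tabs, and empty lines for empty ranks. The function name `board_to_html` and the CSS class names `chess`, `light` and `dark` are hypothetical, and the variation selectors from the proposal are not handled here.

```python
from html import escape


def board_to_html(text: str, size: int = 8) -> str:
    """Render a tab-separated board (one rank per line) as an HTML table."""
    ranks = text.rstrip("\n").split("\n")
    ranks += [""] * (size - len(ranks))      # pad missing trailing ranks
    rows = []
    for r, rank in enumerate(ranks[:size]):
        cells = rank.split("\t") if rank else []
        cells += [""] * (size - len(cells))  # pad missing files
        tds = []
        for f, piece in enumerate(cells[:size]):
            # The top-left square (a8) is light; shades alternate from there.
            shade = "light" if (r + f) % 2 == 0 else "dark"
            tds.append('<td class="%s">%s</td>' % (shade, escape(piece)))
        rows.append("<tr>" + "".join(tds) + "</tr>")
    return '<table class="chess">' + "".join(rows) + "</table>"
```

Square colouring is left to a stylesheet keyed on the two class names, so the result does not depend on line height or line width in the way the plain-text display does.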
