<div dir="ltr"><div dir="ltr">Martin J. Dürst (<a href="mailto:duerst@it.aoyama.ac.jp">duerst@it.aoyama.ac.jp</a>) wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hello Jukka, others,<br>
<br>
On 2021/03/18 17:20, Jukka K. Korpela via Unicode wrote:<br>
> Tex (<a href="mailto:textexin@xencraft.com" target="_blank">textexin@xencraft.com</a>) wrote:<br>
<br>
>> However, you are quoting a doc that has been withdrawn.<br>
<br>
> It’s a pity that this well-written and useful document was withdrawn, for<br>
> reasons I don’t understand.<br>
<br>
Here are the main reasons, as far as I understand them. Unicode gets <br>
updated roughly once a year, and Web technology also changes over time. <br>
There was not enough manpower to keep the document up to date.<br>
<br>
In addition, the document was always a kind of tug-of-war between those <br>
who pushed for more favorable descriptions of specific Unicode <br>
characters (such as ⁴ in this discussion) or more favorable descriptions <br>
of markup-based and style-based solutions (such as <sup></sup>).</blockquote><div><br></div><div>Thank you for the description. These opposite views surely reflected different needs, such as the need to represent data in plain text in some contexts and the need for more structured representation.</div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Well, an then somebody else uses 10<sup>3.5</sup> somewhere. How are you <br>
going to express this so that it doesn't turn into 103.5 in plain text? <br>
The problem is that there is always a limit somewhere for plain text.</blockquote><div><br></div><div>Well, in the given case, it might help if we had IMPLIED EXPONENTIATION (we don’t; we have IMPLIED TIMES, but it does not help here); at least it would appear in text data to indicate that adjacent digits are not part of the same number.</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"> <br>
There is also always a limit somewhere for markup and styled rendering, <br>
but it's in a quite different place.<br></blockquote><div><br></div><div>Regarding exponents, the limit is currently set by the presence of superscript characters for digits, plus, and minus, and (for some reason), =, (, ), and n. This covers most of the cases where one might consider using superscripts in general texts and in expressing values of quantities.</div><div> <br>But when you have, say, text that contains the simple expression <i>ax </i>with <i>x</i> as a superscript denoting exponent there is no satisfactory way to represent it in plain text. Using just ax would mean using a wrong expression, and using aˣ (with U+02E3 MODIFIER LETTER SMALL X) would be too tricky. Unicode hasn’t got a repertoire of superscript Latin letters even though they are often used as semantically different from normal letters; it only has some of such letters, apparently meant for special uses only (like phonetic symbols).</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>Out of the box rendering of <sup> and <sub> may be rather crude, but I <br>
guess it should be possible to do a lot better with some dose of CSS and <br>
possibly some Web fonts.<br></blockquote><div><br></div><div>In a sense, it would be straightforward to map, say, <sup>2</sup> to SUPERSCRIPT TWO in the rendering phase, either directly at the character level or via glyph selection when an OpenType font is used. In another sense, it would be complicated, since we hardly want to have <sup>2</sup> rendered substantially different from <sup>x</sup> in style. So the mapping should take place only when the entire document contains only such <sup> elements where are characters have superscript counterparts in Unicode (or at the glyph level).<br><br>Jukka </div></div></div>