From aprilop at fn.de Tue Nov 1 08:19:34 2022 From: aprilop at fn.de (Andreas Prilop) Date: Tue, 01 Nov 2022 13:19:34 +0000 Subject: U+2212 Minus Sign in Computer Languages? Message-ID: Is there any computer language that uses U+2212 as minus sign? From abrahamgross at disroot.org Tue Nov 1 08:21:08 2022 From: abrahamgross at disroot.org (ag disroot) Date: Tue, 1 Nov 2022 09:21:08 -0400 (EDT) Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: Perl 6 / raku probably From marius.spix at web.de Tue Nov 1 10:04:06 2022 From: marius.spix at web.de (Marius Spix) Date: Tue, 1 Nov 2022 16:04:06 +0100 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: <20221101160406.09e5d01a@spixxi> I tried it in Perl 5, where it does not work. However, it works in APL as monadic negation sign. Am Tue, 1 Nov 2022 09:21:08 -0400 (EDT) schrieb ag disroot via Unicode : > Perl 6 / raku probably From abrahamgross at disroot.org Tue Nov 1 10:15:58 2022 From: abrahamgross at disroot.org (ag disroot) Date: Tue, 1 Nov 2022 11:15:58 -0400 (EDT) Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: <20221101160406.09e5d01a@spixxi> References: <20221101160406.09e5d01a@spixxi> Message-ID: Perl 5 and Perl 6 are two completely different programming languages. Perl 6 is the only programming language that properly supports Unicode From wunnakoko at gmail.com Wed Nov 2 08:40:24 2022 From: wunnakoko at gmail.com (Wunna Ko) Date: Wed, 2 Nov 2022 09:40:24 -0400 Subject: Encoding of Text in the Myanmar Script In-Reply-To: <20221028231048.6db1bf66@JRWUBU2> References: <20221026030249.6807ed65@JRWUBU2> <20221028231048.6db1bf66@JRWUBU2> Message-ID: Richard, Sorry for the slow response. My key question is, has the Government ruled which is correct? - Short answer : NO As a mother tongue user of Burmese, is a common combination but is not. But there are some users who might use in the place of . For and , I have not seen these as common combinations used in recent years although both of these are valid combinations. On Fri, Oct 28, 2022 at 6:10 PM Richard Wordingham < richard.wordingham at ntlworld.com> wrote: > On Fri, 28 Oct 2022 12:54:52 -0400 > Wunna Ko wrote: > > > Richard, > > > > There is no official Standard in Myanmar. There is no official > > encoding as well. But the Burmese users generally accepted Unicode as > > a standard. > > > > Not sure what you are referring to when you mention TUS encoding. > > The TUS encoding for Modern Burmese is Table 16-4 in the Unicode > Standard, currently (Version 15.0) on p673, in Chapter 16. Now, if one > applies those rules to the _Mon_ word generally transliterated as > 'to choose', one gets . However if one > applies the Microsoft Typography rules, one gets . > One renders properly, while the other gets a dashed circle inserted. > Safari on iPhone and Edge on Windows 10 render different ones properly. > My key question is, has the Government ruled which is correct? > > There's a similar issue with the sequence v. AI, SIGN AA>, but slightly different - Apple rejects the first, HarfBuzz > accepts both (or a font corrects the renderer), and I think UTN-11 > Version 4 is meant to reject the second as it accepts the first, but it > is not mentioned in Footnote 3 to the table on pp6-7. This is also a > nasty one, as it is not beyond the wit of man to ascribe different > renderings and even meanings to the two sequences. > > Richard. > > > -- Wunna Ko -------------- next part -------------- An HTML attachment was scrubbed... URL: From timpart at perdix.ndonet.com Wed Nov 2 20:46:20 2022 From: timpart at perdix.ndonet.com (Tim Partridge) Date: Thu, 3 Nov 2022 01:46:20 +0000 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: According to Wikipedia, APL uses U+2122 for negation and subtraction. https://en.wikipedia.org/wiki/APL_syntax_and_symbols It references https://www.math.uwaterloo.ca/~ljdickey/apl-rep/n1.html I suspect most programming languages follow the unification of hyphen and minus on typewriter keyboards which led to early character standards doing the same. Also number formatting and parsing routines use the dual use character for negative numbers and tend not to recognise U+2122. Tim ________________________________ From: Unicode on behalf of Andreas Prilop via Unicode Sent: 01 November 2022 13:19 To: unicode at corp.unicode.org Subject: U+2212 Minus Sign in Computer Languages? Is there any computer language that uses U+2212 as minus sign? -------------- next part -------------- An HTML attachment was scrubbed... URL: From alex.plantema at xs4all.nl Thu Nov 3 04:27:23 2022 From: alex.plantema at xs4all.nl (Alex Plantema) Date: Thu, 3 Nov 2022 10:27:23 +0100 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: Op do 03-11-2022 om 02:46 schreef Tim Partridge via Unicode: > I suspect most programming languages follow the unification of hyphen and minus on typewriter keyboards which led to early character standards doing the same. > > Also number formatting and parsing routines use the dual use character for negative numbers and tend not to recognise U+2122. > Such a unification isn't possible in postfix notation, unless negations are replaced by subtractions from zero. https://stackoverflow.com/questions/46861254/infix-to-postfix-for-negative-numbers -- Alex. -------------- next part -------------- An HTML attachment was scrubbed... URL: From frederic.grosshans at gmail.com Thu Nov 3 09:23:51 2022 From: frederic.grosshans at gmail.com (=?UTF-8?Q?Fr=c3=a9d=c3=a9ric_Grosshans?=) Date: Thu, 3 Nov 2022 15:23:51 +0100 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: Le 01/11/2022 ? 14:19, Andreas Prilop via Unicode a ?crit?: > Is there any computer language that uses U+2212 as minus sign? > It works in Julia: julia> 3?2 1 From asmusf at ix.netcom.com Thu Nov 3 11:54:14 2022 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Thu, 3 Nov 2022 09:54:14 -0700 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: <3f6c56b3-c92b-b955-a485-73df52815029@ix.netcom.com> No program should recognize U+2122 (sic) for negation. However, they might accept U+2212... A./ On 11/2/2022 6:46 PM, Tim Partridge via Unicode wrote: > According to Wikipedia, APL uses U+2122 for negation and subtraction. > https://en.wikipedia.org/wiki/APL_syntax_and_symbols > It references > https://www.math.uwaterloo.ca/~ljdickey/apl-rep/n1.html > > > I suspect most programming languages follow the unification of hyphen > and minus on typewriter keyboards which led to early character > standards doing the same. > > Also number formatting and parsing routines use the dual use character > for negative numbers and tend not to recognise U+2122. > > ?? Tim > ------------------------------------------------------------------------ > *From:* Unicode on behalf of > Andreas Prilop via Unicode > *Sent:* 01 November 2022 13:19 > *To:* unicode at corp.unicode.org > *Subject:* U+2212 Minus Sign in Computer Languages? > Is there any computer language that uses U+2212 as minus sign? > -------------- next part -------------- An HTML attachment was scrubbed... URL: From doug at ewellic.org Thu Nov 3 12:32:22 2022 From: doug at ewellic.org (Doug Ewell) Date: Thu, 3 Nov 2022 17:32:22 +0000 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: <3f6c56b3-c92b-b955-a485-73df52815029@ix.netcom.com> References: <3f6c56b3-c92b-b955-a485-73df52815029@ix.netcom.com> Message-ID: Asmus Freytag wrote: > No program should recognize U+2122 (sic) for negation. However, they > might accept U+2212... TM = "Treat as Minus" -- Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org From mark at kli.org Thu Nov 3 20:07:34 2022 From: mark at kli.org (Mark E. Shoulson) Date: Thu, 3 Nov 2022 21:07:34 -0400 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: Message-ID: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> Well, see, now there are *three* different meanings under discussion.? There's the unary operator and the binary operator (which most programming languages unify but which cannot be unified in postfix notation, as you say), and also the negative number syntax.? APL, iirc, does indeed use "-" for both subtraction and negation, i.e. both operators, but the high-minus was *not* an operator, it was part of a numeric literal, it's how you wrote "negative three" (as opposed to writing "the negation of three.")? Given APL's strict (lack of) operator precedence, it could be inconvenient to have to write negative numbers as operators applied to positive numbers, but a syntax element doesn't have that issue.? That's why I compared it to Lojban's {ni'u} as opposed to {vu'u}. ~mark On 11/3/22 05:27, Alex Plantema via Unicode wrote: > Op do 03-11-2022 om 02:46 schreef Tim Partridge via Unicode: >> I suspect most programming languages follow the unification of hyphen >> and minus on typewriter keyboards which led to early character >> standards doing the same. >> >> Also number formatting and parsing routines use the dual use >> character for negative numbers and tend not to recognise U+2122. >> > Such a unification isn't possible in postfix notation, unless > negations are replaced by subtractions from zero. > https://stackoverflow.com/questions/46861254/infix-to-postfix-for-negative-numbers > -- > Alex. -------------- next part -------------- An HTML attachment was scrubbed... URL: From waltertross at gmail.com Fri Nov 4 06:34:19 2022 From: waltertross at gmail.com (Walter Tross) Date: Fri, 4 Nov 2022 12:34:19 +0100 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> References: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> Message-ID: FWIW: ECMAScript (aka JavaScript) 2022, Python 3.8, Java 17: all NO, for all 3 meanings (unary/binary operator, negative number) On Fri, Nov 4, 2022 at 2:11 AM Mark E. Shoulson via Unicode < unicode at corp.unicode.org> wrote: > Well, see, now there are *three* different meanings under discussion. > There's the unary operator and the binary operator (which most programming > languages unify but which cannot be unified in postfix notation, as you > say), and also the negative number syntax. APL, iirc, does indeed use "-" > for both subtraction and negation, i.e. both operators, but the high-minus > was *not* an operator, it was part of a numeric literal, it's how you wrote > "negative three" (as opposed to writing "the negation of three.") Given > APL's strict (lack of) operator precedence, it could be inconvenient to > have to write negative numbers as operators applied to positive numbers, > but a syntax element doesn't have that issue. That's why I compared it to > Lojban's {ni'u} as opposed to {vu'u}. > > > ~mark > > > On 11/3/22 05:27, Alex Plantema via Unicode wrote: > > Op do 03-11-2022 om 02:46 schreef Tim Partridge via Unicode: > > I suspect most programming languages follow the unification of hyphen and > minus on typewriter keyboards which led to early character standards doing > the same. > > Also number formatting and parsing routines use the dual use character for > negative numbers and tend not to recognise U+2122. > > Such a unification isn't possible in postfix notation, unless negations > are replaced by subtractions from zero. > > https://stackoverflow.com/questions/46861254/infix-to-postfix-for-negative-numbers > > -- > Alex. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cloos at jhcloos.com Tue Nov 8 15:22:30 2022 From: cloos at jhcloos.com (James Cloos) Date: Tue, 08 Nov 2022 16:22:30 -0500 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> (Mark E. Shoulson via Unicode's message of "Thu, 3 Nov 2022 21:07:34 -0400") References: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> Message-ID: On the subjest of languages recognizing #\MINUS_SIGN (?), what about #\MULTIPLICATION_SIGN (?), #\DIVISION_SIGN (?), #\PLUS-MINUS_SIGN (?), #\MINUS-OR-PLUS_SIGN (?), and a few others, such as #\INFINITY (?) and the like? Just how quickly is the UCS perfusing programming? -JimC -- James Cloos OpenPGP: 0x997A9F17ED7DAEA6 From beckiergb at gmail.com Wed Nov 9 23:26:29 2022 From: beckiergb at gmail.com (Rebecca Bettencourt) Date: Wed, 9 Nov 2022 21:26:29 -0800 Subject: U+2212 Minus Sign in Computer Languages? In-Reply-To: References: <4a112351-a6a1-60d7-ce32-80b35363065e@shoulson.com> Message-ID: HyperTalk (and its offspring AppleScript) supported ?, ?, and ?. (It did not support ? or ?, despite them existing in the MacRoman character set.) -- Rebecca Bettencourt On Tue, Nov 8, 2022 at 1:26 PM James Cloos via Unicode < unicode at corp.unicode.org> wrote: > On the subjest of languages recognizing #\MINUS_SIGN (?), > what about #\MULTIPLICATION_SIGN (?), #\DIVISION_SIGN (?), > #\PLUS-MINUS_SIGN (?), #\MINUS-OR-PLUS_SIGN (?), and a > few others, such as #\INFINITY (?) and the like? > > Just how quickly is the UCS perfusing programming? > > -JimC > -- > James Cloos OpenPGP: 0x997A9F17ED7DAEA6 > -------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.wordingham at ntlworld.com Tue Nov 15 18:36:10 2022 From: richard.wordingham at ntlworld.com (Richard Wordingham) Date: Wed, 16 Nov 2022 00:36:10 +0000 Subject: UTN-11 Normalisation Algorithm Message-ID: <20221116003610.7e8279fe@JRWUBU2> What is the licensing status of the code in UTN #11 to 'normalise' Myanmar codepoint sequences? The description of the ordering rules have combining classes that work similarly to Unicode canonical normalisation, and the algorithm reorganises characters between those with combining class 0 to find a sequence that complies with the ordering requirements. The solution is reported to be unique, but need not exist. The algorithm is: 1) Stably sort the characters by combining class. 2) Fix up the positions of ASAT, SIGN AI and ANUSVARA. The code presented for Part 2 seems to be a misleadingly apparently general way of handling what is a set of special cases. (Art that conceals labour.) I therefore fear that even a translation to a different language would be a derived work under copyright law. So, is a license available for using or changing this code? I am asking because I would like to convert some Mon text from a Unicode-like encoding to a reasonable Unicode encoding. One part of my code will obviously be dependent on the probably font-specific encoding used. I will realise no income from this activity. Incidentally, as there need not be a reordering compliant with the UTN #11 ordering requirements, I presume there is no definitive reorganisation in such cases, and that the term 'normalisation' is actually a misnomer. Richard. From liste at secarica.ro Sat Nov 26 16:51:24 2022 From: liste at secarica.ro (Cristian =?UTF-8?Q?Secar=C4=83?=) Date: Sun, 27 Nov 2022 00:51:24 +0200 Subject: what is the purpose of U+2012 ? Message-ID: <20221127005124.00003aa9@secarica.ro> I can not find a place that describes at least a generic purpose of the U+2012 (figure dash) character. What is/was its purpose at the time it was introduced ? I am aware and/or can find a reason for a lot of other "dashes", but not about this one. These are other dashes that I can explain their purpose or even use frequently a few of them: U+002D (hyphen-minus) U+00AD (soft hyphen) U+2010 (hyphen) U+2011 (non-breaking hyphen) U+2013 (en dash) U+2014 (em dash) U+2015 (horizontal bar) U+2043 (hyphen bullet) U+2212 (minus sign) What is U+2012 for ? The term "figure" means nothing to me (but forgive the ignorance if might be otherwise obvious). Cristi -- Cristian Secar? https://www.secarica.ro From jukkakk at gmail.com Sat Nov 26 17:24:37 2022 From: jukkakk at gmail.com (Jukka K. Korpela) Date: Sun, 27 Nov 2022 01:24:37 +0200 Subject: what is the purpose of U+2012 ? In-Reply-To: <20221127005124.00003aa9@secarica.ro> References: <20221127005124.00003aa9@secarica.ro> Message-ID: Cristian Secar? via Unicode wrote: > I can not find a place that describes at least a generic purpose of the > U+2012 (figure dash) character. The Unicode Standard says, in chapter 6: ?U+2012 figure dash has the same (ambiguous) semantic as the U+002D hyphen-minus, but has the same width as digits (if they are monospaced).? So ?figure? means ?digit? here, and U+2012 is suitable for contexts where data containing digits and dashes should be rendered so that a dash takes the same width as a digit. Yucca, https://jkorpela.fi > -------------- next part -------------- An HTML attachment was scrubbed... URL: From aprilop at fn.de Sun Nov 27 01:37:45 2022 From: aprilop at fn.de (Andreas Prilop) Date: Sun, 27 Nov 2022 07:37:45 +0000 Subject: what is the purpose of U+2012 ? In-Reply-To: <20221127005124.00003aa9@secarica.ro> References: <20221127005124.00003aa9@secarica.ro> Message-ID: <5BBE1A39-72CC-4CE5-BAFA-322F7C713FFA@fn.de> On 26 November 2022, Cristian Secar? wrote: > I can not find https://www.google.ro/search?q=%22figure+dash%22 leads to https://en.m.wiktionary.org/wiki/figure_dash https://en.m.wikipedia.org/wiki/Dash#Figure_dash From liste at secarica.ro Sun Nov 27 07:10:02 2022 From: liste at secarica.ro (Cristian =?UTF-8?Q?Secar=C4=83?=) Date: Sun, 27 Nov 2022 15:10:02 +0200 Subject: what is the purpose of U+2012 ? In-Reply-To: <5BBE1A39-72CC-4CE5-BAFA-322F7C713FFA@fn.de> References: <20221127005124.00003aa9@secarica.ro> <5BBE1A39-72CC-4CE5-BAFA-322F7C713FFA@fn.de> Message-ID: <20221127151002.00007781@secarica.ro> ?n data de Sun, 27 Nov 2022 07:37:45 +0000, Andreas Prilop via Unicode a scris: > On 26 November 2022, Cristian Secar? wrote: > > > I can not find > > https://www.google.ro/search?q=%22figure+dash%22 Well, I only searched for unicode u+2012, without using narrative ? sorry I didn't think of that. Cristi -- Cristian Secar? https://www.secarica.ro From jknappen at web.de Wed Nov 30 12:46:28 2022 From: jknappen at web.de (=?UTF-8?Q?J=C3=B6rg_Knappen?=) Date: Wed, 30 Nov 2022 19:46:28 +0100 Subject: Aw: Re: what is the purpose of U+2012 ? In-Reply-To: <20221127151002.00007781@secarica.ro> References: <20221127005124.00003aa9@secarica.ro> <5BBE1A39-72CC-4CE5-BAFA-322F7C713FFA@fn.de> <20221127151002.00007781@secarica.ro> Message-ID: An HTML attachment was scrubbed... URL: From asmusf at ix.netcom.com Wed Nov 30 13:14:17 2022 From: asmusf at ix.netcom.com (Asmus Freytag) Date: Wed, 30 Nov 2022 11:14:17 -0800 Subject: Aw: Re: what is the purpose of U+2012 ? In-Reply-To: References: <20221127005124.00003aa9@secarica.ro> <5BBE1A39-72CC-4CE5-BAFA-322F7C713FFA@fn.de> <20221127151002.00007781@secarica.ro> Message-ID: On 11/30/2022 10:46 AM, J?rg Knappen via Unicode wrote: > this character > seems somewhat redundant. Which is neither here nor there. It's been on the books for over 30 years and before that presumably found in earlier character sets. The presumption would be that there are documents that use it, and that they depend on the properties as stated. That means, some fonts will continue to support it, allowing users to decide to use it for new documents, which then strengthens the need to have continuing font support. That said, it may reflect a typographic practice that isn't widely used today, but perhaps still more widely used than many a symbol from an archaic script? A./ -------------- next part -------------- An HTML attachment was scrubbed... URL: