Teletext separated mosaic graphics
William_J_G Overington
wjgo_10009 at btinternet.com
Fri Oct 9 10:00:31 CDT 2020
Mark E. Shoulson wrote:
> Isn't that kind of what the Control Pictures block (U+2400) is?
…
> So if you're willing to do that, go ahead and create a program or
> protocol that takes the ordinary graphic characters U+2400 through
> U+2426 and handles them in some special way, creating a new layer on
> top of plain text.
Thank you for the suggestion.
One could indeed use twenty-seven of the characters in the range U+2400
.. U+241F in that manner.
As a short-term solution it is, in my opinion, a bit better than using a
Private Use Area solution, much better than using twenty-seven codes
from the range U+0080 .. U+009F and very much better than using
twenty-seven codes from the range U+0000 . U+001F.
However, for long-term storage and archiving of teletext pages within
documents that contain notes about them, all of those solutions have
problems. They are all essentially markup solutions. I have had similar
issues with one of my inventions where encoding into regular Unicode has
thus far not been achieved as such encoding has been declared out of
scope. Thus I have used various markup solutions in order to make
progress. One is to use an integral sign followed by a sequence of
circled digits. Another markup solution that I am using is an
exclamation mark followed by ordinary digits. Both are effective, but
are not regular Unicode plain text solutions. Maybe one day a regular
Unicode plain text encoding will be possible. The proposal for the
invention to become an international standard is with ISO. There had
been a good chance of a slide show that I produced being presented at a
plenary conference in June 2020, but the conference was cancelled due to
the COVID-19 situation and a virtual meeting replacement has been
difficult to hold because of time zone issues. Maybe if ISO decides to
standardize the invention as an international standard that it will then
be possible to have a rigorous regular Unicode encoding of the codes,
thus providing a plain text unambiguous non-proprietary interoperable
format.
I opine that the elegant long-term solution for the teletext control
characters is to encode twenty-seven codes from a block of thirty-two
code points in plane 14, keeping one-to-one correspondence with the
final five bits of the original teletext control code encoding. They
could be encoded as displayable characters so as to provide a graceful,
helpful, fallback if specialist software to act upon the characters as
if they are teletext control characters is not available.
William Overington
Friday 9 October 2020
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://corp.unicode.org/pipermail/unicode/attachments/20201009/b2afd260/attachment.htm>
More information about the Unicode
mailing list