Private Use areas

Peter Constable via Unicode unicode at
Mon Aug 27 15:20:31 CDT 2018

This was meant to go to the list.

From: Peter Constable
Sent: Monday, August 27, 2018 12:33 PM
To: wjgo_10009 at; jameskasskrv at; richard.wordingham at; mark at; beckiergb at; verdy_p at
Subject: RE: Private Use areas

That sounds like a non-conformant use of characters in the U+24xx block.


From: Unicode <unicode-bounces at<mailto:unicode-bounces at>> On Behalf Of William_J_G Overington via Unicode
Sent: Monday, August 27, 2018 2:00 AM
To: jameskasskrv at<mailto:jameskasskrv at>; richard.wordingham at<mailto:richard.wordingham at>; mark at<mailto:mark at>; beckiergb at<mailto:beckiergb at>; verdy_p at<mailto:verdy_p at>
Cc: unicode at<mailto:unicode at>
Subject: Re: Private Use areas


How about the following method.

In a text file that contains text that uses Private Use Area characters, start the file with a sequence of Enclosed Alphanumeric characters from regular Unicode, that sequence containing the metadata relating to those Private Use Area characters as used in their present context.

Use circled digits U+24EA, U+2460 .. U+2469, and Circled Latin letters U+24B6 .. U+24E9.

Use U+2473 as if it were a circled space. The use of 20 to mean a space often occurs in web addresses. I know that there it is hexadecimal and here it is decimal but it has the same look of being an encoded space and so that is why I am suggesting using it.

Start the sequence with PUAINFO encoded using seven circled Latin letters and any character other than a carriage return or a line feed shows that the sequence has ended. The use of PUAINFO encoded using seven circled Latin letters at the start of the sequence is so that text using enclosed alphanumeric characters for another purpose would not become disrupted.

Then a suitable software application can read the text file and then, either automatically or after the clicking of a button, extract metadata information from the sequence of enclosed alphanumeric characters and not display the sequence of enclosed alphanumeric characters.

Maybe other circled numbers in the range 10 through to 19 would have special meanings.

This method would keep everything within plane zero.

William Overington

Monday 27 August 2018

----Original message----
From : unicode at<mailto:unicode at>
Date : 2018/08/21 - 23:23 (GMTDT)
To : doug at<mailto:doug at>
Cc : unicode at<mailto:unicode at>
Subject : Re: Private Use areas
On Tue, Aug 21, 2018 at 3:02 PM Doug Ewell via Unicode <unicode at<mailto:unicode at>> wrote:
Ken Whistler wrote:

> The way forward for folks who want to do this kind thing is:
> 1. Define a *protocol* for reliable interchange of custom character
> property information about PUA code points.

I've often thought that would be a great idea. You can't get to steps 2
and 3 without step 1. I'd gladly participate in such a project.

As would I.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <>

More information about the Unicode mailing list