New CJK characters

James Kass jameskass at code2001.com
Tue Nov 2 22:34:47 CDT 2021


On 2021-11-03 1:09 AM, Abraham Gross via Unicode wrote:
> Q: What would the specifics of such a system look like behind the scenes?
> A: I'm not sure yet, but I think Wenlin's CDL (http://guide.wenlininstitute.org/wenlin4.3/Character_Description_Language) would be a good place to start.

This web page gives an overview of some of the approaches:
https://everything.explained.today/Chinese_character_description_languages/

Wenlin's approach is quite sophisticated and has been around for a 
while.  A quick web search didn't turn up any previous proposals for 
getting Wenlin's CDL enshrined in Unicode, although Richard Cook has 
submitted various encoding proposals over the years.  If Wenlin 
personnel never floated any CDL-related proposal, it may be that they 
themselves consider such an approach to be out of scope for plain text.

As many of us know, Andrew West maintains a list of IDS for encoded Han 
characters, available here:
https://www.babelstone.co.uk/CJK/index.html
Using IDS to generate glyphs on the fly might be workable, although such
an approach might well be relegated to a higher-level protocol.
Meanwhile, an IDS can already be stored and exchanged in a standard
fashion.  Counting how often the IDS for an as-yet-unencoded ideograph
occurs in plain text might help to establish usage for future encoding
consideration.
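
As a rough illustration of that kind of counting, here is a minimal Python sketch.  It assumes only the twelve Ideographic Description Characters U+2FF0..U+2FFB, and it simplifies matters by accepting any non-whitespace character as a component, which is looser than the actual IDS syntax.  The function names parse_ids and count_ids are made up for this example.

```python
from collections import Counter

# Ideographic Description Characters U+2FF0..U+2FFB and their operand counts.
# U+2FF2 and U+2FF3 take three operands; the others take two.
ARITY = {chr(cp): 2 for cp in range(0x2FF0, 0x2FFC)}
ARITY["\u2FF2"] = 3
ARITY["\u2FF3"] = 3


def parse_ids(text, start):
    """Return the index just past one IDS beginning at text[start], or None."""
    if start >= len(text):
        return None
    ch = text[start]
    if ch not in ARITY:
        # Simplification (assumption): any non-whitespace character
        # is accepted as a component.
        return None if ch.isspace() else start + 1
    pos = start + 1
    for _ in range(ARITY[ch]):
        pos = parse_ids(text, pos)
        if pos is None:
            return None
    return pos


def count_ids(text):
    """Count each complete, outermost IDS found in text."""
    counts = Counter()
    i = 0
    while i < len(text):
        end = parse_ids(text, i) if text[i] in ARITY else None
        if end is None:
            i += 1
        else:
            counts[text[i:end]] += 1
            i = end  # skip past the sequence, so nested IDS are not recounted
    return counts
```

Running count_ids over a corpus would give a tally per sequence.  A nested sequence is counted only as part of its outermost IDS, and an incomplete sequence (an operator without enough components) is skipped.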

Ken Whistler crunched some numbers about CJK additions here:
https://www.unicode.org/mail-arch/unicode-ml/y2018-m03/0023.html

Additional information about CJK proliferation can be found here:
https://www.babelstone.co.uk/Blog/2007/07/cjk-unified-ideographs-to-infinity-and.html


