NamesList.txt as data source
Janusz S. Bień
jsbien at mimuw.edu.pl
Sat Mar 26 04:10:24 CDT 2016
On Thu, Mar 10 2016 at 22:40 CET, kenwhistler at att.net writes:
[...]
> The *reason* that NamesList.txt exists at all is to drive the tool, unibook,
> that formats the full Unicode code charts for posting. It is only
> posted in the Unicode Character Database at all as a matter of
> convenience, to give people access to a text only version of the
> names list that appears in the fully formatted pdf versions of the
> code charts
> that contain all the representative glyphs.
>
> NamesList.txt should *not* be data mined.
I've just noticed that NamesList.txt is in a sense data mined by the
Unicode consortium itself. I mean the "Unicode Utilities: Character
Properties", which e.g. for LATIN SMALL LETTER P WITH FLOURISH
(http://unicode.org/cldr/utility/character.jsp?a=A753) display in
particular
subhead: Medievalist addition
Am I right that this information is available only in NamesList.txt?
In my opinion this is important information and should be officially
available for character data mining engines.
Best regards
Janusz
--
,
Prof. dr hab. Janusz S. Bien - Uniwersytet Warszawski (Katedra Lingwistyki Formalnej)
Prof. Janusz S. Bien - University of Warsaw (Formal Linguistics Department)
jsbien at uw.edu.pl, jsbien at mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/
More information about the Unicode
mailing list