Canonical block names: spaces vs. underscores
mathias at qiwi.be
Thu May 26 08:44:51 CDT 2016
> On 26 May 2016, at 10:17, Mathias Bynens <mathias at qiwi.be> wrote:
> `Blocks.txt` (http://unicode.org/Public/UNIDATA/Blocks.txt) lists blocks such as `Cyrillic Supplement`.
> However, `PropertyValueAliases.txt` (http://unicode.org/Public/UNIDATA/PropertyValueAliases.txt) refers to this block as `Cyrillic_Supplement`, with an underscore instead of a space.
> Which is it?
> If proper canonical block names use spaces instead of underscores, why doesn’t `PropertyValueAliases.txt` reflect that?
> If proper canonical block names use underscores instead of spaces, why doesn’t `Blocks.txt` reflect that?
Another example: `Blocks.txt` has `Superscripts and Subscripts`, whereas `PropertyValueAliases.txt` has `Superscripts_And_Subscripts`. Note that in addition to the underscores, the case of the `A` changed as well. Which is the canonical name?
The same goes for other blocks with “and” in the name, e.g. `Miscellaneous Symbols and Pictographs`, `Supplemental Symbols and Pictographs`, etc.
More information about the Unicode