Unicode Expert's way of Writing Data Specifications?

Doug Ewell doug at ewellic.org
Wed Jun 10 10:50:55 CDT 2015


Costello, Roger L. <costello at mitre dot org> wrote:

> 1. What are the codepoints of these symbols: FLTPATH? Presumably you
> mean U+0046 U+004C U+0054 U+0050 U+0041 U+0054 U+0048.

I would specify, in prose or ABNF, that all keywords are encoded as
Basic Latin characters (or Basic Latin plus Latin-1, or whatever range
is desired).

This would then apply to all subsequent specifications that deal with
keywords, so there should be no need to specify U+xxxx code points in
each one.

If you use ABNF to specify the syntax, you can take advantage of
keywords like ALPHA and DIGIT in the core rules (RFC 5234, Section B.1),
which are predefined to be Basic Latin.

--
Doug Ewell | http://ewellic.org | Thornton, CO ����




More information about the Unicode mailing list