UTS#51 and emoji-sequences.txt
Yifán Wáng via Unicode
unicode at unicode.org
Fri Jun 8 21:54:57 CDT 2018
When I'm looking at
https://unicode.org/Public/emoji/11.0/emoji-sequences.txt
It goes on line 16 that:
----------
# type_field: any of {Emoji_Combining_Sequence, Emoji_Flag_Sequence,
Emoji_Modifier_Sequence}
# The type_field is a convenience for parsing the emoji sequence
files, and is not intended to be maintained as a property.
----------
This field, however, actually contains "Emoji_Keycap_Sequence" and
"Emoji_Tag_Sequence", instead of "Emoji_Combining_Sequence" (it was
already so in 5.0).
And I go back to
http://www.unicode.org/reports/tr51/
Under the section 1.4.6:
----------
ED-21. emoji keycap sequence set — The specific set of emoji sequences
listed in the emoji-sequences.txt file [emoji-data] under the category
Emoji_Keycap_Sequence.
ED-22. emoji modifier sequence set — The specific set of emoji
sequences listed in the emoji-sequences.txt file [emoji-data] under
the category Emoji_Modifier_Sequence.
ED-23. RGI emoji flag sequence set — The specific set of emoji
sequences listed in the emoji-sequences.txt file [emoji-data] under
the category Emoji_Flag_Sequence.
ED-24. RGI emoji tag sequence set — The specific set of emoji
sequences listed in the emoji-sequences.txt file [emoji-data] under
the category Emoji_Tag_Sequence.
----------
I'm not sure if the "category" means "type_field" or headings in the
txt file, as the headings do not contain underscores. If it means
"type_field", then the description of type_field above is wrong.
Also the section 1.4.5:
----------
ED-14c. emoji keycap sequence — A sequence of the following form:
emoji_keycap_sequence := [0-9#*] \x{FE0F 20E3}
- These characters are in the emoji-sequences.txt file listed under
the category Emoji_Keycap_Sequence
----------
While in the previous version (rev. 12):
----------
ED-14c. emoji keycap sequence — An emoji combining sequence of the
following form:
emoji_keycap_sequence := [0-9#*] \x{FE0F 20E3}
- These characters are in the emoji-sequences.txt file listed under
the category Emoji_Combining_Keycap_Sequence
----------
It seems there was some kind of confusion on terms, but anyway, isn't
the last line of ED-14c redundant with the current revision? (Or
"Emoji_Combining_Sequence" is intended?)
Thank you.
Wang Yifan
More information about the Unicode
mailing list