Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)
Doug Ewell
doug at ewellic.org
Wed Jun 4 13:26:01 CDT 2014
Sorry, I left out an important detail.
I wrote:
> 3. U+FEFF at the beginning of a stream (note: not "packet" or
> arbitrary cutoff point)
I meant U+FEFF as a zero-width no-break space. Obviously it is very
common to see U+FEFF as a signature or BOM.
My underlying question here is, how common is it that the producer of a
stream actually intends this character *at the start of a stream* to be
a ZWNBSP, not to be stripped lest the actual text content be altered?
--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell
More information about the Unicode
mailing list