Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

Doug Ewell doug at ewellic.org
Wed Jun 4 13:26:01 CDT 2014


Sorry, I left out an important detail.

I wrote:
 
> 3. U+FEFF at the beginning of a stream (note: not "packet" or
> arbitrary cutoff point)

I meant U+FEFF as a zero-width no-break space. Obviously it is very
common to see U+FEFF as a signature or BOM.

My underlying question here is, how common is it that the producer of a
stream actually intends this character *at the start of a stream* to be
a ZWNBSP, not to be stripped lest the actual text content be altered?

--
Doug Ewell | Thornton, CO, USA
http://ewellic.org | @DougEwell




More information about the Unicode mailing list