Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

Wed Jun 4 13:26:01 CDT 2014

Sorry, I left out an important detail.

I wrote:
> 3. U+FEFF at the beginning of a stream (note: not "packet" or
> arbitrary cutoff point)

I meant U+FEFF as a zero-width no-break space. Obviously it is very
common to see U+FEFF as a signature or BOM.

My underlying question here is: how often does the producer of a stream
actually intend this character *at the start of a stream* to be a
ZWNBSP, which must not be stripped lest the actual text content be altered?
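
To make the stakes concrete, here is a minimal Python sketch of how the
choice of decoder decides whether a leading U+FEFF survives. The sample
string is made up for illustration, and the codecs shown are just the ones
in Python's standard library, not a statement about how any particular
producer or consumer behaves.

```python
# Hypothetical stream content: a leading U+FEFF followed by "abc".
text = "\ufeffabc"

utf8_bytes = text.encode("utf-8")
utf16le_bytes = text.encode("utf-16-le")

# Plain "utf-8" keeps the leading U+FEFF as a character (ZWNBSP reading).
kept_utf8 = utf8_bytes.decode("utf-8")

# "utf-8-sig" treats a leading U+FEFF as a signature and drops it.
stripped_utf8 = utf8_bytes.decode("utf-8-sig")

# "utf-16-le" states the byte order up front, so U+FEFF is kept as text.
kept_utf16 = utf16le_bytes.decode("utf-16-le")

# "utf-16" reads the leading FF FE as a BOM and consumes it.
stripped_utf16 = utf16le_bytes.decode("utf-16")

# A U+FEFF that is not at the start is left alone by "utf-8-sig".
interior = "a\ufeffb".encode("utf-8").decode("utf-8-sig")

print(len(kept_utf8), len(stripped_utf8))    # 4 3
print(len(kept_utf16), len(stripped_utf16))  # 4 3
print(interior == "a\ufeffb")                # True
```

If the producer really meant a ZWNBSP, the two "stripped" results have lost
a character of content; if the producer meant a signature, the two "kept"
results have gained one.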

Doug Ewell | Thornton, CO, USA | @DougEwell

