Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

Wed May 17 16:18:15 CDT 2017

Hans Åberg wrote:

>> Far from solving the stated problem, it would introduce a new one:
>> conversion from the "bad data" Unicode code points, currently
>> well-defined, would become ambiguous.
>
> Actually not: just translate the invalid UTF-8 sequences into invalid
> UTF-32.

Far from solving the stated problem, it would introduce TWO new ones...
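
To make the two behaviors concrete, here is a minimal Python sketch. It assumes Python's built-in "replace" and "surrogateescape" error handlers are reasonable stand-ins for, respectively, U+FFFD substitution and the suggested mapping of invalid bytes to invalid (unpaired-surrogate) code points. Neither is the exact scheme discussed in this thread, and the function names are hypothetical.

```python
def decode_with_replacement(data: bytes) -> str:
    """Decode UTF-8, substituting U+FFFD for each ill-formed sequence."""
    return data.decode("utf-8", errors="replace")


def decode_with_escape(data: bytes) -> str:
    """Decode UTF-8, mapping each invalid byte 0xNN to the lone
    surrogate U+DCNN (Python's 'surrogateescape' convention, used
    here only as a stand-in for 'invalid UTF-32')."""
    return data.decode("utf-8", errors="surrogateescape")


def is_well_formed(text: str) -> bool:
    """Return True if the string can be encoded by a strict UTF-8 encoder."""
    try:
        text.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    bad = b"a\xffb"  # 0xFF can never appear in well-formed UTF-8
    replaced = decode_with_replacement(bad)
    escaped = decode_with_escape(bad)
    print(ascii(replaced), is_well_formed(replaced))
    print(ascii(escaped), is_well_formed(escaped))
```

Under these assumptions, the U+FFFD result is ordinary well-formed Unicode that any conformant process can consume, while the escaped result preserves the original byte but is rejected by a strict encoder, so every downstream consumer would need to agree on how to handle it.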

--
Doug Ewell | Thornton, CO, US | ewellic.org