Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

Doug Ewell via Unicode unicode at unicode.org
Wed May 17 16:18:15 CDT 2017


Hans Åberg wrote:

>> Far from solving the stated problem, it would introduce a new one:
>> conversion from the "bad data" Unicode code points, currently
>> well-defined, would become ambiguous.
>
> Actually not: just translate the invalid UTF-8 sequences into invalid
> UTF-32.

Far from solving the stated problem, it would introduce TWO new ones...
 
--
Doug Ewell | Thornton, CO, US | ewellic.org




More information about the Unicode mailing list