Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8
Markus Scherer via Unicode
unicode at unicode.org
Tue May 23 12:45:46 CDT 2017
On Tue, May 23, 2017 at 7:05 AM, Asmus Freytag via Unicode <
unicode at unicode.org> wrote:
> So, if the proposal for Unicode really was more of a "feels right" and not
> a "deviate at your peril" situation (or necessary escape hatch), then we
> are better off not making a RECOMMEDATION that goes against collective
> practice.
>
I think the standard is quite clear about this:
Although a UTF-8 conversion process is required to never consume
well-formed subsequences as part of its error handling for ill-formed
subsequences, such a process is not otherwise constrained in how it deals
with any ill-formed subsequence itself. An ill-formed subsequence
consisting of more than one code unit could be treated as a single error or
as multiple errors.
markus
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/unicode/attachments/20170523/db7a8587/attachment.html>
More information about the Unicode
mailing list