Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8
Richard Wordingham via Unicode
unicode at unicode.org
Wed May 31 00:47:46 CDT 2017
On Tue, 30 May 2017 16:38:45 -0600
Karl Williamson via Unicode <unicode at unicode.org> wrote:
> Under Best Practices, how many REPLACEMENT CHARACTERs should the
> sequence <ED B0 82> generate? 0, 1, 2, 3, 4 ?
>
> In practice, how many do parsers generate?
See Markus Kuhn's test page
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt, test
5.1.5. Firefox generates three replacement characters.
Richard.
More information about the Unicode
mailing list