Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

Richard Wordingham via Unicode unicode at unicode.org
Wed May 31 00:47:46 CDT 2017


On Tue, 30 May 2017 16:38:45 -0600
Karl Williamson via Unicode <unicode at unicode.org> wrote:

> Under Best Practices, how many REPLACEMENT CHARACTERs should the 
> sequence <ED B0 82> generate?  0, 1, 2, 3, 4 ?
> 
> In practice, how many do parsers generate?

See Markus Kuhn's test page
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt, test
5.1.5.  Firefox generates three replacement characters.
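
For a quick check against another decoder, here is a minimal sketch using
Python 3's built-in UTF-8 codec with the "replace" error handler. It is not
part of Kuhn's test page and says nothing about Firefox. It only counts the
replacement characters that one implementation produces for the bytes in
question. The variable names are illustrative.

```python
# <ED B0 82> is the three-byte encoding pattern for the lone low
# surrogate U+DC02, which is ill-formed in UTF-8.
data = bytes([0xED, 0xB0, 0x82])

# errors="replace" substitutes U+FFFD for each ill-formed subsequence
# instead of raising UnicodeDecodeError.
decoded = data.decode("utf-8", errors="replace")

# Count how many REPLACEMENT CHARACTERs the decoder emitted.
count = decoded.count("\ufffd")
print(count)
```

On a current CPython 3 this should print 3, one U+FFFD per byte, which
matches the Firefox result above. Other decoders may differ.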

Richard.

