Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

Shawn Steele via Unicode unicode at
Tue May 30 17:51:30 CDT 2017

> Until TUS 3.1, it was legal for UTF-8 parsers to treat the sequence <C0 AF> as U+002F.

Sort of, maybe.  It was not legal for them to generate it though.  So you could kind of infer that it was not a legal sequence.


More information about the Unicode mailing list