Feedback on the proposal to change U+FFFD generation when decoding ill-formed UTF-8

Hans Åberg via Unicode unicode at unicode.org
Tue May 16 13:20:00 CDT 2017


> On 16 May 2017, at 20:01, Philippe Verdy <verdy_p at wanadoo.fr> wrote:
> 
> On Windows NTFS (and LFN extension of FAT32 and exFAT) at least, random sequences of 16-bit code units are not permitted. There's visibly a validation step that returns an error if you attempt to create files with invalid sequences (including other restrictions such as forbidding U+0000 and some other problematic controls).

For it to work the way I suggested, there would be low-level routines that handle the names raw, and then, on top of those, interface routines doing what you describe. On the Austin Group list, they mentioned a filesystem that does this directly in UTF-16, and it could have been the one you describe.
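
To make the layering concrete, here is a minimal sketch, assuming names are sequences of 16-bit code units. All class and function names below are hypothetical, made up for illustration; this is not how NTFS or any other real filesystem is implemented. The raw layer stores whatever it is given, and the interface layer on top rejects U+0000 and unpaired surrogates before passing the name down.

```python
from typing import Dict, List, Sequence, Tuple


class RawNameStore:
    """Hypothetical low-level layer: stores names as raw 16-bit code units."""

    def __init__(self) -> None:
        self._entries: Dict[Tuple[int, ...], bytes] = {}

    def create(self, units: Sequence[int], data: bytes = b"") -> None:
        # No validation here: any sequence of code units is accepted.
        self._entries[tuple(units)] = data

    def names(self) -> List[Tuple[int, ...]]:
        return list(self._entries)


def is_well_formed_utf16(units: Sequence[int]) -> bool:
    """Return True if every surrogate code unit is part of a proper pair."""
    i = 0
    n = len(units)
    while i < n:
        u = units[i]
        if 0xD800 <= u <= 0xDBFF:
            # A high surrogate must be followed by a low surrogate.
            if i + 1 < n and 0xDC00 <= units[i + 1] <= 0xDFFF:
                i += 2
                continue
            return False
        if 0xDC00 <= u <= 0xDFFF:
            # A low surrogate with no preceding high surrogate.
            return False
        i += 1
    return True


class ValidatingInterface:
    """Hypothetical interface layer: validates names, then delegates."""

    def __init__(self, store: RawNameStore) -> None:
        self._store = store

    def create(self, units: Sequence[int], data: bytes = b"") -> None:
        if 0x0000 in units:
            raise ValueError("U+0000 is not allowed in a name")
        if not is_well_formed_utf16(units):
            raise ValueError("name is not well-formed UTF-16")
        self._store.create(units, data)
```

With this split, a call such as `ValidatingInterface(RawNameStore()).create([0xD800])` raises `ValueError`, while `RawNameStore().create([0xD800])` succeeds, so only code that deliberately bypasses the interface layer can see or create ill-formed names.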




