Surrogates and noncharacters

Hans Aberg haberg-1 at
Tue May 12 10:58:00 CDT 2015

> On 12 May 2015, at 16:50, Philippe Verdy <verdy_p at> wrote:
>> Indeed, that is why UTF-8 was invented for use in Unix-like environments.
> Not the main reason: communication protocols, and data storage is also based on 8-bit code units (even if storage group them by much larger blocks).

There is some history here:

More information about the Unicode mailing list