Surrogates and noncharacters

Hans Aberg haberg-1 at telia.com
Tue May 12 10:58:00 CDT 2015


> On 12 May 2015, at 16:50, Philippe Verdy <verdy_p at wanadoo.fr> wrote:
> 
>> Indeed, that is why UTF-8 was invented for use in Unix-like environments.
>> 
> Not the main reason: communication protocols and data storage are also based on 8-bit code units (even if storage groups them into much larger blocks).

There is some history here:
  https://en.wikipedia.org/wiki/UTF-8#History




