UTF-8 display (was: Re: a mug)

Marcel Schneider charupdate at orange.fr
Wed Jul 22 01:38:42 CDT 2015


On 21 Jul 2015, at 16;00, Tom Gewecke  wrote:

> The IBM page seems to have an ellipsis character in UTF-8, with bytes E2 80 A6. The web server is set to force all browsers to use the encoding iso-8859-1 regardless of what charset is stipulated in the html code. The browser uses the Win 1252 equivalents and displays …
> 
> To see what a web server is forcing, if anything, you can use
> 
> http://web-sniffer.net/


Thank you. So the file i get when saving the page is a modified one. The workaround is then, if I understand well, to let web-sniffer check whether the server is forcing an unconsistent encoding:
| Content-Type: text/html;charset=ISO-8859-1
Then save the page...
| meta http-equiv="Content-Type" content="text/html; charset=windows-1252"
...and reset the charset to the value shown in the source code:
| meta http-equiv="Content-Type" content="text/html; charset=utf-8"
Then open this.
That's very useful!

Have a great day,

Marcel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://unicode.org/pipermail/unicode/attachments/20150722/7d46b03f/attachment.html>


More information about the Unicode mailing list