On Tue, Dec 08, 2009 at 04:11:56PM +0100, Daniel Veillard wrote:
On Tue, Dec 08, 2009 at 02:40:49PM +0000, Daniel P. Berrange wrote:
> ACK, this is a better approach because it works when a browser is pointed
> to the HTML files locally too.
Okay, applied, this should propagate to the web site within an hour or
so,
Well this didn't work, and after some debugging I found the problem.
Out html.in are processed with xsltproc --html option, i.e. using the
HTML parser, with default to ISO latin if out of ascii range and no meta
information is provided, and the resulting .html file was correct
XML (XHTML) in UTF-8 but the initil UTF-8 2 bytes were misinterpreted
in the HTML parsing. The solution is to add a meta header to the
news.html.in file indicating the content is UTF-8, then the proper UTF-8
HTML is generated, and I checked the Web server doesn't add any HTTP
header for the encoding, so now we use UTF-8 from source to destination,
and the problem should be fixed.
Daniel
--
Daniel Veillard | libxml Gnome XML XSLT toolkit
http://xmlsoft.org/
daniel(a)veillard.com | Rpmfind RPM search engine
http://rpmfind.net/
http://veillard.com/ | virtualization library
http://libvirt.org/