XML and UTF-8 BOM. [Was Re: Using Template Toolkit and UTF-8]
Matt Sergeant
msergeant at startechgroup.co.uk
Thu Jan 19 13:09:09 GMT 2006
On Thu, 19 Jan 2006, Aaron Crane wrote:
> Steve Sims writes:
> > These saved files have been generated by TT as UTF-8 but those files
> > do not contain a BOM
>
> There wasn't really meant to be any such thing as a "UTF-8 BOM", and
> there are situations in which it's harmful. (It's not clear that XML
> documents are well-formed if their first three bytes are 0xef 0xbb 0xbf
> and they contain an XML declaration, for example.)
Not so. You can even read the XML::SAX::PurePerl code for processing BOMs
which looks for this before checking for XML content. It's even talked
about in the XML spec, IIRC.
Matt.
______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email
______________________________________________________________________
More information about the london.pm
mailing list