XML and UTF-8 BOM. [Was Re: Using Template Toolkit and UTF-8]

Matt Sergeant msergeant at startechgroup.co.uk
Thu Jan 19 13:09:09 GMT 2006

On Thu, 19 Jan 2006, Aaron Crane wrote:

> Steve Sims writes:
> > These saved files have been generated by TT as UTF-8 but those files  
> > do not contain a BOM
> There wasn't really meant to be any such thing as a "UTF-8 BOM", and
> there are situations in which it's harmful.  (It's not clear that XML
> documents are well-formed if their first three bytes are 0xef 0xbb 0xbf
> and they contain an XML declaration, for example.)

Not so. You can even read the XML::SAX::PurePerl code for processing BOMs 
which looks for this before checking for XML content. It's even talked 
about in the XML spec, IIRC.


This email has been scanned by the MessageLabs Email Security System.
For more information please visit http://www.messagelabs.com/email 

More information about the london.pm mailing list