character set detection?

Dominic Mitchell dom at
Sun Jan 7 11:15:42 GMT 2007

Dirk Koopman wrote:
> Is there a way of, reasonably reliably, determining what the character
> set of a lump of text is?

Not really, no.  Like Jesse said, Encode::Guess might be a good start.

If you want to do what the browser does, the algorithm is described here:

There's a python implementation of it as well.


More information about the mailing list