character set detection?
dom at happygiraffe.net
Sun Jan 7 11:15:42 GMT 2007
Dirk Koopman wrote:
> Is there a way of, reasonably reliably, determining what the character
> set of a lump of text is?
Not really, no. Like Jesse said, Encode::Guess might be a good start.
If you want to do what the browser does, the algorithm is described here:
There's a python implementation of it as well.
More information about the london.pm