character set detection?

Ash Berlin ash_cpan at firemirror.com
Sat Jan 6 23:28:26 GMT 2007


Dirk Koopman wrote:
> Is there a way of, reasonably reliably, determining what the character
> set of a lump of text is?
>
>   
In a(n unhelpful) word: No. Not in a 100% reliable way anyway.

Might want to look at http://icu.sourceforge.net/ - it has heuristics to 
do it (I think.)

Ash


More information about the london.pm mailing list