Perl 5.16 vs Ruby 2.0 UTF-8 support
gvimrc at gmail.com
Thu Aug 22 17:13:40 BST 2013
On 22/08/2013 16:59, Dave Cross wrote:
> Without seeing your data (or knowing anything much about Ruby's
> string-handling) I'd guess that your file is in one of the extended
> ASCII character sets (probably ISO-8859-1 or cp1252). You haven't told
> Perl to decode the data in any way, so it's just treating it as a stream
> of bytes. Perhaps Ruby defaults to assuming the input is utf8 and tries
> to decode it as such. And then barfs when one of the characters is in
> the range 128-255 - which is invalid for utf8.
> All a guess though.
Great. That makes sense. The character set is ISO-8859-1 but I can't
locate the problematic char.
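
One way to track it down, as a minimal sketch: read the file as raw bytes, so that nothing tries to decode it as UTF-8, and report every byte in the range 128-255 with its line and position. The helper name find_high_bytes is made up for illustration, and the sketch assumes that any byte above 127 is a candidate for the character Ruby chokes on.

```ruby
# Hypothetical helper (the name is an assumption, not from any library):
# report every non-ASCII byte (0x80-0xFF) found in raw data.
# Returns an array of [line_number, byte_position, byte_value] triples,
# with line_number and byte_position both counted from 1.
def find_high_bytes(data)
  hits = []
  # Work on a binary-tagged copy so Ruby never attempts a UTF-8 decode.
  raw = data.dup.force_encoding(Encoding::BINARY)
  raw.each_line.with_index(1) do |line, lineno|
    line.each_byte.with_index(1) do |byte, pos|
      hits << [lineno, pos, byte] if byte >= 0x80
    end
  end
  hits
end
```

Called as find_high_bytes(File.binread("data.txt")), where data.txt stands in for the real file, each triple gives the line, the byte offset within that line, and the byte value, which can then be looked up in the ISO-8859-1 table.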