PDF to CSV?

David Dorward david at dorward.me.uk
Thu Dec 12 13:39:17 GMT 2013


On 12 Dec 2013, at 12:41, DAVID HODGKINSON wrote:

> Not sure what you're trying to tell me here. It can read PDF? What?

PDF files do have plain text in them; it is just wrapped in markup, 
control characters and binary blobs (for things like embedded images and 
fonts).

It's possible that the data you want can be extracted from them by 
finding the appropriate bit of text in the file and using the code 
around it as a match in Template::Extract.
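
As a rough sketch of that idea, here it is in Python, with a regular 
expression standing in for a Template::Extract template. The sample 
data, the two-column layout and the function names are all made up for 
illustration. It also assumes the content stream is uncompressed; many 
PDFs compress their streams, in which case the text is not visible 
until the stream has been inflated.

```python
import csv
import io
import re

# Hypothetical fragment of an uncompressed PDF content stream.
# Each BT ... ET block is one row with two "(string) Tj" show-text operators.
SAMPLE = b"""BT
/F1 12 Tf
72 700 Td
(Widget) Tj
200 0 Td
(4.50) Tj
ET
BT
/F1 12 Tf
72 680 Td
(Gadget) Tj
200 0 Td
(12.00) Tj
ET
"""

# The markup around the text acts as the "template": match a text block
# and capture the two literal strings inside it.
# Note: escaped parentheses inside PDF strings are not handled here.
ROW = re.compile(rb"BT\s.*?\((.*?)\)\s*Tj.*?\((.*?)\)\s*Tj.*?ET", re.DOTALL)


def extract_rows(data):
    """Return one tuple of decoded strings per matched text block."""
    return [
        tuple(field.decode("latin-1") for field in match.groups())
        for match in ROW.finditer(data)
    ]


def to_csv(rows):
    """Serialise the rows as CSV text."""
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(rows)
    return out.getvalue()


print(to_csv(extract_rows(SAMPLE)), end="")
```

How well this works depends entirely on how regular the markup around 
the data is in the actual file, so the pattern would need adapting to 
whatever the PDF in question really contains.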


-- 
David Dorward
http://dorward.co.uk/

