Jérôme Étévé jerome.eteve at gmail.com
Thu Dec 12 11:12:01 GMT 2013

pdftotext (from poppler-utils) does a good job at extracting text from
PDFs, the rest should be text munching :)

Ideally you'd want to target information directly in the PDF
structure. I've got the feeling that's not easily done.


On 12 December 2013 10:47, Dave Hodgkinson <davehodg at gmail.com> wrote:
> I'm about to hit CPAN, but any wisdom from you lovely people
> would be nice!
> I've got bank statements in PDF from Barclays. Would it be easy
> to produce a CSV of the statement parts from them?
> What's the go-to PDF module?

Jerome Eteve

More information about the london.pm mailing list