bob at randomness.org.uk
Mon Apr 22 13:17:27 BST 2013
On Mon, 22 Apr 2013, Roger Bell_West wrote:
> On Mon, Apr 22, 2013 at 11:45:43AM +0100, Mike Whitaker wrote:
>> On a similar subject, what PDF (or even text, assuming I can find something to extract the text on a page by page basis) indexing solutions are there out there in Perl?
> pdftotext and then throw the text at a generic indexing package. I
> keep meaning to do something with Plucene.
Lucy is possibly a better choice if you dont want to just use
Elasticsearch. Since Lucy is actively developed unlike Plucene.
everything should be purple and bendy
More information about the london.pm