Web scraping frameworks?

Dave Hodgkinson davehodg at gmail.com
Wed Mar 5 11:31:45 GMT 2014

I've tended to use Parallel::Process where remote sites have been able to
keep up and haven't been throttled, otherwise just let it run.

On Tue, Mar 4, 2014 at 11:49 PM, Kieren Diment <diment at gmail.com> wrote:

> Gearman's fine until you need a reliable queue.  It's certainly less of a
> pain to set up than rabbitmq, but if you start with gearman and find you
> need reliability after a while there's substantial pain to be experienced
> (unless you already know all about your reliable job queue implementation
> of choice).
> On 05/03/2014, at 10:35 AM, Jérôme Étévé wrote:
> > - For queuing jobs, I'm a big fan of Gearman. It's light, very stable
> > and very simple.

More information about the london.pm mailing list