Quarantining crap HTML?

Ruud H.G. van Tol rvtol at isolution.nl
Tue May 21 13:58:23 BST 2013


On 21/05/2013 13:31, Dave Hodgkinson wrote:

> In keeping with the spirit of the list, this isn't directly a perl question
> but it might be part of the solution.
>
> I'm picking up HTML from another site, and that HTML is pretty crappy.
>
> Is there any way of quarantining it so it doesn't bugger up the rest of the
> page?

If plain text suffices, involve lynx/links in -dump mode.

-- 
Ruud



More information about the london.pm mailing list