Quarantining crap HTML?
Ruud H.G. van Tol
rvtol at isolution.nl
Tue May 21 13:58:23 BST 2013
On 21/05/2013 13:31, Dave Hodgkinson wrote:
> In keeping with the spirit of the list, this isn't directly a perl question
> but it might be part of the solution.
> I'm picking up HTML from another site, and that HTML is pretty crappy.
> Is there any way of quarantining it so it doesn't bugger up the rest of the
If plain text suffices, involve lynx/links in -dump mode.
More information about the london.pm