wiki scraping

Chris Benson chrisb at jesmond.demon.co.uk
Thu Feb 28 16:26:29 GMT 2008


On Thu, Feb 28, 2008 at 02:46:55PM +0000, Nic Gibson wrote:
> Does that sound sane? Is there some little tool lurking somewhere that can
> do any of this for me? Have I missed an obvious solution?

If you don't mind them looking like web-pages-printed-out then what I
use is the venerable htmldoc: http://www.easysw.com/htmldoc/
(But I'm using the:
	"A limited-use open source version is also available at
	http://www.htmldoc.org/"
)

I spider a dozen or so usemod wikis every night, stuffing the URIs into
config files for htmldoc. Not pretty, but not a lot of work.
-- 
Chris Benson


More information about the london.pm mailing list