wiki scraping
Nic Gibson
nicg at corbas.net
Thu Feb 28 16:16:15 GMT 2008
On 2/28/08, IvorW <combobulus at xemaps.com> wrote:
>
>
> Reasonably sane. If there are any feeds available, such as RDF, RSS or
> Atom, this may help assist you getting to raw data and ignoring the
> formatting. Still, format=text may give you this.
>
> See http://search.cpan.org/~ivorw/OpenGuides-RDF-Reader/ in particular
> the og_mirror script that comes with it, for where I used this for
> OpenGuides.
>
> Ivor.
>
Ahhh, that's a large chunk of my coding done for me then :)
Given that I have the xslt for doing the FOP bit, life is looking easier
thanks
nic
--
Nic Gibson
Director, Corbas Consulting
Editorial and Technical Consultancy
http://www.corbas.co.uk/
More information about the london.pm
mailing list