Print

Print


Another reason to check with the webmaster, all legalities aside, is that their top ten list might actually be being built on an RSS feed, but for whatever reason they don't offer it directly as a feed (or they do, but it wasn't obvious to you where that feed was to be found).  They might prefer you grab the feed rather than scrape the screen.  I don't actually have any feed-based pages on our site that aren't also available as feeds -- but some people might.  Also, for usage statistics reasons, I'd rather have bots hitting the feeds instead of the pages.

Genny Engel
Sonoma County Library
[log in to unmask]
707 545-0831 x581
www.sonomalibrary.org


-----Original Message-----
From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of Nate Hill
Sent: Sunday, October 02, 2011 7:23 PM
To: [log in to unmask]
Subject: [CODE4LIB] screen scraping

A question: what are the 'rules' around screen scraping?
If one site doesn't offer an RSS feed and you want to grab (for example)
their weekly top ten list with a script and then redisplay it on another
site, is that bad form?  Or even illegal?
Thanks-
Nate


-- 
Nate Hill
[log in to unmask]
http://www.natehill.net