Google has an image search function.  
It has already spidered a huge number of sites.
It honors a site's robots.txt directives (wget can be made to ignore robots.txt).  
Results can be limited to a specific site or domain.

https://www.google.com/search?q=site:nd.edu&tbm=isch
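
If you want the same kind of link for other sites, here is a minimal Python sketch that builds it. The function name image_search_url is made up for illustration, and the only query parameters it uses are the two that appear in the URL above (q with a site: restriction, and tbm=isch).

```python
from urllib.parse import urlencode


def image_search_url(site):
    """Build a Google image search URL restricted to one site or domain."""
    # "site:" limits results to the given domain; "tbm=isch" asks for image results.
    params = {"q": "site:" + site, "tbm": "isch"}
    # Keep ":" unescaped so the result matches the hand-written URL above.
    return "https://www.google.com/search?" + urlencode(params, safe=":")


if __name__ == "__main__":
    print(image_search_url("nd.edu"))
```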

-Andy

-----Original Message-----
From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of Eric Lease Morgan
Sent: Monday, November 18, 2013 1:53 PM
To: [log in to unmask]
Subject: [CODE4LIB] flip book

I want to create a sort of flip book with the images from a website. How?

Here's the idea. Suppose, just like a book, I could flip through the pages of a website to peruse the images it contains. How could I create this? Hmmm... How about:

  1. mirror the website locally (with something like wget)
  2. traverse the mirror looking for image (*.jpeg, *.gif, *.png, etc.) files
  3. create a list of images and URLs where they reside
  4. present a Web interface allowing the reader to "flip" through the list
  5. when they find an image of interest, allow them to click on it to see the image in context
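
Steps 2 and 3 might be sketched in Python roughly as follows. This is only a sketch under a few assumptions: the mirror already exists on disk, its directory layout matches the site's URL paths (as a wget mirror typically does), and the names list_images, mirror_root, and base_url are hypothetical. It maps each image file to the image's own URL; finding the pages that embed each image would additionally require parsing the mirrored HTML.

```python
import os
from urllib.parse import quote

# Extensions treated as images; extend as needed.
IMAGE_EXTENSIONS = {".jpeg", ".jpg", ".gif", ".png"}


def list_images(mirror_root, base_url):
    """Return sorted (local_path, url) pairs for image files under mirror_root."""
    images = []
    for dirpath, _dirnames, filenames in os.walk(mirror_root):
        for name in filenames:
            # Compare extensions case-insensitively (e.g. "photo.PNG").
            if os.path.splitext(name)[1].lower() not in IMAGE_EXTENSIONS:
                continue
            local_path = os.path.join(dirpath, name)
            # Turn the path relative to the mirror into a URL path.
            rel = os.path.relpath(local_path, mirror_root).replace(os.sep, "/")
            images.append((local_path, base_url.rstrip("/") + "/" + quote(rel)))
    return sorted(images)


if __name__ == "__main__":
    # Hypothetical mirror directory and site URL.
    for path, url in list_images("./mirror", "http://example.org/"):
        print(path, url)
```

The resulting list could then feed the Web interface in step 4, one image per "page".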

Such a tool would provide means for thoroughly browsing a website.

-
Eric Lease Morgan