Thank you all for the replies.
- Tim Spalding offered LibraryThing's database at
- Roy Tennant pointed at MIT's Barton dump: available at
but the winner is probably this python script based on Ed's suggestion:
from urllib import urlopen
from pymarc import MARCReader
for part in range(1, 30):
for record in MARCReader(urlopen(locrecordspattern % part)):
if record['020'] and record['020']['a']:
Now if I could only figure out how to install "easy_install" on FC8 so
I didn't have to run it with:
env PYTHONPATH=`pwd`/pymarc-2.21 ./readloc.py
On Tue, Apr 29, 2008 at 8:20 AM, Ed Summers <[log in to unmask]> wrote:
> You could download a snapshot of the full LC back file at the Internet
> Archive (kindly donated by Scriblio).
> Then run a script using your favorite MARC parsing library (mine
> currently is pymarc):
> from pymarc import MARCReader
> for record in MARCReader(file('part01.dat')):
> if record['020'] and record['020']['a']:
> print record['020']['a']
> On Mon, Apr 28, 2008 at 9:35 AM, Godmar Back <[log in to unmask]> wrote:
> > Hi,
> > for an investigation/study, I'm looking to obtain a representative
> > sample set (say a few hundreds) of ISBNs. For instance, the sample
> > could represent LoC's holdings (or some other acceptable/meaningful
> > population in the library world).
> > Does anybody have any pointers/ideas on how I might go about this?
> > Thanks!
> > - Godmar