Print

Print


You can get anything you want
At Brewster Kahle's restaurant.
http://openlibrary.org/data#bulk_download

Simon

On Wed, Jan 11, 2012 at 10:55 AM, LeVan,Ralph <[log in to unmask]> wrote:

> http://staff.oclc.org/~levan/PearsTraining/scifi.usmarc has 10,000 marc
> records in it.  They are part of the old SiteSearch system that OCLC
> released as open source.  They date back to 2002 and will not contain
> any Unicode, if you were hoping to include that as part of your testing.
>
> Ralph
>
> -----Original Message-----
> From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
> Alexander Johannesen
> Sent: Wednesday, January 11, 2012 5:36 AM
> To: [log in to unmask]
> Subject: Open datasets
>
> Hiya,
>
> I'm in the middle of creating a meta data management system (including
> merging and persistent identifier management) for a somewhat different
> domain (intranets and business integration), but it's based on Topic
> Maps
> and so is well suited to other means of meta data handling / mangling.
> It's
> also going to be open-source, and it might be well-suited to library
> tasks
> as well.
>
> So in order to test the integrity and performance of my system so far
> I'm
> wondering if there's a suitable open dataset of bibliographic records
> that
> aren't too obscure (meaning, I can find the titles at amazon or Open
> Library) that you could recommend? More than 1000 records, but less than
> a
> million, maybe?
>
> Regards,
>
> Alex
>