You can get anything you want
At Brewster Kahle's restaurant.
On Wed, Jan 11, 2012 at 10:55 AM, LeVan,Ralph <[log in to unmask]> wrote:
> http://staff.oclc.org/~levan/PearsTraining/scifi.usmarc has 10,000 marc
> records in it. They are part of the old SiteSearch system that OCLC
> released as open source. They date back to 2002 and will not contain
> any Unicode, if you were hoping to include that as part of your testing.
> -----Original Message-----
> From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
> Alexander Johannesen
> Sent: Wednesday, January 11, 2012 5:36 AM
> To: [log in to unmask]
> Subject: Open datasets
> I'm in the middle of creating a meta data management system (including
> merging and persistent identifier management) for a somewhat different
> domain (intranets and business integration), but it's based on Topic
> Maps and so is well suited to other means of meta data handling / mangling.
> It's also going to be open-source, and it might be well-suited to libraries
> as well.
> So in order to test the integrity and performance of my system so far, I'm
> wondering if there's a suitable open dataset of bibliographic records that
> aren't too obscure (meaning, I can find the titles at Amazon or Open
> Library) that you could recommend? More than 1000 records, but less than a
> million, maybe?