Here's the set that I generated a while ago - it's quite big as it covers the full Marc21 field and subfield set for bibliographic records. I'm releasing these under the terms of our Talis Community License. (http://www.talis.com/tdn/tcl) Would people be interested in a write-up of how we've used RadioactiveMarc and automated tests to validate Bath and US National Profile compliance? rob Rob Styles Programme Manager, Data Services, Talis tel: +44 (0)870 400 5000 fax: +44 (0)870 400 5001 direct: +44 (0)870 400 5004 mobile: +44 (0)7971 475 257 msn: [log in to unmask] irc: irc.freenode.net/mmmmmrob,isnick > -----Original Message----- > From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of > Binkley, Peter > Sent: 08 February 2007 21:13 > To: [log in to unmask] > Subject: [CODE4LIB] Radioactive records for Solr > > In hunting for data to help model subject faceting for MARC records, > I've just been looking at Bill Moen's Zinterop report > (http://www.unt.edu/zinterop/ZInterop2/Documents/ZInterop2FinalReport_w > e > m4Dec2005.pdf). It occurs to me that with all our various projects > working on indexing MARC records in Solr, we should set up and > distribute a set of "radioactive records" to use in each project to > diagnose and compare indexing and querying behaviour. Probably we could > just use the Zinterop records (which are described in detail in that > pdf > but aren't available for download anywhere I could find); but we might > want to enhance them with data suitable for testing our faceting > systems. Not sure what that would mean but I thought I'd throw it out. > > If you were at Access '05, you heard Bill describe the Z39.50 testing > he > was doing with radioactive records: records with known unique values in > all indexed fields, that could be used for automated testing of Z39.50 > search functionality. The same approach might be very useful as we feel > our way towards a Solr MARC indexing system. > > Has anyone already done something like this? > > Peter > > Peter Binkley > Digital Initiatives Technology Librarian > Information Technology Services > 4-30 Cameron Library > University of Alberta Libraries > Edmonton, Alberta > Canada T6G 2J8 > Phone: (780) 492-3743 > Fax: (780) 492-9243 > e-mail: [log in to unmask] The very latest from Talis read the latest news at www.talis.com/news listen to our podcasts www.talis.com/podcasts see us at these events www.talis.com/events join the discussion here www.talis.com/forums join our developer community www.talis.com/tdn and read our blogs www.talis.com/blogs Any views or personal opinions expressed within this email may not be those of Talis Information Ltd. The content of this email message and any files that may be attached are confidential, and for the usage of the intended recipient only. If you are not the intended recipient, then please return this message to the sender and delete it. Any use of this e-mail by an unauthorised recipient is prohibited. Talis Information Ltd is a member of the Talis Group of companies and is registered in England No 3638278 with its registered office at Knights Court, Solihull Parkway, Birmingham Business Park, B37 7YB.