Nathan Vack wrote: > Hey cats, > > I'm starting to think (very excitedly) about the Lucene session, and > realized that I'd better get our data into an XML form, so I can do > interesting things with it. > > Anyone here have experience (or code I could steal) dumping data from > Voyager into... anything? I'm happy working in PHP, Java, Ruby, or > perl -- though happiest, probably, in Ruby. Nate, it's pretty easy. Once you dump your records into a giant marc file, you can run marc2xml (http://search.cpan.org/~kados/MARC-XML-0.82/bin/marc2xml). Then run an XSLT against the marcxml file to create your SOLR xml docs. One thing I am hoping that can come out of the preconference is a standard XSLT doc. I sat down with my metadata librarian to develop our XSLT doc -- determining what fields are to be searchable what fields should be left out to help speed up results, etc. It's pretty easy, I think you will be amazed how fast you can have a functioning system with very little effort. Andrew