Print

Print


I think you'd have a very hard time demonstrating any speed advantage to MARC over MARCXML. XML parsers have been speed optimized out the wazoo; If there exists a MARC parser that has ever been speed-optimized without serious compromise, I'm sure someone on this list will have a good story about it.

On Oct 25, 2010, at 3:05 PM, Patrick Hochstenbach wrote:

> Dear Nate,
> 
> There is a trade-off: do you want very fast processing of data -> go for binary data. do you want to share your data globally easily in many (not per se library related) environments -> go for XML/RDF. 
> Open your data and do both :-)
> 
> Pat
> 
> Sent from my iPhone
> 
> On 25 Oct 2010, at 20:39, "Nate Vack" <[log in to unmask]> wrote:
> 
>> Hi all,
>> 
>> I've just spent the last couple of weeks delving into and decoding a
>> binary file format. This, in turn, got me thinking about MARCXML.
>> 
>> In a nutshell, it looks like it's supposed to contain the exact same
>> data as a normal MARC record, except in XML form. As in, it should be
>> round-trippable.
>> 
>> What's the advantage to this? I can see using a human-readable format
>> for poorly-documented file formats -- they're relatively easy to read
>> and understand. But MARC is well, well-documented, with more than one
>> free implementation in cursory searching. And once you know a binary
>> file's format, it's no harder to parse than XML, and the data's
>> smaller and processing faster.
>> 
>> So... why the XML?
>> 
>> Curious,
>> -Nate

Eric Hellman
President, Gluejar, Inc.
41 Watchung Plaza, #132
Montclair, NJ 07042
USA

[log in to unmask] 
http://go-to-hellman.blogspot.com/
@gluejar