Print

Print


Jonathan, 

Karen is correct -- CR/LF are invalid characters within a MARC record.  This has nothing to do if the character is valid in the set -- the format itself doesn't allow it.

--TR

-----Original Message-----
From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of Jonathan Rochkind
Sent: Thursday, May 19, 2011 11:29 AM
To: [log in to unmask]
Subject: Re: [CODE4LIB] is this valid marc ?

I wonder if it depends on if your record is in Marc8 or UTF-8, if I'm reading Karen right to say that CR/LF aren't in the Marc8 character set. 
They're certainly in UTF-8!  And a Marc record can be in UTF-8.

On 5/19/2011 2:27 PM, Jonathan Rochkind wrote:
> Is it really true that newline characters are not allowed in a marc 
> value?  I thought they were, not with any special meaning, just as 
> ordinary data.  If they're not, that's useful to know, so I don't put 
> any there!
>
> I'd ask for a reference to the standard that says this, but I suspect 
> it's going to be some impenetrable implication of a side effect of an 
> subtle adjective either way.
>
> On 5/19/2011 2:19 PM, Karen Coyle wrote:
>> Quoting Andreas Orphanides <[log in to unmask]>:
>>
>>>
>>> Anyway, I think having these two parts of the same URL data on 
>>> separate lines is definitely Not Right, but I am not sure if it adds 
>>> up to invalid MARC.
>>
>> Exactly. The CR and LF characters are NOT defined as valid in the 
>> MARC character set and should not be used. In fact, in MARC there is 
>> no concept of "lines", only variable length strings (usually up to
>> 9999 char).
>>
>> kc
>>
>>>
>>> -dre.
>>>
>>> [1] http://www.loc.gov/marc/bibliographic/bd856.html
>>> [2] I am not a cataloger. Don't hurt me.
>>> [3] I am not an expert on MARC ingest or on ruby-marc. I could be 
>>> wrong.
>>>
>>> On 5/19/2011 12:37 PM, James Lecard wrote:
>>>> I'm using ruby-marc ruby parser (v.0.4.2) to parse some marc files 
>>>> I get from a partner.
>>>>
>>>> The 856 field is splitted over 2 lines, causing the ruby library to 
>>>> ignore it (I've patched it to overcome this issue) but I want to 
>>>> know if this kind of marc is valid ?
>>>>
>>>> =LDR  00638nam  2200181uu 4500
>>>> =001  cla-MldNA01
>>>> =008  080101s2008\\\\\\\|||||||||||||||||fre||
>>>> =040  \\$aMy Provider
>>>> =041  0\$afre
>>>> =245  10$aThis Subject
>>>> =260  \\$aParis$bJ. Doe$c2008
>>>> =490  \\$aSome topic
>>>> =650  1\$aNarratif, Autre forme
>>>> =655  \7$abook$2lcsh
>>>> =752  \\$aA Place on earth
>>>> =776  \\$dParis: John Doe and Cie, 1973
>>>> =856  \2$qtext/html
>>>> =856
>>>> \\$uhttp://www.this-link-will-not-be-retrieved-by-ruby-marc-library
>>>>
>>>> Thanks,
>>>>
>>>> James L.
>>>
>>
>>
>>