Print

Print


Michael,

For institutions that catalog digital objects in MARC or link to digital surrogates as UT Arlington does, my recommendation is to use the 856 as follows:

856 41 $u http://www.uta.edu/library/ccon/images/thumbs/00384Thumb.jpg $3 thumbnail image
856 41 $u http://www.uta.edu/library/ccon/mrsid_images/ccon/00384.sid $3 access image
856 42 $u http://libraries.uta.edu/ccon/scripts/ShowMap.asp?accession=00384 $3 Cartographic Connections web site

If there were a finding aid, it would go in as

856 42 $u http://library.uta.edu/findingAids/maps.jsp $3 finding aid

There have been conservations on the AUTOCAT list about the subfield 3; there's no controlled vocabulary or even best practices for how to use it, which makes it very difficult to use as a guide to what exactly you're linking to. We're working on a formal set of best practices for digitization projects in Texas that will include a recommendation similar to this.

From a set of 856s like this, I can create a stylesheet to display the thumbnail image and link out to the website appropriately in our statewide image search tool Texas Heritage Online. I access UT Arlington's collections over Z39.50, btw -- see http://www.texasheritageonline.org/search.tkl?focus=target-utar-ccon.tkl&cclquery=map&offset=1. 

Having HTML tags in the MARC is unnecessary, and might break things in "normal" catalog displays. What I need most is consistency so that I don't have to figure out every possible variation for every possible system, which gets a bit old.

Danielle Cunniff Plumer, Coordinator
Texas Heritage Digitization Initiative
Texas State Library and Archives Commission
512.463.5852 (phone) / 512.936.2306 (fax)
[log in to unmask]

-----Original Message-----
From: Code for Libraries [mailto:[log in to unmask]]On Behalf Of
Doran, Michael D
Sent: Sunday, June 21, 2009 5:09 PM
To: [log in to unmask]
Subject: Re: [CODE4LIB] HTML mark-up in MARC records


Hi Stuart,

> A couple of quick questions:

I'd be glad to answer, but I suspect these really only have relevance *after* the main issue ("Is embedding HTML mark-up in MARC records a good/bad idea?") is decided.  ;-)

> (1) When you say "HTML" which version of HTML are you using?

For the HTML markup in the record, there's obviously no version explicitly specified.  Some <img> tags have an end tag (i.e. <img src="URL" />), so could be said to conform to XHTML 1.0, others have no end tag, so are generic HTML.  The ILS in question declared pages to be "HTML 4.0 Transitional" in older versions of the online catalog but HTML standards compliance was wishful thinking.  The current version declares pages to be "HTML 4.01 Transitional" and comes a lot closer to conforming.

This does bring up the issue, though, of the potential for a mis-match in conformation to a  declared DOCTYPE between the HTML mark-up in the record, and the online opac's HTML mark-up. 

> (2) What tool are you using to validate the HTML inside the MARC?

None that I am aware of.  (Note I'm not in the cataloging department, so am not familiar with all their workflow.)

> (3) Since HTML can use character encodings that MARC doesn't understand, how are you escaping the non-ASCII characters in the HTML?

I'm not sure what you are asking here.  I'm not aware of any HTML elements and/or attributes that contain non-ASCII characters.  Perhaps you are referring to data (or perhaps attribute values) rather than to the HTML mark-up code.  Our MARC records are encoded in Unicode UTF-8, so potentially any character can be represented.  For display of the data on the web, the online catalog is declaring that character set in a meta tag: <META http-equiv="Content-Type" content="text/html; charset=UTF-8">.

-- Michael

# Michael Doran, Systems Librarian
# University of Texas at Arlington
# 817-272-5326 office
# 817-688-1926 mobile
# [log in to unmask]
# http://rocky.uta.edu/doran/
________________________________________
From: Code for Libraries [[log in to unmask]] On Behalf Of stuart yeates [[log in to unmask]]
Sent: Sunday, June 21, 2009 4:05 PM
To: [log in to unmask]
Subject: Re: [CODE4LIB] HTML mark-up in MARC records

Doran, Michael D wrote:
> Is anybody else embedding HTML mark-up code in MARC records [1]?  We're currently including an "<img>" tag in some MARC Holdings records in the 856z [2].   I'm inclined to think that HTML mark-up does not belong anywhere in MARC records, but am looking for other opinions (preferably with the reasoning behind the opinions), both pro and con.

A couple of quick questions:

(1) When you say "HTML" which version of HTML are you using?
(2) What tool are you using to validate the HTML inside the MARC?
(3) Since HTML can use character encodings that MARC doesn't understand,
how are you escaping the non-ASCII characters in the HTML?

cheers
stuart
--
Stuart Yeates
http://www.nzetc.org/       New Zealand Electronic Text Centre
http://researcharchive.vuw.ac.nz/     Institutional Repository