Print

Print


Kevin, Eric

7zip worked fine to unzip and records look pretty good since they used 653
and preserved the string from the metadata element with the hypens.
 However the records do not do subfield "d" in 100 or 700 fields and thus
such content appears in the 245$c.  245$a seems to go missing with some
frequency.  MarcEdit does not report any errors though.

My original intent was just to keep my XSLT skills sharp while I had some
free time last August.  After creating the stylesheet, I then had no free
time until January when I could devote 2 or 3 hours to the post transform
editing.  Thought I'd just dive in but the pool was much deeper than I had
anticipated.

Do think libraries will prefer my edited versions although different in
non-access points as well.  Incidentally, not many additions since my
harvest.

First record in the Project Gutenberg produced records:

=LDR  00721cam a22002293a 4500
=001  27384
=003  PGUSA
=008  081202s2008||||xxu|||||s|||||000\|\eng\d
=040  \\$aPGUSA$beng
=042  \\$adc
=050  \4$aPQ
=100  1\$aDumas, Alexandre, 1802-1870
=245  10$a$h[electronic resource] /$cby Alexandre, 1802-1870 Dumas
=260  \\$bProject Gutenberg,$c2008
=500  \\$aProject Gutenberg
=506  \\$aFreely available.
=516  \\$aElectronic text
=653  \0$aFrance -- History -- Regency, 1715-1723 -- Fiction
=653  \0$aOrléans, Philippe, duc d', 1674-1723 -- Fiction
=830  \0$aProject Gutenberg$v27384
=856  40$uhttp://www.gutenberg.org/etext/27384
=856  42$uhttp://www.gutenberg.org/license$3Rights

couldn't readily find the above item but here's an example of my records by
the same author.

=LDR  01002nam a22002535  4500
=001  PG18997
=006  m||||||||d||||||||
=007  cr||n\|||muaua
=008  \\\\\\s2006\\\\utu|||||o|||||||||||eng\d
=042  \\$adc
=090  \\$aPQ
=092  \0$aeBooks
=100  1\$aDumas, Alexandre,$d1802-1870.
=245  14$aThe Vicomte de Bragelonne$h[electronic resource] :$bOr Ten Years
Later being the completion of "The Three Musketeers" And "Twenty Years
After" /$Alexandre Dumas.
=260  \\$aSalt Lake City :$bProject Gutenberg Literary Archive
Foundation,$c2006.
=300  \\$a1 online resource :$bmultiple file formats.
=500  \\$aRecords generated from Project Gutenberg RDF data.
=540  \\$aApplicable license:$uhttp://www.gutenberg.org/license
=650  \0$aAdventure stories.
=650  \0$aHistorical fiction.
=651  \0$aFrance$vHistory$yLouis XIV, 1643-1715$vFiction.
=655  \0$aElectronic books.
=710  2\$aProject Gutenberg.
=856  40$uhttp://www.gutenberg.org/etext/18997$zClick to access.

thanks for your interest..

regards,
dana


On Wed, Jun 12, 2013 at 9:10 AM, Ford, Kevin <[log in to unmask]> wrote:

> Hi Dana,
>
> Out of curiosity, how does your crosswalk differ from Project Gutenberg's
> MARC files?  See, e.g.:
>
>
> http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs#MARC_Records_.28automatically_generated.29
>
> Yours,
> Kevin
>
> --
> Kevin Ford
> Network Development and MARC Standards Office
> Library of Congress
> Washington, DC
>
>
>
> > -----Original Message-----
> > From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
> > Dana Pearson
> > Sent: Tuesday, June 11, 2013 9:24 PM
> > To: [log in to unmask]
> > Subject: [CODE4LIB] best way to make MARC files available to anyone
> >
> > I have crosswalked the Project Gutenberg RDF/DC metadata to MARC.  I
> > would like to make these files available to any library that is
> > interested.
> >
> > I thought that I would put them on my website via FTP but don't know if
> > that is the best way.  Don't have an ftp client myself so was thinking
> > that that may be now passé.
> >
> > I tried using Google Drive with access available via the link to two
> > versions of the files, UTF8 and MARC8.  However, it seems that that is
> > not a viable solution.  I can access the files with the URLs provided
> > by setting the access to anyone with the URL but doesn't work for some
> > of those testing it for me or with the links I have on my webpage..
> >
> > I have five folders with files of about 38 MB total.  I have separated
> > the ebooks, audio books, juvenile content, miscellaneous and non-Latin
> > scripts such as Chinese, Modern Greek.  Most of the content is in the
> > ebook folder.
> >
> > I would like to make access as easy as possible.
> >
> > Google Drive seems to work for me.  Here's the link to my page with the
> > links in case you would like to look at the folders.  Works for me but
> > not for everyone who's tried it.
> >
> > http://dbpearsonmlis.com/ProjectGutenbergMarcRecords.html
> >
> > thanks,
> > dana
> >
> > --
> > Dana Pearson
> > dbpearsonmlis.com
>



-- 
Dana Pearson
dbpearsonmlis.com