Print

Print


quick followup on the thread..

github:  I looked at the cooperhewitt collection but don't see a way to
download the content...I could copy and paste their content but that may
not be the best approach for my files...documentation is thin, seems i
would have to provide email addresses for those seeking access...but
clearly that is not the case with how the cooperhewitt archive is
configured..

My primary concern has been to make it as simple a process as possible for
libraries which have limited technical expertise.  One of the reasons I
made a career change was my inability as a library director to integrate
very useful online resources in the library's content discovery system.
 Each of the libraries I led lacked expertise and/or the technical support
necessary to do so.  So, quit my job, re-tooled and now working
independently.

Internet Archive:  I did a search that included a query term MARC and found
the Open Library and this may be the best option but I will have to include
a field in each record I think...something I could easilydo...the marc
records do download nicely...I'll send a message for guidance on this

Eric's suggestion regarding MIME type is interesting as well but seems I
would have to have a recognizable type like zip...would prefer to have the
files no larger than 4000 or so records to facilitate processing...there
are also some content libraries may not want...eg, erotic literature,
juvenile content..

found the file for comparison with GP generated MARC:

=LDR  00945nam a22002535  4500
=001  PG27384
=006  m||||||||d||||||||
=007  cr||n\|||muaua
=008  \\\\\\s2008\\\\utu|||||o|||||||||||eng\d
=042  \\$adc
=090  \\$aPQ
=092  \0$aeBooks
=100  1\$aDumas, Alexandre,$d1802-1870.
=240  14$aUne fille du régent.$lEnglish
=245  14$aThe Regent's Daughter$h[electronic resource] /$cAlexandre Dumas.
=260  \\$aSalt Lake City :$bProject Gutenberg Literary Archive
Foundation,$c2008.
=300  \\$a1 online resource :$bmultiple file formats.
=500  \\$aRecords generated from Project Gutenberg RDF data.
=540  \\$aApplicable license:$uhttp://www.gutenberg.org/license
=600  10$aOrléans, Philippe,$cduc d',$d1674-1723$vFiction.
=651  \0$aFrance$xHistory$yRegency, 1715-1723$vFiction.
=655  \0$aElectronic books.
=710  2\$aProject Gutenberg.
=856  40$uhttp://www.gutenberg.org/etext/27384$zClick to access.

Gutenberg Project MARC:

=LDR  00721cam a22002293a 4500
=001  27384
=003  PGUSA
=008  081202s2008||||xxu|||||s|||||000\|\eng\d
=040  \\$aPGUSA$beng
=042  \\$adc
=050  \4$aPQ
=100  1\$aDumas, Alexandre, 1802-1870
=245  10$a$h[electronic resource] /$cby Alexandre, 1802-1870 Dumas
=260  \\$bProject Gutenberg,$c2008
=500  \\$aProject Gutenberg
=506  \\$aFreely available.
=516  \\$aElectronic text
=653  \0$aFrance -- History -- Regency, 1715-1723 -- Fiction
=653  \0$aOrléans, Philippe, duc d', 1674-1723 -- Fiction
=830  \0$aProject Gutenberg$v27384
=856  40$uhttp://www.gutenberg.org/etext/27384
=856  42$uhttp://www.gutenberg.org/license$3Rights

thanks again,
dana


On Wed, Jun 12, 2013 at 6:19 PM, Dana Pearson <[log in to unmask]>wrote:

> Kevin, Eric
>
> 7zip worked fine to unzip and records look pretty good since they used 653
> and preserved the string from the metadata element with the hypens.
>  However the records do not do subfield "d" in 100 or 700 fields and thus
> such content appears in the 245$c.  245$a seems to go missing with some
> frequency.  MarcEdit does not report any errors though.
>
> My original intent was just to keep my XSLT skills sharp while I had some
> free time last August.  After creating the stylesheet, I then had no free
> time until January when I could devote 2 or 3 hours to the post transform
> editing.  Thought I'd just dive in but the pool was much deeper than I had
> anticipated.
>
> Do think libraries will prefer my edited versions although different in
> non-access points as well.  Incidentally, not many additions since my
> harvest.
>
> First record in the Project Gutenberg produced records:
>
> =LDR  00721cam a22002293a 4500
> =001  27384
> =003  PGUSA
> =008  081202s2008||||xxu|||||s|||||000\|\eng\d
> =040  \\$aPGUSA$beng
> =042  \\$adc
> =050  \4$aPQ
> =100  1\$aDumas, Alexandre, 1802-1870
> =245  10$a$h[electronic resource] /$cby Alexandre, 1802-1870 Dumas
> =260  \\$bProject Gutenberg,$c2008
> =500  \\$aProject Gutenberg
> =506  \\$aFreely available.
> =516  \\$aElectronic text
> =653  \0$aFrance -- History -- Regency, 1715-1723 -- Fiction
> =653  \0$aOrléans, Philippe, duc d', 1674-1723 -- Fiction
> =830  \0$aProject Gutenberg$v27384
> =856  40$uhttp://www.gutenberg.org/etext/27384
> =856  42$uhttp://www.gutenberg.org/license$3Rights
>
> couldn't readily find the above item but here's an example of my records
> by the same author.
>
> =LDR  01002nam a22002535  4500
> =001  PG18997
> =006  m||||||||d||||||||
> =007  cr||n\|||muaua
> =008  \\\\\\s2006\\\\utu|||||o|||||||||||eng\d
> =042  \\$adc
> =090  \\$aPQ
> =092  \0$aeBooks
> =100  1\$aDumas, Alexandre,$d1802-1870.
> =245  14$aThe Vicomte de Bragelonne$h[electronic resource] :$bOr Ten Years
> Later being the completion of "The Three Musketeers" And "Twenty Years
> After" /$Alexandre Dumas.
> =260  \\$aSalt Lake City :$bProject Gutenberg Literary Archive
> Foundation,$c2006.
> =300  \\$a1 online resource :$bmultiple file formats.
> =500  \\$aRecords generated from Project Gutenberg RDF data.
> =540  \\$aApplicable license:$uhttp://www.gutenberg.org/license
> =650  \0$aAdventure stories.
> =650  \0$aHistorical fiction.
> =651  \0$aFrance$vHistory$yLouis XIV, 1643-1715$vFiction.
> =655  \0$aElectronic books.
> =710  2\$aProject Gutenberg.
> =856  40$uhttp://www.gutenberg.org/etext/18997$zClick to access.
>
> thanks for your interest..
>
> regards,
> dana
>
>
> On Wed, Jun 12, 2013 at 9:10 AM, Ford, Kevin <[log in to unmask]> wrote:
>
>> Hi Dana,
>>
>> Out of curiosity, how does your crosswalk differ from Project Gutenberg's
>> MARC files?  See, e.g.:
>>
>>
>> http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs#MARC_Records_.28automatically_generated.29
>>
>> Yours,
>> Kevin
>>
>> --
>> Kevin Ford
>> Network Development and MARC Standards Office
>> Library of Congress
>> Washington, DC
>>
>>
>>
>> > -----Original Message-----
>> > From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
>> > Dana Pearson
>> > Sent: Tuesday, June 11, 2013 9:24 PM
>> > To: [log in to unmask]
>> > Subject: [CODE4LIB] best way to make MARC files available to anyone
>> >
>> > I have crosswalked the Project Gutenberg RDF/DC metadata to MARC.  I
>> > would like to make these files available to any library that is
>> > interested.
>> >
>> > I thought that I would put them on my website via FTP but don't know if
>> > that is the best way.  Don't have an ftp client myself so was thinking
>> > that that may be now passé.
>> >
>> > I tried using Google Drive with access available via the link to two
>> > versions of the files, UTF8 and MARC8.  However, it seems that that is
>> > not a viable solution.  I can access the files with the URLs provided
>> > by setting the access to anyone with the URL but doesn't work for some
>> > of those testing it for me or with the links I have on my webpage..
>> >
>> > I have five folders with files of about 38 MB total.  I have separated
>> > the ebooks, audio books, juvenile content, miscellaneous and non-Latin
>> > scripts such as Chinese, Modern Greek.  Most of the content is in the
>> > ebook folder.
>> >
>> > I would like to make access as easy as possible.
>> >
>> > Google Drive seems to work for me.  Here's the link to my page with the
>> > links in case you would like to look at the folders.  Works for me but
>> > not for everyone who's tried it.
>> >
>> > http://dbpearsonmlis.com/ProjectGutenbergMarcRecords.html
>> >
>> > thanks,
>> > dana
>> >
>> > --
>> > Dana Pearson
>> > dbpearsonmlis.com
>>
>
>
>
> --
> Dana Pearson
> dbpearsonmlis.com
>



-- 
Dana Pearson
dbpearsonmlis.com