In case anyone wants to have a go, here’s the ddc/lcsh data I extracted from the LC 2007 retrospective file [1]:
http://inkdroid.org/data/dewey-lcsh.gz
The file contains ddc/lcsh combinations from 2,909,673 records.
//Ed
[1] https://archive.org/details/marc_records_scriblio_net
On Dec 10, 2013, at 6:11 PM, Roy Tennant <[log in to unmask]> wrote:
> Has anyone looked at using the Classify web service for this? [1] It
> doesn't have a batch mode, but it has a web service [2].
> Roy
>
> [1] http://oclc.org/research/activities/classify.html
> [2] http://classify.oclc.org/classify2/api_docs/index.html
>
>
> On Tue, Dec 10, 2013 at 2:53 PM, Kyle Banerjee <[log in to unmask]>wrote:
>
>> This is my inclination. However, if the algorithm doesn't incorporate
>> values from the tables used to synthesize Dewey numbers, identifying the
>> stems of numbers may be tricky. It might be worth calling up someone at a
>> major Dewey library like UIUC or Northwestern to see if they might be
>> willing to provide data to add to what you get from LC.
>>
>> kyle
>>
>>
>> On Tue, Dec 10, 2013 at 1:18 PM, Edward Summers <[log in to unmask]> wrote:
>>
>>> Not a naive idea at all. If you have the stomach for it, you could
>> extract
>>> the Subject Heading / Dewey combinations out of say the LC Catalog MARC
>>> data [1] to use as training data for some kind of clustering [2]
>> algorithm.
>>> You might even be able to do something simple like keep a count of the
>>> Dewey ranges associated with each subject heading.
>>>
>>> I’m kind of curious myself, so I could work on getting the subject
>> heading
>>> / dewey combinations if you want?
>>>
>>> //Ed
>>>
>>> [1] https://archive.org/details/marc_records_scriblio_net
>>> [2] https://en.wikipedia.org/wiki/Cluster_analysis
>>>
>>> On Dec 10, 2013, at 8:18 AM, Irina Arndt <[log in to unmask]> wrote:
>>>
>>>> Hi CODE4LIB,
>>>>
>>>> we would like to add DDC classes to a bunch of MARC records, which
>>> contains only LoC Subject Headings.
>>>> Does anybody know, if a mapping between LCSH and DDC is anywhere
>>> existent (and available)?
>>>>
>>>> I understood, that WebDewey
>>> http://www.oclc.org/dewey/versions/webdewey.en.html might provide such
>> a
>>> service, but
>>>>
>>>> · we are no OCLC customers or subscribers to WebDewey
>>>>
>>>> · even if we were, I'm not sure, if the service matches our
>> needs
>>>>
>>>> I'm thinking of a tool, where I can upload my list of subject headings
>>> and get back a list, where the matching Dewey classes have been added
>> (but
>>> a 'simple' csv file with LCSH terms and DDC classes would be helpful as
>>> well- I am fully aware, that neither LCSH nor DDC are simple at all...) .
>>> Naïve idea...?
>>>>
>>>> Thanks for any clues,
>>>> Irina
>>>>
>>>>
>>>> -------
>>>>
>>>> Irina Arndt
>>>> Max Planck Digital Library (MPDL)
>>>> Library System Coordinator
>>>> Amalienstr. 33
>>>> D-80799 Muenchen, Germany
>>>>
>>>> Tel. +49 89 38602-254
>>>> Fax +49 89 38602-290
>>>>
>>>> Email: [log in to unmask]<mailto:[log in to unmask]>
>>>> http://www.mpdl.mpg.de
>>>
>>
|