Hi Sergio,
As part of eXtensible Catalog we developed a Dewey module for Drupal,
which takes a Dewey number, and use OCLC's dewey.info to fetch the
textual description of the part. When it was created the service
contained only 3 levels of the classification system, since then they
went ahead, and now it is deeper.
You can find the sorce here:
http://cgit.drupalcode.org/xc/tree/xc_dewey/xc_dewey.module?h=7.x-1.x
Maybe it helps you.
Regarding to UDC: it is much a harder task, and when I worked with it,
I run into a blocking problem, which is that UDC was not licenced as
freely usable, and I was not able to get a licence to use it in an
open source project. There were some other problems as well: UDC
changed from time to time, and sometimes it means, that a given
classification code means this thing in a given point of time, and
that thing some years later. The MARC catalog I worked with did not
contain any information about the UDC versions, so the accuracy of the
tool was not guaranted (of course you can do some intelligent
guessing). And the last problem was, that on contrary to the Dewey
classification UDC contains sometime very lengthy descriptions instead
of one or two words. Semantically it is OK, but makes the UI design a
little bit hard, and if you want to search for the textual
description, you'll end up sometimes with a "noisy" result set.
Otherwise to handle the operators, the subclasses, and all the nice
things UDC provides is a very interesting challange.
Cheers,
Péter
2015-06-12 12:59 GMT+02:00 Sergio Letuche <[log in to unmask]>:
> thank you very much for your quick reply, dear Stefano,
>
> i appreciate it
>
> 2015-06-12 13:47 GMT+03:00 Stefano Bargioni <[log in to unmask]>:
>
>> Hi, Sergio:
>> maybe this article [1 abstract] [2 English text] can give you some basic
>> ideas. We added a lot of DDC info in our Koha catalog two years ago.
>> HTH. Stefano
>>
>> [1] http://leo.cineca.it/index.php/jlis/article/view/8766
>> [2] http://leo.cineca.it/index.php/jlis/article/view/8766/8060
>>
>> On 12/giu/2015, at 12:03, Sergio Letuche <[log in to unmask]> wrote:
>>
>> > hello community!
>> >
>> > we are facing this challenging issue. We need to complete for a vast
>> amount
>> > of records, the dewey, UDC info, has anyone had any experience with this?
>> > We need some way (via modeling? mahout?) to try and discover these
>> values,
>> > based on some text, found in the records' metadata, and then auto
>> complete
>> > these values.
>> >
>> > I would appreciate any feedback, if there is any opensource tool you have
>> > used for this purpose, or if you are aware of any best practice for doing
>> > this task.
>> >
>> > Best
>> >
>>
>>
>> __________________________________________________
>> Il tuo 5x1000 al Patronato di San Girolamo della Carita' e' un gesto
>> semplice ma di grande valore.
>> Una tua firma aiutera' i sacerdoti ad essere piu' vicini alle esigenze di
>> tutti noi.
>> Aiutaci a formare sacerdoti e seminaristi provenienti dai 5 continenti
>> indicando nella dichiarazione dei redditi il codice fiscale 97023980580.
>>
--
Péter Király
software developer
GWDG, Göttingen - Europeana - eXtensible Catalog - The Code4Lib Journal
http://linkedin.com/in/peterkiraly
|