Please check whether Stefano.mrc is valid: yaz-marcdump is unable to list it.
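
One quick way to see why a tool refuses a file is to check the ISO 2709 framing of each record. What follows is only a minimal sketch in Python, not what yaz-marcdump does internally: the function name check_iso2709 is made up for this example, and it tests only the record length, the base address of data and the terminators, not the content of the fields.

```python
from typing import List

LEADER_LEN = 24              # ISO 2709 leader is always 24 bytes
FIELD_TERMINATOR = b"\x1e"   # closes the directory and every field
RECORD_TERMINATOR = b"\x1d"  # closes every record
MIN_RECORD_LEN = LEADER_LEN + 2  # leader + directory terminator + record terminator


def check_iso2709(data: bytes) -> List[str]:
    """Return a list of framing problems found in a buffer of ISO 2709 records."""
    problems = []
    pos = 0
    index = 0
    while pos < len(data):
        index += 1
        leader = data[pos:pos + LEADER_LEN]
        if len(leader) < LEADER_LEN:
            problems.append(f"record {index}: truncated leader at byte {pos}")
            break
        # Leader bytes 0-4: total record length as five ASCII digits.
        if not leader[0:5].isdigit():
            problems.append(f"record {index}: record length is not numeric")
            break
        length = int(leader[0:5])
        if length < MIN_RECORD_LEN:
            problems.append(f"record {index}: implausible record length {length}")
            break
        record = data[pos:pos + length]
        if len(record) < length:
            problems.append(f"record {index}: file ends before the declared length")
            break
        if not record.endswith(RECORD_TERMINATOR):
            problems.append(f"record {index}: missing record terminator")
        # Leader bytes 12-16: base address of data; the directory must end
        # with a field terminator in the byte just before that address.
        if not leader[12:17].isdigit():
            problems.append(f"record {index}: base address is not numeric")
        else:
            base = int(leader[12:17])
            if not LEADER_LEN < base < length:
                problems.append(f"record {index}: base address {base} out of range")
            elif record[base - 1:base] != FIELD_TERMINATOR:
                problems.append(f"record {index}: directory is not terminated")
        pos += length
    return problems
```

For example, check_iso2709(open("Stefano.mrc", "rb").read()) returns an empty list when the framing of every record looks sound.
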
Anyway, I'm going to send you the code, which (I hope) you can adapt to your goal.
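
To make the goal concrete, the task described in the quoted message below (fill 676$a using the ISSN in 011$a, and log disagreements) could be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the article: records are modelled as plain nested dicts rather than real UNIMARC, fill_ddc is a made-up name, and suggest stands for whatever ISSN-to-DDC lookup is available.

```python
import csv
from typing import Callable, Dict, Iterable, List, Optional, TextIO

# Hypothetical simplified record model: {tag: {subfield code: value}}.
Record = Dict[str, Dict[str, str]]


def fill_ddc(records: Iterable[Record],
             suggest: Callable[[str], Optional[str]],
             log_file: TextIO) -> List[Record]:
    """Fill empty 676$a values and log cases where the suggestion disagrees."""
    writer = csv.writer(log_file)
    writer.writerow(["issn", "stored_ddc", "suggested_ddc"])
    result = []
    for record in records:
        result.append(record)
        issn = record.get("011", {}).get("a")
        if not issn:
            continue  # no ISSN, nothing to look up
        suggested = suggest(issn)
        if suggested is None:
            continue  # the lookup has no opinion on this ISSN
        stored = record.get("676", {}).get("a")
        if stored is None:
            # No DDC yet: store the suggested value.
            record.setdefault("676", {})["a"] = suggested
        elif stored != suggested:
            # Keep the stored value and leave the decision to a librarian.
            writer.writerow([issn, stored, suggested])
    return result
```

The stored value is never overwritten here, so the log file is the only place where the two values meet; whether to replace anything is left to the person reviewing it.
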
Sorry for the delay...
sb
On 15 Jun 2015, at 11:42, Sergio Letuche <[log in to unmask]> wrote:
> Dear Stefano,
>
> I really need your help with this: I am on a tight deadline, and I do
> not think I will manage this by myself.
>
> I tried to build something from the description in your paper, but without
> success.
>
> Please find attached a small portion of our db. The ISSN is in field
> 011$a, and we need to do two things with the existing records:
>
> First, fill 676$a (DDC) with the most appropriate value.
>
> Second, if possible, wherever your algorithm's suggestion differs from
> the DDC value already stored, write a log file recording both the stored
> and the suggested value, so that a librarian can decide which one is
> best.
>
> I would appreciate your help on this,
>
> thank you very much.
>
> P.S. The sample .mrc from our db (UNIMARC) is attached.
>
>
>
> 2015-06-12 13:47 GMT+03:00 Stefano Bargioni <[log in to unmask]>:
>
>> Hi, Sergio:
>> maybe this article [1 abstract] [2 English text] can give you some basic
>> ideas. We added a lot of DDC info in our Koha catalog two years ago.
>> HTH. Stefano
>>
>> [1] http://leo.cineca.it/index.php/jlis/article/view/8766
>> [2] http://leo.cineca.it/index.php/jlis/article/view/8766/8060
>>
>> On 12 Jun 2015, at 12:03, Sergio Letuche <[log in to unmask]> wrote:
>>
>>> Hello community!
>>>
>>> We are facing a challenging issue. We need to complete the Dewey and UDC
>>> information for a vast number of records. Has anyone had any experience
>>> with this? We need some way (via modeling? Mahout?) to discover these
>>> values from text found in the records' metadata, and then fill them in
>>> automatically.
>>>
>>> I would appreciate any feedback: any open-source tool you have used for
>>> this purpose, or any best practice you are aware of for doing this
>>> task.
>>>
>>> Best
>>>
>>
>>
>> __________________________________________________
>> Your 5x1000 to the Patronato di San Girolamo della Carita' is a simple
>> gesture of great value.
>> Your signature will help priests to be closer to the needs of
>> all of us.
>> Help us train priests and seminarians from the 5 continents
>> by entering tax code 97023980580 in your tax return.
>>
> <Stefano.mrc>