You can see the match algorithm that the Open Library is using at:
http://www.kcoyle.net/temp/merge.html
Much depends on the particular characteristics of your file of records.
The above algorithm assumes MARC21 records. It also is designed to work
with records entering a database because it depends on certain indexed
values to retrieve potential matches.
kc
Emmanuel Di Pretoro wrote:
> Hi,
>
> Is there anybody who is already involved in the process of cleaning a MARC
> file. This means:
> - fusion multiple records into one single record;
> - or keep one record, and delete the others.
>
> Can you describe your methodology, as well as used algorithms.
>
> Thanks in advance.
>
> Regards,
>
> Emmanuel Di Pretoro
>
>
>
--
-----------------------------------
Karen Coyle / Digital Library Consultant
[log in to unmask] http://www.kcoyle.net
ph.: 510-540-7596 skype: kcoylenet
fx.: 510-848-3913
mo.: 510-435-8234
------------------------------------
|