Karen,
*
- Identities in WorldCat are based on literary warrant, i.e., names for
people who authored/edited something or were subject in someone else's
literary work. Personal names in WikiPedia are not entered according to
literary warrant. Nor is their form vetted according to NAF.
- Ludvig van Beethoven doesn't need much disambiguation. Nor does Mark
Twain.
- So yes, Karen/Ralph/Tom -- how exactly is Wikipedia used for
disambiguation? are you certain it's used for THAT purpose? if yes, can you
send us a proper example? Morris, William seems a good example.
Ya'aqov
*
*
*
*
*
*On Thu, May 19, 2011 at 3:48 PM, Graham Seaman <[log in to unmask]>wrote:
*
>
> *Hi Karen
>
> Thanks for the code. As far as I can see though it doesn't actually
> solve my disambiguation problem - since identity_info.php just takes a
> name as input, it can't guess which of the people with this name is
> meant other than by using the most commonly referenced one, which in the
> OCLC data actually seems to often be an amalgam of several people with
> the name; for example
>
> http://worldcat.org/identities/viaf-DNB|100804799
>
> is William Morris, the 18th century African-American engineer whose most
> widely held works include News from Nowhere, Introduction to Fly
> Fishing, and Ancient Slavery Disapproved of by God - ie an amalgamation
> of the various most famous people known by this name.
>
> I guess this is just a hard problem overall.
>
> Graham
> *
> *
>
>
> On 05/19/11 14:56, Karen Coombs wrote:
> > Graham,
> >
> > I'd advocate using WorldCat Identities to get to the appropriate url
> > for dbpedia. Each Identity record has a wikipedia element in it that
> > you could use to link to either Wikipedia or dbpedia.
> >
> > If you want to see an example of this in action you can check out the
> > Author Info demo I did for code4lib 2010 here -
> >
> http://www.librarywebchic.net/mashups/author_info/info_about_this_author.php?OCLCNum=32939031
> >
> > The code for this demo is available for download at -
> > http://www.worldcat.org/devnet/code/devnetDemos/trunk/
> >
> > You'll want the author_info folder and identity_info.php
> >
> > Karen
> >
> > Karen A. Coombs
> > Product Manager
> > OCLC Developer Network
> > [log in to unmask]
> >
> >
> > On Thu, May 19, 2011 at 4:40 AM, graham <[log in to unmask]> wrote:
> >> I need to be able to take author data from a catalogue record and use it
> >> to look up the author on Wikipedia on the fly. So I may have birth date
> >> and possibly year of death in addition to (one spelling of) the name,
> >> the title of one book the author wrote etc.
> >>
> >> I know there are various efforts in progress that will improve the
> >> current situation, but as things stand at the moment what is the best*
> >> way to do this?
> >>
> >> 1. query wikipedia for as much as possible, parse and select the best
> >> fitting result
> >>
> >> 2. go via dbpedia/freebase and work back from there
> >>
> >> 3. use VIAF and/or OCLC services
> >>
> >> 4. Other?
> >>
> >> (I have no experience of 2-4 yet :-(
> >>
> >>
> >> Thanks
> >> Graham
> >> * 'best' being constrained by:
> >> - need to do this in real-time
> >> - need to avoid dependence on services which may be taken away
> >> or charged for
> >> - being able to justify to librarians as reasonably accurate :-)
> >>
> *
>
*
--
ya'aqovZISO | [log in to unmask] | 856 217 3456
*
|