Not a huge set, but I will offer up our BibApp data, It should mostly meet your requirements and is already in VIVO form (and other flavors) for you.

Contact me off-list of you have questions.

Jason Stirnaman


------ Original message ------
From: Paul Albert
Date: 7/9/2013 10:33 AM
To: [log in to unmask];
Subject:[CODE4LIB] Anyone have access to well-disambiguated sets of publication data?

I am exploring methods for author disambiguation, and I would like to have access to one or more set of well-disambiguated data set containing:
– a unique author identifier (email address, institutional identifier)
– a unique article identifier (PMID, DOI, etc.)
– a unique journal identifier (ISSN)

Definition for "well-disambiguated" – for a given set of authors, you know the identity of their journal articles to a precision and recall of greater than 90-95%.

Any ideas?


Paul Albert
Project Manager, VIVO
Weill Cornell Medical Library