I'm working on the JISC KB+ project that Tom mentioned.
As part of the project we've been collating journal title lists from various sources. We've been working with members of the KBART steering group and have used KBART where possible, although we've also been collecting some data that KBART doesn't cover.
All the data we have at this level is published under a CC0 licence at http://www.kbplus.ac.uk/kbplus/publicExport - including a csv that uses the KBART data elements. The focus so far has been on packages negotiated by JISC in the UK - although in many cases the title lists may be the same as are made available in other markets. We also include what we call 'Master lists' which are an attempt to capture the complete list of titles and coverage offered by a content provider. We'd very much welcome any feedback on these exports, and of course be interested to know if anyone makes use of them.
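For anyone wanting to try a KBART-style title list programmatically, here is a minimal Python sketch. It assumes the column headers follow the KBART element names (publication_title, print_identifier, online_identifier, date_first_issue_online, date_last_issue_online). The sample rows are made up for illustration and are not taken from the KB+ export, so check the headers and delimiter of the real file first.

```python
import csv
import io

# Made-up sample data using KBART element names as headers.
# The real export's headers and delimiter may differ; check the file first.
SAMPLE = """publication_title,print_identifier,online_identifier,date_first_issue_online,date_last_issue_online
Journal of Examples,1234-5678,8765-4321,1995-01-01,
Annals of Samples,,2345-6789,2001-03-01,2010-12-31
"""

def read_titles(handle):
    """Yield one dict per title, keyed by the column headers."""
    for row in csv.DictReader(handle):
        # Strip stray whitespace from every value.
        yield {key: (value or "").strip() for key, value in row.items()}

def is_current(title):
    """KBART convention: a blank date_last_issue_online means coverage runs to the present."""
    return title["date_last_issue_online"] == ""

titles = list(read_titles(io.StringIO(SAMPLE)))
current = [t["publication_title"] for t in titles if is_current(t)]
print(current)
```

To run this against a downloaded file, replace io.StringIO(SAMPLE) with open(path, newline="", encoding="utf-8"), assuming the file is UTF-8.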
So far a lot of the work on collating/converting/standardising the data has been done by hand - which is clearly not ideal. In the next phase, the KB+ project is going to work with the GoKB project (http://gokb.org). As part of this collaboration we are working on ways of streamlining the processing of publisher files and other sources into standardised data. We are still working out how this will be implemented, but we are currently investigating the possibility of using Google/Open Refine to capture sets of rules and re-run them across data sets from specific sources. We should be making progress on this in the next couple of months.
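To make the idea of capturing a set of rules per source and re-running it a little more concrete, here is a minimal Python sketch. It is not how Open Refine stores or applies its operations, and it is not the KB+/GoKB implementation (which is still being worked out). The rule functions, the source name "example_publisher" and the sample row are all hypothetical.

```python
import html
import re

def unescape_entities(row):
    """Turn HTML entities such as &amp; back into plain characters."""
    return {k: html.unescape(v) for k, v in row.items()}

def collapse_whitespace(row):
    """Trim each value and squeeze runs of whitespace to a single space."""
    return {k: " ".join(v.split()) for k, v in row.items()}

def normalise_issn(row):
    """Rewrite an 8-character ISSN as NNNN-NNNN; leave anything else alone."""
    out = dict(row)
    for field in ("print_identifier", "online_identifier"):
        compact = re.sub(r"[\s-]", "", out.get(field, "")).upper()
        if re.fullmatch(r"\d{7}[\dX]", compact):
            out[field] = compact[:4] + "-" + compact[4:]
    return out

# Hypothetical: one ordered list of rules per data source, so the same
# clean-up can be re-run whenever that source supplies a new file.
RULES_BY_SOURCE = {
    "example_publisher": [unescape_entities, collapse_whitespace, normalise_issn],
}

def apply_rules(source, rows):
    """Apply the source's rules, in order, to every row."""
    rules = RULES_BY_SOURCE[source]
    cleaned = []
    for row in rows:
        for rule in rules:
            row = rule(row)
        cleaned.append(row)
    return cleaned

raw = [{
    "publication_title": "Science &amp;  Society ",
    "print_identifier": "12345678",
    "online_identifier": "",
}]
cleaned = apply_rules("example_publisher", raw)
print(cleaned)
```

Keeping each rule as a small function that takes a row and returns a new row means the list for a source can be reordered or extended without touching the other rules.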
Hope that's helpful
Owen
Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: [log in to unmask]
Telephone: 0121 288 6936
On 16 Oct 2012, at 20:23, Tom Pasley <[log in to unmask]> wrote:
> You might also be interested in the work at http://www.kbplus.ac.uk . The
> site is up at the moment, but I can't reach it for some reason... they have
> a public export page which you might want to know about
> http://www.kbplus.ac.uk/kbplus/publicExport
>
> Tom
>
> On Wed, Oct 17, 2012 at 8:12 AM, Jonathan Rochkind <[log in to unmask]> wrote:
>
>> I think KBART is such an effort. As with most library standards groups,
>> there may not be online documentation of their most recent efforts or
>> successes, but: http://www.uksg.org/kbart
>>
>> http://www.uksg.org/kbart/s5/guidelines/data_format
>>
>>
>>
>> On 10/16/2012 2:16 PM, Godmar Back wrote:
>>
>>> Hi,
>>>
>>> at our library, there's an emerging need to process title lists from
>>> vendors for various purposes, such as checking that the titles purchased
>>> can be discovered via discovery system and/or OPAC. It appears that the
>>> formats in which those lists are provided are non-uniform, as is the
>>> process of obtaining them.
>>>
>>> For example, one vendor - let's call them "Expedition Scrolls" - provides
>>> title lists for download to Excel, but which upon closer inspection turn
>>> out to be HTML tables. They are encoded using an odd mixture of CP1250 and
>>> HTML entities. Other vendors use entirely different formats.
>>>
>>> My question is whether there are efforts, software, or anything related to
>>> streamlining the acquisition and processing of vendor title lists in
>>> software systems that aid in the collection development and maintenance
>>> process. Any pointers would be appreciated.
>>>
>>> - Godmar
>>>
>>>
>>>