Print

Print


> A suggestion: you might want to also add Biblio-Citation-Parser

Added. Keep em coming!

On Tue, Sep 16, 2008 at 12:21 PM, Steve Oberg <[log in to unmask]> wrote:
> A suggestion: you might want to also add Biblio-Citation-Parser by Mike
> Jewell (http://search.cpan.org/~mjewell/Biblio-Citation-Parser-1.10/)<http://search.cpan.org/~mjewell/Biblio-Citation-Parser-1.10/>
> Steve
>
> On Tue, Sep 16, 2008 at 11:11 AM, Miriam Goldberg <[log in to unmask]>wrote:
>
>> Thanks for pointing out these other parsing tools. I've added them to
>> the list on our website (see under heading "Other Citation Tools" at
>> http://freecite.library.brown.edu/).
>>
>> Citation metadata extraction is a difficult open problem whose
>> potential solutions are based on continually-developing technologies.
>> So I think it's important that we approach this task from many diverse
>> angles. If our project makes a little headway here, ParsCit makes some
>> headway there, and five other groups make their own advancements,
>> hopefully we'll be able to pool our findings into a viable
>> application.
>>
>> > Anyone want to compare and contrast these three projects?  Might make a
>> good very
>> > short article/review for the Code4Lib Journal if you wanted to.
>>
>> Agreed. I'd love to see this. Another idea might be to write an
>> application that takes the output of multiple parsers and assembles
>> the best answer.
>>
>> On Fri, Sep 12, 2008 at 3:50 PM, Jonathan Rochkind <[log in to unmask]>
>> wrote:
>> > This is the third open source citation parser I know of now. A welcome
>> change from a year ago when I needed one and didn't know of any! But I can't
>> help but think maybe people should be cooperating more instead of
>> engineering their own wheels. Also curious if anyone has looked at all three
>> and can compare and contrast and make a reccommendation.
>> >
>> > The other two I know about are:
>> >
>> > ParsCit -- http://wing.comp.nus.edu.sg/parsCit/
>> > A CDL project I don't have a good home page for, but code is here:
>> http://gales.cdlib.org/~egh/hmm-citation-extractor/
>> >
>> > I've been keeping track because I have a use for this, although haven't
>> had time to make use of any of them yet.
>> >
>> > Anyone want to compare and contrast these three projects?  Might make a
>> good very short article/review for the Code4Lib Journal if you wanted to.
>> >
>> > Jonathan
>> >
>> >
>> >>>> jean rainwater <[log in to unmask]> 09/12/08 2:25 PM >>>
>> > Please help us beta test "FreeCite", a new citation parser for
>> > non-structured bibliographic data. FreeCite is the result of
>> > collaboration between the Brown University Library and Public Display,
>> > a Providence-based software company founded by and employing many
>> > Brown grads.  Public Display's core business is information
>> > extraction. Partial funding for this project was provided by the
>> > Andrew W. Mellon Foundation.
>> >
>> > FreeCite is implemented in Ruby on Rails and uses the CRF++ library
>> > implementation of conditional random fields. The model is trained on
>> > the CORA dataset  with lexical augmentation from the Directory of
>> > Research and Researchers at Brown (DRR-B). The API and code are
>> > available at: http://freecite.library.brown.edu.
>> >
>> > Jean Rainwater
>> > Co-Leader, Integrated Technology Services
>> > Brown University Library
>> > Providence, RI 02912
>> > 401.863.9031
>> > [log in to unmask]
>> >
>>
>