Are you aware that there is an existing, yet embryonic, FLOSS project
called FreeCite?
http://www.freecite.org/
Mark Matienzo
Applications Developer, NYPL Labs
The New York Public Library
On Fri, Sep 12, 2008 at 2:25 PM, jean rainwater wrote:
> Please help us beta test "FreeCite", a new citation parser for
> non-structured bibliographic data. FreeCite is the result of
> collaboration between the Brown University Library and Public Display,
> a Providence-based software company founded by and employing many
> Brown grads. Public Display's core business is information
> extraction. Partial funding for this project was provided by the
> Andrew W. Mellon Foundation.
>
> FreeCite is implemented in Ruby on Rails and uses the CRF++ library
> implementation of conditional random fields. The model is trained on
> the CORA dataset with lexical augmentation from the Directory of
> Research and Researchers at Brown (DRR-B). The API and code are
> available at: http://freecite.library.brown.edu.
>
> Jean Rainwater
> Co-Leader, Integrated Technology Services
> Brown University Library
> Providence, RI 02912
> 401.863.9031
>