Are you aware that there is an existing, yet embryonic, FLOSS project called FreeCite? http://www.freecite.org/ Mark Matienzo Applications Developer, NYPL Labs The New York Public Library On Fri, Sep 12, 2008 at 2:25 PM, jean rainwater <[log in to unmask]> wrote: > Please help us beta test "FreeCite", a new citation parser for > non-structured bibliographic data. FreeCite is the result of > collaboration between the Brown University Library and Public Display, > a Providence-based software company founded by and employing many > Brown grads. Public Display's core business is information > extraction. Partial funding for this project was provided by the > Andrew W. Mellon Foundation. > > FreeCite is implemented in Ruby on Rails and uses the CRF++ library > implementation of conditional random fields. The model is trained on > the CORA dataset with lexical augmentation from the Directory of > Research and Researchers at Brown (DRR-B). The API and code are > available at: http://freecite.library.brown.edu. > > Jean Rainwater > Co-Leader, Integrated Technology Services > Brown University Library > Providence, RI 02912 > 401.863.9031 > [log in to unmask] >