I was wondering if anyone has worked with the Elsevier EFFECT41 datasets (http://info.sciencedirect.com/techsupport/sdos/effect41.pdf)?
I've written a parser which mostly works fine, but then I discovered their notion of "continuation lines" (i.e. any line which does not start with a "tag" is a continuation of the previous line). I could continue working on what I've got and try to figure out how to handle the continuation lines, but I was hoping that someone might already have written something that joins them. Please follow up to [log in to unmask]
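
For reference, the kind of pre-processing step I have in mind could look roughly like the sketch below. It is only an illustration under assumptions: the function name join_continuation_lines and the tag pattern TAG_RE (an underscore followed by two alphanumeric characters at the start of the line) are hypothetical and would need to be checked against the EFFECT specification. The idea is to fold each untagged line into the preceding tagged line before the real parser sees the data.

```python
import re

# ASSUMPTION: a "tag" is an underscore plus two alphanumeric characters at the
# start of a line (e.g. "_t1"). This pattern is hypothetical; verify it against
# the EFFECT specification and adjust as needed.
TAG_RE = re.compile(r"^_[A-Za-z0-9]{2}(\s|$)")


def join_continuation_lines(lines, is_tag=TAG_RE.match, sep=" "):
    """Yield logical lines: each tagged line with its continuation lines appended."""
    current = None
    for raw in lines:
        line = raw.rstrip("\r\n")
        if is_tag(line) or current is None:
            # A tagged line (or the very first line) starts a new logical line.
            if current is not None:
                yield current
            current = line
        elif line.strip():
            # An untagged, non-blank line continues the previous logical line.
            current += sep + line.strip()
        # Blank untagged lines are skipped (a simplifying assumption).
    if current is not None:
        yield current


if __name__ == "__main__":
    sample = ["_t1 A title that\n", "runs over two lines\n", "_au Smith\n"]
    for logical_line in join_continuation_lines(sample):
        print(logical_line)
```

Because it is a generator over any iterable of lines, it can wrap an open file object directly, and the existing tag-by-tag parser would not need to know about continuation lines at all.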