> On May 12, 2022, at 2:40 PM, Danielle Reay wrote:
> We have a faculty member looking to create a dataset from an annotated
> bibliography she compiled. Right now it exists as a Word file and as a PDF.
> The entries are relatively structured with a citation and an abstract, but
> the document is about 150 pages long with multiple entries per page. Rather
> than manually copy and paste everything to create the spreadsheet/CSV, I
> wanted to ask for suggestions or approaches to doing this by either
> scraping or extracting structured data from the PDF. Thanks very much in