We have a faculty member looking to create a dataset from an annotated
bibliography she compiled. Right now it exists as a Word file and as a PDF.
The entries are relatively structured with a citation and an abstract, but
the document is about 150 pages long with multiple entries per page. Rather
than manually copying and pasting everything into a spreadsheet/CSV, I
wanted to ask for suggestions on approaches to scraping or extracting
structured data from the PDF. Thanks very much in advance.
Digital Scholarship Technology Manager
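
To make the task concrete, here is a minimal sketch of one possible approach, not a tested solution for this particular document. It assumes the Word file has first been saved as plain text, that entries are separated by blank lines, and that the first line of each entry is the citation with the remaining lines forming the abstract. The function names (parse_entries, entries_to_csv) and the column names are hypothetical, and the real bibliography may need different splitting rules.

```python
import csv
import io
import re


def parse_entries(text):
    """Split plain text into citation/abstract records.

    Assumption: entries are separated by one or more blank lines, and
    the first non-empty line of each entry is the citation.
    """
    entries = []
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        if not lines:
            continue  # skip empty blocks (e.g. empty input)
        entries.append({
            "citation": lines[0],
            # Join any wrapped abstract lines into a single string.
            "abstract": " ".join(lines[1:]),
        })
    return entries


def entries_to_csv(entries):
    """Serialize the records to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["citation", "abstract"])
    writer.writeheader()
    writer.writerows(entries)
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up sample text standing in for the exported bibliography.
    sample = (
        "Smith, J. (2001). A title.\n"
        "This abstract says things.\n"
        "\n"
        "Doe, A. (2010). Another title.\n"
        "Another abstract\n"
        "continues here.\n"
    )
    print(entries_to_csv(parse_entries(sample)))
```

Starting from the Word file rather than the PDF is a deliberate choice in this sketch, since text exported from Word usually keeps paragraph boundaries intact, while text pulled from a PDF often loses them. Spot-checking the resulting rows against the original document would still be needed.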