Do you know of any researcher or scholar in the realm of public health or medicine who may need or want to read the flood of scholarship being generated by COVID-19?
As you may or may not know, the Distant Reader is designed to read large amounts of narrative text, such as scholarly journal articles. The Gates Foundation, the Allen Institute for AI, and their friends have made freely available a data set of 13,000 full-text scholarly articles on the topic of COVID-19. [1]
I have downloaded the data set and fed it to the Reader, and the initial results are here:
https://carrels.distantreader.org/library/covid-19/
The results are okay, but they can be improved in a number of ways. For example, I can easily create a full-text (Solr) index of the data set. I can create a network diagram illustrating the relationship of a given word to other nearby words. I could also apply various types of machine learning to the Reader's output, such as topic modeling and classification, to look for patterns and anomalies.
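For what it is worth, the network diagram idea can be sketched in a few lines of Python. This is only an illustration under my own assumptions, not the Reader's actual code: the function names (nearby_words, edges), the simple tokenizer, and the window size are all hypothetical and made up for the example. It counts the words appearing within a few positions of a given word, and those counts become the weighted edges of a diagram:

```python
import re
from collections import Counter


def nearby_words(text, target, window=3):
    """Count the words found within `window` positions of `target`.

    Hypothetical helper for illustration; tokenization is deliberately naive.
    """
    # Lowercase and keep alphanumeric tokens, allowing internal hyphens
    # so that a term like "covid-19" stays in one piece.
    tokens = re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", text.lower())
    counts = Counter()
    for i, token in enumerate(tokens):
        if token != target:
            continue
        start = max(0, i - window)
        # Neighbors on both sides, excluding the target occurrence itself.
        for neighbor in tokens[start:i] + tokens[i + 1:i + window + 1]:
            counts[neighbor] += 1
    return counts


def edges(text, target, window=3, top=5):
    """Return (target, neighbor, weight) tuples for the most frequent neighbors."""
    ranked = nearby_words(text, target, window).most_common(top)
    return [(target, word, weight) for word, weight in ranked]
```

The tuples returned by edges() could then be handed to whatever graphing tool is at hand. In practice one would also want to drop stop words (the, and, of, etc.) before counting, since they tend to dominate the results.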
To do some of these things, additional resources may be needed, such as data processing power, data visualization skills, and some cyberinfrastructure. I have been in touch with my XSEDE colleagues at IU, and they seem more than amenable to helping, but the whole thing would be GREATLY improved and MUCH MORE relevant if we were working with somebody who has specific questions to answer -- somebody from the fields of public health, medicine, etc.
Do you know the names of anybody in public health, medicine, or some other discipline who might want to read -- use & understand -- the literature being generated?
Be safe.
[1] data set - https://pages.semanticscholar.org/coronavirus-research
--
Eric Morgan
University of Notre Dame