I have created and made available a newer version of my HathiTrust Research Center Workset Browser, a tool to do “distant” and “scalable” reading against content downloadable from the HathiTrust: https://github.com/ndlib/text-analysis-htrc The tool is designed specifically to work Research Center “data capsules”, and like all software, it is never done. Give it a whirl? — Eric Morgan