On Jun 5, 2015, at 8:10 AM, Eric Lease Morgan <[log in to unmask]> wrote:
> Does anybody here have experience reading the SGML/XML files representing the content of EEBO?
I ultimately found the EEBO files in the form of TEI, and then I was able to transform one of them into VERY functional HTML5. Coolness! Here’s the recipe:
1. download P5 from Box [1]
2. download stylesheets from GitHub [2]
3. transform using Saxon [3]
4. save output to HTTP server
5. open in browser [4]
6. read results AND get scanned image
Nice clean data + fully functional stylesheets = really cool output
[1] P5 - http://bit.ly/1QcvxLP
[2] stylesheets - https://github.com/TEIC/Stylesheets
[3] transform - java -cp saxon9he.jar net.sf.saxon.Transform -t -s:/var/www/html/sandbox/eebo-tcp/xml/A0/A06567.xml -xsl:/var/www/html/sandbox/eebo-tcp/style/html5/html5.xsl > /var/www/html/tmp/eebo.html
[4] output - http://dh.crc.nd.edu/tmp/eebo.html
—
ELM
|