Print

Print


On Jun 5, 2015, at 8:10 AM, Eric Lease Morgan <[log in to unmask]> wrote:

> Does anybody here have experience reading the SGML/XML files representing the content of EEBO? 

I ultimately found the EEBO files in the form of TEI, and then I was able to transform one of them into VERY functional HTML5. Coolness! Here’s the recipe:

 1. download P5 from Box [1]
 2. download stylesheets from GitHub [2]
 3. transform using Saxon [3]
 4. save output to HTTP server 
 5. open in browser [4]
 6. read results AND get scanned image

Nice clean data + fully functional stylesheets = really cool output

[1] P5 - http://bit.ly/1QcvxLP
[2] stylesheets - https://github.com/TEIC/Stylesheets
[3] transform - java -cp saxon9he.jar net.sf.saxon.Transform -t -s:/var/www/html/sandbox/eebo-tcp/xml/A0/A06567.xml -xsl:/var/www/html/sandbox/eebo-tcp/style/html5/html5.xsl > /var/www/html/tmp/eebo.html
[4] output - http://dh.crc.nd.edu/tmp/eebo.html

—
ELM