On Jun 5, 2015, at 8:10 AM, Eric Lease Morgan <[log in to unmask]> wrote: > Does anybody here have experience reading the SGML/XML files representing the content of EEBO? I ultimately found the EEBO files in the form of TEI, and then I was able to transform one of them into VERY functional HTML5. Coolness! Here’s the recipe: 1. download P5 from Box [1] 2. download stylesheets from GitHub [2] 3. transform using Saxon [3] 4. save output to HTTP server 5. open in browser [4] 6. read results AND get scanned image Nice clean data + fully functional stylesheets = really cool output [1] P5 - http://bit.ly/1QcvxLP [2] stylesheets - https://github.com/TEIC/Stylesheets [3] transform - java -cp saxon9he.jar net.sf.saxon.Transform -t -s:/var/www/html/sandbox/eebo-tcp/xml/A0/A06567.xml -xsl:/var/www/html/sandbox/eebo-tcp/style/html5/html5.xsl > /var/www/html/tmp/eebo.html [4] output - http://dh.crc.nd.edu/tmp/eebo.html — ELM