LISTSERV 16.5 - CODE4LIB Archives

That's cool!  I created an entry for T-PEN in FOSS4Lib (http://foss4lib.org/package/t-pen) so others can more easily find it.  (Jim: I also had the FOSS4Lib site send you a login id/password so you can go in and update the T-PEN entry in case I got anything wrong.)

Thanks for the self-promotion!


Peter

On Mar 12, 2013, at 3:10 PM, James Ginther <[log in to unmask]> wrote:
> At the risk of shameless self-promotion, I would suggest an alternative to
> the attempt at using OCR for handwriting. My field of research focuses on
> pre-modern manuscripts which, to no one's surprise, have resisted any OCR
> method.  One solution is to create an environment that makes transcribing
> an effective and efficient task. To that end, here at Saint Louis
> University, we built a web-based app called T-PEN.  T-PEN attempts to
> identify the location of each line on a digital surrogate and then displays
> it with a text box underneath to ensure accurate transcription.
> 
> The URL  is t-pen.org. It's free for anyone. In addition to the
> repositories that have given us access, users can upload private images to
> work with.
> 
> I know that this solution is not ideal for large sets of handwritten texts,
> but T-PEN does support crowd-sourcing (what we call public projects).  You
> can also encode as you transcribe and then export the transcription as an
> XML document (and you can even export  transcriptions in OAC currently as
> RDF/XML).
> 
> There is introductory video at
> http://www.youtube.com/watch?feature=player_embedded&v=_81fJbOpTcE.
> 
> Jim
> 
> 
> 
> On Tue, Mar 12, 2013 at 2:00 PM, Kyle Banerjee <[log in to unmask]>wrote:
> 
>> If it's for a discrete project, I'd say scan what you need OCR'd and put it
>> on Mechanical Turk
>> 
>> kyle
>> 
>> 
>> On Tue, Mar 12, 2013 at 10:56 AM, Donna Campbell <[log in to unmask]>
>> wrote:
>> 
>>> On a related note, I am looking for a recommendation for software that
>>> provides OCR for handwriting (print and/or cursive). To clarify, this
>>> would be pen ink on paper not digital ink.
>>> 
>>> Thank you,
>>> Donna R. Campbell
>>> Technical Services & Systems Librarian
>>> (215) 935-3872 (phone)
>>> (267) 295-3641 (fax)
>>> Mailing Address (via USPS):
>>> Westminster Theological Seminary Library
>>> P.O. Box 27009
>>> Philadelphia, PA 19118  USA
>>> Shipping Address (via UPS or FedEx):
>>> Westminster Theological Seminary Library
>>> 2960 W. Church Rd.
>>> Glenside, PA 19038  USA
>>> 
>>> -----Original Message-----
>>> From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
>>> Eric Lease Morgan
>>> Sent: Tuesday, March 12, 2013 11:57 AM
>>> To: [log in to unmask]
>>> Subject: [CODE4LIB] web-based ocr
>>> 
>>> Does anybody here know of a Web-based OCR program or Web service?
>>> 
>>> Many people want to do OCR against digitized texts. We all know of
>> various
>>> OCR applications (Adobe Acrobat, ABBYY FineReader, Google's Tesseract,
>>> etc.), but they are not necessarily Web-based. As a service to my
>>> university, I thought it might be cool (or "kewl") to support an image to
>>> text application. Go to Web form. Submit one or more image files. Have
>> OCR
>>> done against them no matter how dirty the output. Return plain text. As a
>>> bonus, the application would support a REST-ful API.
>>> 
>>> Does anybody know of something like this that exists already?
>>> 
>>> --
>>> Eric Lease Morgan
>>> University of Notre Dame



--
Peter Murray
Assistant Director, Technology Services Development
LYRASIS
[log in to unmask]
+1 678-235-2955
800.999.8558 x2955