That's cool! I created an entry for T-PEN in FOSS4Lib (http://foss4lib.org/package/t-pen) so others can more easily find it. (Jim: I also had the FOSS4Lib site send you a login id/password so you can go in and update the T-PEN entry in case I got anything wrong.)
Thanks for the self-promotion!
Peter
On Mar 12, 2013, at 3:10 PM, James Ginther <[log in to unmask]> wrote:
> At the risk of shameless self-promotion, I would suggest an alternative to
> the attempt at using OCR for handwriting. My field of research focuses on
> pre-modern manuscripts which, to no one's surprise, have resisted any OCR
> method. One solution is to create an environment that makes transcribing
> an effective and efficient task. To that end, here at Saint Louis
> University, we built a web-based app called T-PEN. T-PEN attempts to
> identify the location of each line on a digital surrogate and then displays
> it with a text box underneath to ensure accurate transcription.
>
> The URL is t-pen.org. It's free for anyone. In addition to the
> repositories that have given us access, users can upload private images to
> work with.
>
> I know that this solution is not ideal for large sets of handwritten texts,
> but T-PEN does support crowd-sourcing (what we call public projects). You
> can also encode as you transcribe and then export the transcription as an
> XML document (and you can even export transcriptions in OAC currently as
> RDF/XML).
>
> There is introductory video at
> http://www.youtube.com/watch?feature=player_embedded&v=_81fJbOpTcE.
>
> Jim
>
>
>
> On Tue, Mar 12, 2013 at 2:00 PM, Kyle Banerjee <[log in to unmask]>wrote:
>
>> If it's for a discrete project, I'd say scan what you need OCR'd and put it
>> on Mechanical Turk
>>
>> kyle
>>
>>
>> On Tue, Mar 12, 2013 at 10:56 AM, Donna Campbell <[log in to unmask]>
>> wrote:
>>
>>> On a related note, I am looking for a recommendation for software that
>>> provides OCR for handwriting (print and/or cursive). To clarify, this
>>> would be pen ink on paper not digital ink.
>>>
>>> Thank you,
>>> Donna R. Campbell
>>> Technical Services & Systems Librarian
>>> (215) 935-3872 (phone)
>>> (267) 295-3641 (fax)
>>> Mailing Address (via USPS):
>>> Westminster Theological Seminary Library
>>> P.O. Box 27009
>>> Philadelphia, PA 19118 USA
>>> Shipping Address (via UPS or FedEx):
>>> Westminster Theological Seminary Library
>>> 2960 W. Church Rd.
>>> Glenside, PA 19038 USA
>>>
>>> -----Original Message-----
>>> From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of
>>> Eric Lease Morgan
>>> Sent: Tuesday, March 12, 2013 11:57 AM
>>> To: [log in to unmask]
>>> Subject: [CODE4LIB] web-based ocr
>>>
>>> Does anybody here know of a Web-based OCR program or Web service?
>>>
>>> Many people want to do OCR against digitized texts. We all know of
>> various
>>> OCR applications (Adobe Acrobat, ABBYY FineReader, Google's Tesseract,
>>> etc.), but they are not necessarily Web-based. As a service to my
>>> university, I thought it might be cool (or "kewl") to support an image to
>>> text application. Go to Web form. Submit one or more image files. Have
>> OCR
>>> done against them no matter how dirty the output. Return plain text. As a
>>> bonus, the application would support a REST-ful API.
>>>
>>> Does anybody know of something like this that exists already?
>>>
>>> --
>>> Eric Lease Morgan
>>> University of Notre Dame
--
Peter Murray
Assistant Director, Technology Services Development
LYRASIS
[log in to unmask]
+1 678-235-2955
800.999.8558 x2955
|