On 10/2/12 8:44 AM, Paul Orkiszewski wrote:
> Hi 4libers,
>
> Does anyone know of something - a kiosk, an iPad app, a web application
> - that:
I don't know of anything like it out there, but let's look at what it
might take. I've done some software work in connection with Harvard's
Iranian Oral History Project.
> - Initiates an oral history interview by getting demographic info and
> permission to use and stream for scholarly purposes.
I'm not sure what you're saying here. It sounds as if you're talking
about automated correspondence with the sources. That would be a huge
project in itself, so I assume you've got something more narrowly
focused in mind.
> - Goes through a standard set of questions (in our case stuff about the
> Appalachian State experience)
There are two pieces to this: Recording the responses and storing the
relevant metadata. The recording probably shouldn't be tied to a
specific device or application, since field work can involve a lot of
different conditions. The researcher in the field would want something
to enter the metadata (who, what, when, where); this would be a
straightforward piece.
> - Stores the metadata, permissions release, and pointers to the audio
> files created for each question in a dbase record
You don't say what the scope of the work is; from the way you're putting
the questions, I'm assuming it's a small-scale project with one
researcher doing the interviews and putting the information together.
Even so, It's probably best to have the field work be a separate
application from assembling the information in the database. If nothing
else, once you're at this point there's more standard software that can
be used.
> - Processes the audio through speech recognition either in real time or
> post-interview, and populates the dbase record with rendered text (at
> whatever level of accuracy)
You could do this piece with Dragon; see this post for some discussion:
http://www.nuance.com/dragon/transcription-solutions/index.htm
A friend of mine is an expert in this area and might be able to answer
some questions.
> - Provide a search interface, where the meatadata, demographic info
> (within reasonable privacy limits), and the transcript (however garbled)
> is searchable.
I'd suggest basing something on Apache Lucene.
> - Crowd source the improvement of the transcriptions over time
This needs to be better specified. One solution is to put the text onto
a wiki. If you're talking about integrating it into the application that
does all the rest, it could get messy.
> - Package the interface as an app, and set up a machine image on Amazon
> EC2, such that when someone uses the image and points a browser to it,
> it goes through a set up routine so that smaller schools and historical
> societies can set up their own sites in the cloud. I haven't tried
> streaming on a free tier EC2 server, but you get 30 GB of storage, so
> you could get a fair number of hours of audio (depending on the
> settings) before you have to start paying.
This, I assume, is why you're talking about treating the whole thing as
a single application. Putting it all together would be a huge chunk of
work. Dragon's software isn't free, and I don't know of anything for
free that does decent speech transcription, so that would be a stumbling
block to making it available to other institutions.
>
> ?
>
> Anyone interested in trying it with me if there's nothing already out
> there? I'm leaning toward iPad, so we'd need iOS, server admin, dbase,
> and media expertise. I have newbie-but-getting-better skill in the last
> 3. Zero skill in iOS.
I'm available for freelance work and it sounds very interesting, but
you've just outlined a huge project that would be a significant burden
even for the LoC's resources. That's not to say it can't be useful as a
blue-sky starting point for something more reasonable. If you have
funding, let's talk off-list. If you just want to continue blue-skying
the idea for a while, I'm glad to continue on-list (and I promise not to
bill you for that :).
--
Gary McGath, Professional Software Developer [log in to unmask]
|