I haven't tried it yet, but the Open AI foundation recently released an open source neural net called Whisper for transcribing/translating English audio. Here's the page: https://openai.com/blog/whisper/ And here's an article about it: https://arstechnica.com/information-technology/2022/09/new-ai-model-from-openai-automatically-recognizes-speech-and-translates-to-english/ I haven't experimented with it yet, but I'm kind of interested to try it. Will -----Original Message----- From: Code for Libraries <[log in to unmask]> On Behalf Of Dan Johnson Sent: Friday, October 21, 2022 1:21 PM To: [log in to unmask] Subject: Re: [CODE4LIB] video to text Thank you, Peter Murray, for the fascinating AWS Transcribe writeup. In case someone is interested in going down that route, I did have success, some years ago, taking a JSON file someone else had generated from AWS Transcribe, and converting it into a very readable .docx. It requires only a simple two-line Python script with the tscribe library. Information here: https://github.com/kibaffo33/aws_transcribe_to_docx On Fri, Oct 21, 2022 at 2:10 PM Peter Murray < [log in to unmask]> wrote: > I did something like this last month for creating transcripts from > podcasts using Amazon Transcribe. Details and links to the code here: > https://dltj.org/article/generating-podcast-transcripts/ > > > Peter > > From: Dan Johnson <[log in to unmask]> <[log in to unmask]> > Reply: Code for Libraries <[log in to unmask]> > <[log in to unmask]> > Date: October 21, 2022 at 2:01:30 PM > To: [log in to unmask] <[log in to unmask]> > <[log in to unmask]> > Subject: Re: [CODE4LIB] video to text > > If your university gives you an Office 365 account (and Notre Dame > does), Word 365 will transcribe up to 300 minutes of audio per month > from a sound file in the .wav, .mp4, .mpa, or .mp3 formats. In my own > (admittedly minor) tinkering, I've been surprised at how good the > transcript is. Microsoft has a 90 second tutorial here: < > > https://support.microsoft.com/en-us/office/transcribe-your-recordings- > 7fc2efec-245e-45f0-b053-2a97531ecf57 > >. > > If you're handy with AWS, you can also use Amazon Transcribe ( > https://aws.amazon.com/transcribe/), but that is much more involved. I > have no experience myself, though some colleagues have had success > with larger projects there. > > Best, > Dan > > > On Fri, Oct 21, 2022 at 1:58 PM Lolis, John <[log in to unmask]> > wrote: > > > I don't have technology to offer for that purpose, but if you decide > > to > go > > with a service, I can tell you that I've found Amara to be very > affordable, > > of excellent quality and fantastic customer service. I used them to > > not only caption videos but to also translate them from several > > languages. I couldn't have asked for a better experience with them, > > and that was after some back and forth working things out over the extra languages. > > > > https://amara.org/ > > > > John Lolis > > Coordinator of Computer Systems > > > > 100 Martine Avenue > > White Plains, NY 10601 > > > > tel: 1.914.422.1497 > > fax: 1.914.422.1452 > > > > https://whiteplainslibrary.org/ > > > > *“I would rather have questions that can’t be answered than answers > > that can’t be questioned.”* — Richard Feynman < > > > > https://click.fourhourmail.com/5qure95xkf7hvvo93wh2/7qh7h8h05vr4zrtz/a > HR0cHM6Ly9lbi53aWtpcGVkaWEub3JnL3dpa2kvUmljaGFyZF9GZXlubWFu > > >, > > theoretical physicist and recipient of the Nobel Prize in Physics in > > 1965 > > > > > > On Fri, 21 Oct 2022 at 13:20, Eric Lease Morgan <[log in to unmask]> wrote: > > > > > Do you know of a video to text applications? I colleague asked me: > > > > > > I have four video recordings of conference sessions and wonder if > > > there is a tool or technology that will help me transcribe these > > > into the written word? > > > > > > Do y'all have any suggestions or experience in this regard? > > > > > > -- > > > Eric Morgan > > > University of Notre Dame > > > > > > > > -- > *Daniel Johnson, Ph.D.* > *English; Digital Humanities**; and Film, Television, and Theatre * > *Librarian* > *Navari Family Center for Digital Scholarship, **Hesburgh Libraries* > > *University of Notre Dame* > 250C Hesburgh Library > Notre Dame, IN 46556 > o: 574-631-3457 > e: [log in to unmask] > -- *Daniel Johnson, Ph.D.* *English; Digital Humanities**; and Film, Television, and Theatre * *Librarian* *Navari Family Center for Digital Scholarship, **Hesburgh Libraries* *University of Notre Dame* 250C Hesburgh Library Notre Dame, IN 46556 o: 574-631-3457 e: [log in to unmask]