LISTSERV 16.5 - CODE4LIB Archives

Hi Charles,

I'm not familiar with any on that list, but you may be interested in
Whisper: https://github.com/openai/whisper

Rather than a traditional "pipeline" ASR software, this model utilizes
machine learning. It was built by OpenAI, the folks who made ChatGPT. I've
been impressed by it.

   1. It runs on local hardware, no connection to an exterior server needed
   2. Able to translate speech to text in real time. (Kaldi, the topmost
   example on your list has been noted as being "several times slower":
   https://deepgram.com/learn/benchmarking-top-open-source-speech-models#kaldi-gigaspeech-xl
   )
   3. Open Source software licensed under the MIT license, so it can be
   used & modified for free in private or commercial settings.
   4. I know it runs on Windows 10 (see the setup section on the github
   page), and it should be compatible with Linux systems, using an installer
   like Anaconda:
   https://www.linuxlinks.com/machine-learning-linux-whisper-automatic-speech-recognition-system/


Best of luck with your project!
- Kaleb A (Langara LIT Student)

On Fri, Dec 8, 2023 at 1:28 PM charles meyer <[log in to unmask]> wrote:

> My esteeme listmates,
>
> Has anyone used any of these in Windows 10 or any Linus distro?
>
>
> https://www.ubuntupit.com/best-open-source-speech-recognition-tools-for-linux/
>
> Thank you,
>
> Charles.
>
> Charlotte County Public Library
>