OpenAI Whisper - MultiLingual AI Speech Recognition Live App Tutorial

OpenAI Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. Whisper works with many low-resource languages, including Tamil, Hindi, Telugu, and Malayalam.
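
For context, a minimal sketch of how the openai-whisper Python package is typically used for transcription; the model size, file name, and language code below are placeholders, not values from the video:

# pip install -U openai-whisper   (ffmpeg must also be installed on the system)
import whisper

# "small" is a placeholder; larger checkpoints ("medium", "large") are more
# accurate but need more GPU memory.
model = whisper.load_model("small")

# transcribe() works through long audio in 30-second windows; leave language
# unset for auto-detection, or pass a code such as "ta" for Tamil.
result = model.transcribe("sample_audio.mp3", language="ta")
print(result["text"])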

Comments

Golden Content! Just started working on a project and this is a very helpful resource to implement. Thank you!

chaithanyavamshi

Hi and thank you! I find your content so inspiring! Definitely trying this app.

fedahumada

Love the channel, you should have many more subs! ❤

concretecw

Best content! Thanks.
Can we calculate a confidence interval for each transcribed word?

TejasNarola-utci

Kudos to you if you prepared the Colab files!

byGDur

To run the OpenAI Whisper large model, how does the RTX 4090 compare to this AWS setup: an NVIDIA A10G Tensor Core GPU on a g5.xlarge with 16 GB RAM? Can I expect faster or slower transcription with the 4090?

georgepatronus

Bro, really amazing content, hats off to you!

gowthamdora

May I ask: once the web demo with a basic Gradio UI is done, how can we migrate it to a proper standalone web app? Could you please guide us a little?

appstuff
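
On the standalone web app question above: a Gradio demo written in a notebook can also run as a plain Python script that serves its own web UI, which can then sit behind a reverse proxy or be deployed to a host such as Hugging Face Spaces. A minimal sketch, assuming openai-whisper and gradio are installed (component parameter names vary between Gradio versions):

# app.py -- hypothetical standalone version of the notebook demo
import gradio as gr
import whisper

model = whisper.load_model("small")  # placeholder model size

def transcribe(audio_path):
    # Gradio hands the uploaded or recorded audio to this function as a file path
    return model.transcribe(audio_path)["text"]

demo = gr.Interface(
    fn=transcribe,
    inputs=gr.Audio(type="filepath"),
    outputs="text",
)

if __name__ == "__main__":
    # Serves the UI at http://localhost:7860
    demo.launch(server_name="0.0.0.0", server_port=7860)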

Hey bro, awesome! What accuracy does this STT have for Tanglish, i.e. Tamil + English?

ChetanGJ

This is a great demo, thank you!

I am new to programming. Can our local machines handle this, or should we do it in Google Colab?

flawedthoughts

Can it do real-time transcription instead of processing an audio file?

abhignaconscience

Can the RTX 4090 run the OpenAI Whisper large model well on a 12th-gen i9 machine with a 1 TB NVMe SSD and 64 GB of DDR5 RAM?

georgepatronus

Hello, the video is really helpful for me. I am trying to build ASR for Sanskrit, but it is not working for that language. Could you help me with how to train on Sanskrit data, or point me to any videos that will help me build a Sanskrit ASR? I have parallel Sanskrit data.

tapanray

Hello, thank you so much for your tutorial. I am trying to use Whisper for my master's thesis in translation technologies. The only issue I had was that after importing Gradio and recording a short audio clip live for Whisper to transcribe, it doesn't work; it just keeps loading forever, even when it's only a 6-second clip. What do you suggest I do? Thank you again from Spain!

annaacedoortega

I would recommend Streamlit for building the front-end interface.

raydenx
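
A minimal sketch of what that Streamlit front end could look like, assuming openai-whisper and a recent Streamlit (with st.cache_resource) are installed; the model size and accepted file types below are placeholders:

# streamlit_app.py -- run with: streamlit run streamlit_app.py
import tempfile

import streamlit as st
import whisper

st.title("Whisper transcription demo")

@st.cache_resource
def load_model():
    return whisper.load_model("small")  # placeholder model size

uploaded = st.file_uploader("Upload an audio file", type=["mp3", "wav", "m4a"])
if uploaded is not None:
    # Whisper expects a path on disk, so write the upload to a temporary file first
    suffix = "." + uploaded.name.rsplit(".", 1)[-1]
    with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
        tmp.write(uploaded.read())
        audio_path = tmp.name
    st.write(load_model().transcribe(audio_path)["text"])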

What happens if the audio clip is longer than 30 seconds???

antonkal

How can you have it process multilingual audio?

laylabitar

!pip install gradio -q

This code shows me an error:

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
spacy 3.7.4 requires typer<0.10.0,>=0.3.0, but you have typer 0.12.3 which is incompatible.
weasel 0.3.4 requires typer<0.10.0,>=0.3.0, but you have typer 0.12.3 which is incompatible.

Now what do I do? Please reply fast.

ufzomqk
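
On the pip message above: in Colab that resolver output is usually a warning about pre-installed packages (spacy, weasel) rather than a failed gradio install, so the first thing worth checking is simply whether gradio imports. A quick check, under that assumption:

# If this import fails, restarting the Colab runtime after the install often helps.
import gradio as gr
print(gr.__version__)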

Very cool. Can we use OpenAI Whisper for IVR telephony? It would need to address clients in multiple languages, like Hindi, Telugu, Malayalam, Tamil, and English, and respond accordingly.

GeorgeMathew

Thank you for the tutorial.

When I tried to step through your Gradio app, I got errors when trying to import your audio clips.
When I disconnected and copied your code to my own Google Drive, I was able to at least record audio with my own microphone and see Whisper transcribe up to 30 seconds.

chrontexto