Real-Time Live Speech-to-Text | Streaming ASR Gradio App with Hugging Face Tutorial

preview_player
Показать описание
In this Applied NLP Tutorial, We'll learn how to build a Real-Time Automatic Speech Recognition powered by Facebooks Wav2Vec2 Deep Learning Model.

We'll learn to use Hugging Face Transformers Pipeline for Audio (Speech) to Text and Gradio for the Python Web app for live audio transcription.

Related NLP Tutorials -
Рекомендации по теме
Комментарии
Автор

வணக்கம் நண்பா, நான் இரண்டு நாட்களாக உங்கள் NLP காணோளிகளை கண்டு வருகிறேன். மிகவும் நன்றாக உள்ளது. I am recently get in to this NLP domain. your tutorials are awesome.

seankay
Автор

Great video! Which model do you suggest for translating Spanish audio to English text?

danielmoore
Автор

Awesome tutorial.

Please How can i create my own language translation for my local language?

actionmoviecabal
Автор

Please make videos on pyecharts in Jupiter notebook to create Dashboards since there are a few videos on that.

halkkoi
Автор

How can we improve the accuracy of transcription?

taarinidhulipala
Автор

Strange to "return state, state". Why twice?

mwd
Автор

Nice video, however, the performance (WER) of the real time asr using this gradio code is disappointing

cahyawirawan
Автор

Can I train my own data with that model?

fahieram
Автор

gr.inputs.Audio
not working ?? what to do please i am stuck

oo_anonymous
Автор

why its saying error 404 why? i run your code only that button is not showing off record

PressF
Автор

Respected sir it's working after recording the app it's working real-time. kindly provide a solution

ahmadjamil
Автор

Hi, its not working as yours, every time i need to stop the recording only then it transcribe

bhuvneshsaini
Автор

can we run this app on console without gradio on real time

ahmadchaudhary
Автор

can the same be implementated in streamlit

sibadattasasmal