Real-time Speech Recognition in 15 minutes with AssemblyAI

preview_player
Показать описание
Get your free speech-to-text API token 👇

Transcribing in real-time is a super skill only court reporters can brag about. But luckily, we don’t need to learn how to type fast to get transcriptions of audio quickly. Thanks to Assembly AI’s Streaming Speech-to-Text model (previously real-time speech recognition), it is very simple to set up a python script that can listen for audio and turn it to text.

In this video, we will see how to create this script on Python with the help of pyaudio, web sockets and asynchronous functions. The app will have the power to listen to audio input through a microphone and display the transcription in real-time. We will integrate this code into a simple Streamlit application to showcase the real-time speech recognition with a touch of interactivity.

Рекомендации по теме
Комментарии
Автор

Is there a JavaScript version of this video please? Both service side and front end…

pjayo
Автор

Most instructional and useful video. Thank you.

lfmtube
Автор

The accuracy and formatting is quite interesting here.

saifullahkhan
Автор

Very sympathic and engaging presentation. Maybe the most beautiful side of Artificial Intelligence 😊

otomakannioc
Автор

How can I use this for transcribing particular chrome tabs for online meetings? Can you help me with that?

ashiqashervegar
Автор

How can I introduce um, ahs, and other filler words in to the Realtime transcription? I see it is an option for uploading an audio file

HomelessRafi
Автор

Very kewl it works. However for the live service a $8 minimum is required. - but totally worth it

slimyelow
Автор

I've this installed and worked on windows. My question is that Real-time Speech Recognition only recognize english ? Does it support other languages ? Thank you.

lookersky
Автор

when i tried to install pyaudio terminal displays this error: "Cannot open include file: 'Python.h': No such file or directory "

adhikesavan
Автор

Very nice and easy setup! Took me ages to achieve the same with whisper. However, are you planning to support other languages, german, french or even a multilinugal model?

MrThought
Автор

Incredible video. Would I be able to use chatGPT, as an intermediate, to correct the sentences based on vocabulary and grammar, and after that, get the response from that chatGPT "reviewed" sentences?

claudiotassis
Автор

I want to implement a realtime app using voice recognition on python 3 / android 11 on my samsung s22. It's my understanding portaudio is NOT supported on Android 11. Is portaudio your only way to get to the mic?

weebiesoftware
Автор

This is really awesome! I would prefer though to see the final result as a short highlights at the beginning of your videos before you go into the details of how to. But thanks anyway for the effort and the time! Hugs!

Asparuh.Emilov
Автор

As of now, only the English language is supported in the live transcription feature. when can we expect more language support ?

kshitizyadav
Автор

Does it also offer diarization in real time?

fahnub
Автор

Could you please instruct me on how I can include this in my Angular project?

PoojaVerma-slmg
Автор

Can you please let me know how can I integrate this in my JAVA Maven project?

moncefarajdal
Автор

hi great works but what about smartphone ?

usus
Автор

is it possible to transcribe in real-time in other languages, such as spanish?

Miguel-hqlx
Автор

I know this is a little old, but I get a 4002 error, what might cause that?

spinal_cord