Speech to Text - Real Time Streaming Transcription

#datascience #speechtotext #machinelearning

DeepSpeech is an open-source speech-to-text (voice recognition) system that uses a neural network to convert a speech spectrogram into a text transcript.

This speech recognition system was developed using end-to-end deep learning.

The acoustic models were trained on American English, and a language model improves the accuracy of the predicted transcripts.

The acoustic model determines the relationship between audio signals and the phonetic units of a language, while the language model matches sounds to words and word sequences.
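
As a rough illustration of the streaming idea in the title, the sketch below feeds audio to a recognizer chunk by chunk and collects the partial transcripts. The method names `createStream`, `feedAudioContent`, `intermediateDecode`, and `finishStream` follow the DeepSpeech (0.7 and later) Python streaming API; the helper names `iter_chunks` and `transcribe_stream` and the default chunk size are assumptions made for illustration and are not taken from the video.

```python
from typing import Iterator, List, Sequence


def iter_chunks(samples: Sequence[int], chunk_size: int) -> Iterator[Sequence[int]]:
    """Yield consecutive slices of at most chunk_size samples (hypothetical helper)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(samples), chunk_size):
        yield samples[start:start + chunk_size]


def transcribe_stream(model, samples: Sequence[int], chunk_size: int = 4096) -> List[str]:
    """Feed audio in chunks and return each partial transcript plus the final one.

    `model` is assumed to expose createStream(), as deepspeech.Model does.
    `samples` is assumed to be 16-bit mono audio at the model's sample rate.
    """
    stream = model.createStream()
    transcripts = []
    for chunk in iter_chunks(samples, chunk_size):
        stream.feedAudioContent(chunk)                    # push the next slice of audio
        transcripts.append(stream.intermediateDecode())   # best guess so far
    transcripts.append(stream.finishStream())             # final transcript, closes the stream
    return transcripts
```

With the `deepspeech` package installed, `model` would typically be a `deepspeech.Model` loaded from a `.pbmm` file and `samples` a NumPy `int16` array. For live input, the same loop could be fed from a microphone buffer instead of a pre-loaded array.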
Comments

I'm getting the error 'ERROR: Could not find a version that satisfies the requirement deepspeech'. Can you provide a solution for this?

hssp

I get this error: CreateModel failed with 'Failed to initialize memory mapped model.' (0x3000)

juanandreslopezcubides

Just a quick question. You are passing the entire audio at once. Will this function work for real-time streaming audio, like from a microphone? Just like how Android Speech works: tap on the mic and it listens in real time.

professorspawn

Hi, I'm working on a little project with VOSK ASR (a Kaldi model) and I have an issue: at first it works well, but over time it gets slower at transcribing words. I'm not talking about accuracy but about response time.
I'm thinking my Python while loop may be overloaded or something. Can you help me please? Thanks.

مولالشاش-ثص

I want to convert the audio from a browser into text.

AbdulWahab-mpvn

I could really use this, but with the option to print the phonemes instead of the actual text. Would this be possible?

lenover

Can you do this with a microphone? Can it show the output as I speak, instead of using an audio file?

purushothaman

Hi sir. I have one doubt: the WAV file used in this video is very short, so please tell me, what is the maximum WAV file duration?

balajicmb

Hi,

For me it's just printing "he he he he..." on every new line. Am I missing something?

TPrakashpra

Can streaming transcription be done on CPUs only? Or is a GPU preferable for quick output on a large number of files?

gauravvij

Hey, I am not able to install DeepSpeech on my PC. It throws an error (ERROR: Could not find a version that satisfies the requirement deepspeech (from versions: none) ERROR: No matching distribution found for deepspeech). I am using the Windows 10 October 2020 update.

parinaypanwar

Would there by chance exist a "Deep Speech operation and options for the impatient and code-challenged Techno-moron"? Thx!

jcw

Premium content. Absolute genius. Is it also possible for you to share the Colab files in the video description?

professorspawn

Can we create an Android app for the same?

shubhamkhamkar

Hi, nice video. Does this model work for our Indian language?

raghudatheshgp

Can we do speaker diarization using DeepSpeech?

souramrakesh

Please, what is your name on LinkedIn? I have been working on speech to text for a month now, so you could share some ideas with me.
Thanks

akinsnath

Hi sir, kindly share the code for study purposes.

SivaShankarsss

Can you please share this Colab file on GitHub?

torshamondal

Very good video. I am really excited and would like to connect and communicate with you directly. Is that possible, please?

Abdulsaleem