Speech to Text with Python - Speech Recognition - From MP3 File

Here we will get the text from an MP3 file using the SpeechRecognition module.

#AllTech #Python #SpeechToText
Comments

PyAudio is not working on Linux.
Any ideas?

mohamedelmdary
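
A note on the PyAudio question above: as far as I know, SpeechRecognition only needs PyAudio for live microphone input (sr.Microphone), not for reading audio files, so transcribing from a file may work without it. On Debian/Ubuntu, building PyAudio usually requires the PortAudio development headers first (the package name portaudio19-dev is my assumption for those distributions). Below is a minimal sketch to check what is installed; the helper name has_module is made up for illustration.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module can be located, without importing it."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # PyAudio is imported as "pyaudio"; SpeechRecognition as "speech_recognition".
    for module in ("speech_recognition", "pyaudio"):
        status = "installed" if has_module(module) else "missing"
        print(f"{module}: {status}")
```

If speech_recognition shows as installed and only pyaudio is missing, the file-based example should still be worth trying.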

I got a problem when I tried to install PyAudio using pipwin:

pipwin install PyAudio
pipwin : The term 'pipwin' is not recognized as the name of a cmdlet, function, script file, or operable program. Check the spelling of
the name, or if a path was included, verify that the path is correct and try again.
At line:1 char:1
+ pipwin install PyAudio
+
+ CategoryInfo : ObjectNotFound: (pipwin:String) [], CommandNotFoundException
+ FullyQualifiedErrorId : CommandNotFoundException

randythamrin
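
The PowerShell message above says the pipwin command itself was not found, which usually means pipwin is not installed or its folder is not on PATH. Below is a hedged sketch of how one might check this and build an install command that does not depend on PATH; the helper names on_path and pip_install_command are hypothetical, and whether pipwin can still fetch a PyAudio build for your Python version is something I cannot confirm.

```python
import shutil
import sys


def on_path(command: str) -> bool:
    """Return True if the shell would be able to find this command."""
    return shutil.which(command) is not None


def pip_install_command(package: str) -> list[str]:
    """Build a pip command that runs through the current interpreter."""
    return [sys.executable, "-m", "pip", "install", package]


if __name__ == "__main__":
    if not on_path("pipwin"):
        print("pipwin is not on PATH; it may need to be installed first:")
        print(" ".join(pip_install_command("pipwin")))
    print("Alternatively, try installing PyAudio with pip directly:")
    print(" ".join(pip_install_command("PyAudio")))
```

Running pip as a module of the interpreter sidesteps the "not recognized as the name of a cmdlet" problem, since only Python itself has to be found.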

Thanks a lot for the awesome tutorial. You rock.

KhalilYasser

I need something that splits hours-long MP3 audiobooks into chapters when it finds a phrase like "Chapter N - The name of the Chapter".

FPChris
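
One possible approach to the chapter-splitting idea above, as a rough sketch only: transcribe the audiobook in short chunks, keep each chunk's start time, then scan the text for a "Chapter N" marker. The input format (a list of (start_seconds, text) pairs) and the names CHAPTER_RE and find_chapters are assumptions made for illustration; the transcription and the actual audio cutting are not shown.

```python
import re

# Matches "chapter", one word or number, optional separators, then the title.
CHAPTER_RE = re.compile(r"\bchapter\s+(\w+)\b[\s\-:,.]*(.*)", re.IGNORECASE)


def find_chapters(segments):
    """Return (start_seconds, number, title) for segments naming a chapter.

    segments: iterable of (start_seconds, transcribed_text) pairs.
    """
    chapters = []
    for start, text in segments:
        match = CHAPTER_RE.search(text)
        if match:
            chapters.append((start, match.group(1), match.group(2).strip()))
    return chapters


if __name__ == "__main__":
    demo = [
        (0.0, "Chapter 1 - The Beginning"),
        (30.0, "it was a dark night"),
        (1800.0, "chapter two the road"),
    ]
    for start, number, title in find_chapters(demo):
        print(f"{start:>8.1f}s  chapter {number}: {title}")
```

This pattern will also fire on ordinary sentences such as "in this chapter we", so real use would probably need a stricter pattern or a manual review of the detected times.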

This no longer seems to work for MP3 files; I think you need to convert to WAV first.

rmansfield
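
The convert-to-WAV suggestion above could look roughly like this. It is a sketch, not the video author's method: it assumes the third-party pydub package (which in turn needs ffmpeg installed) for the conversion, and uses SpeechRecognition's Google Web Speech recognizer, which needs an internet connection. The function names wav_path_for and mp3_to_text are hypothetical.

```python
from pathlib import Path


def wav_path_for(audio_path):
    """Return the same path with its extension replaced by .wav."""
    return Path(audio_path).with_suffix(".wav")


def mp3_to_text(mp3_path):
    """Convert an MP3 to WAV, then transcribe the WAV file."""
    # Imported lazily so the helper above works without these packages.
    import speech_recognition as sr
    from pydub import AudioSegment  # assumption: pydub + ffmpeg are installed

    wav_path = wav_path_for(mp3_path)
    AudioSegment.from_mp3(str(mp3_path)).export(str(wav_path), format="wav")

    recognizer = sr.Recognizer()
    with sr.AudioFile(str(wav_path)) as source:
        audio = recognizer.record(source)  # read the whole file into memory
    return recognizer.recognize_google(audio)  # online service


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(mp3_to_text(sys.argv[1]))
```

For long recordings, reading the whole file at once is probably not practical; splitting the audio into shorter pieces before recognition is the usual workaround.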

Is there any open-source Python library for converting an audio file to text?

balajicmb
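
On the open-source question above: the SpeechRecognition package is itself open source, and besides the online recognizers it includes recognize_sphinx, which runs offline through CMU Sphinx. The sketch below assumes the pocketsphinx package is installed alongside it and that the input is already a WAV file; the function name transcribe_offline is made up for illustration.

```python
def transcribe_offline(wav_path):
    """Transcribe a WAV file locally with CMU Sphinx; return "" if unclear."""
    # Imported lazily; recognize_sphinx also requires the pocketsphinx package.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)
    try:
        return recognizer.recognize_sphinx(audio)
    except sr.UnknownValueError:
        # Raised when the engine cannot make out any speech.
        return ""


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(transcribe_offline(sys.argv[1]))
```

Accuracy from Sphinx tends to be lower than from the online services, so this is mainly useful when nothing may leave the machine.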

It was Text to Speech, NOT SPEECH TO TEXT. It's different.

Rin-lbbu

Every (valid) .mp3 file I use results in this error:

D:\Programming\MP3ToText>py .\program.py
Traceback (most recent call last):
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\site-packages\speech_recognition\__init__.py", line 203, in __enter__
    self.audio_reader = wave.open(self.filename_or_fileobject, "rb")
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\wave.py", line 509, in open
    return Wave_read(f)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\wave.py", line 163, in __init__
    self.initfp(f)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\wave.py", line 130, in initfp
    raise Error('file does not start with RIFF id')
wave.Error: file does not start with RIFF id

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\site-packages\speech_recognition\__init__.py", line 208, in __enter__
    self.audio_reader = aifc.open(self.filename_or_fileobject, "rb")
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 917, in open
    return Aifc_read(f)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 352, in __init__
    self.initfp(file_object)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 316, in initfp
    raise Error('file does not start with FORM id')
aifc.Error: file does not start with FORM id

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\site-packages\speech_recognition\__init__.py", line 234, in __enter__
    self.audio_reader = aifc.open(aiff_file, "rb")
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 917, in open
    return Aifc_read(f)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 358, in __init__
    self.initfp(f)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\aifc.py", line 314, in initfp
    chunk = Chunk(file)
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\chunk.py", line 63, in __init__
    raise EOFError
EOFError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "D:\Programming\MP3ToText\program.py", line 12, in <module>
    with sr.AudioFile(mp3FileName) as source:
  File "C:\Users\stephen\AppData\Local\Programs\Python\Python310\lib\site-packages\speech_recognition\__init__.py", line 236, in __enter__
    raise ValueError("Audio file could not be read as PCM WAV, AIFF/AIFF-C, or Native FLAC; check if file is corrupted or in another format")
ValueError: Audio file could not be read as PCM WAV, AIFF/AIFF-C, or Native FLAC; check if file is corrupted or in another format

stephenbartlett
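
Reading the traceback above: sr.AudioFile tries the file as WAV (which must start with a RIFF id), then as AIFF (a FORM id), then as FLAC, and raises the final ValueError when none of them fit. An MP3 file starts with neither id, so the error is expected rather than a sign of a corrupt file. Below is a small sketch that checks the first bytes of a file to see which case applies; the function names sniff_audio_format and sniff_file are hypothetical, and the MP3 check is a simple heuristic (an ID3 tag or an MPEG frame sync), not a full validator.

```python
def sniff_audio_format(header: bytes) -> str:
    """Guess the container format from the first 12 bytes of a file."""
    if header[:4] == b"RIFF" and header[8:12] == b"WAVE":
        return "wav"
    if header[:4] == b"FORM" and header[8:12] in (b"AIFF", b"AIFC"):
        return "aiff"
    if header[:4] == b"fLaC":
        return "flac"
    if header[:3] == b"ID3":
        return "mp3"  # MP3 with an ID3v2 tag at the start
    if len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0:
        return "mp3"  # bare MPEG audio frame sync (heuristic)
    return "unknown"


def sniff_file(path) -> str:
    """Read the first bytes of a file and guess its audio format."""
    with open(path, "rb") as f:
        return sniff_audio_format(f.read(12))


if __name__ == "__main__":
    import sys

    for name in sys.argv[1:]:
        print(f"{name}: {sniff_file(name)}")
```

If this reports mp3 for the input, the file most likely needs converting to WAV, AIFF, or FLAC before it is passed to sr.AudioFile.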