AI Text to Speech in 10 Minutes with Python and Watson TTS

Tired of speaking on Webex?

Not so pumped to give that speech?

Just plain can't be bothered talking?

Forget it, just use text to speech to do it for you in 10ish minutes! You can convert written text into AI-powered, neural-network-generated speech in minutes. Plus there's support for a whole bucketload of different languages, so whether you're speaking Dutch or Mandarin, it's got you covered!

In this video you’ll learn how to:
1. Set up the Watson Text to Speech Service
2. Convert Text to Speech Using Python and Watson
3. Convert Text Using Different Language Models, Including French
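
The three steps above can be sketched roughly as follows. This is a minimal sketch, not the code from the video: it assumes the `ibm-watson` Python SDK is installed and that you have an API key and service URL from your own Watson Text to Speech instance. The environment variable names, the helper function names, and the choice of voices are illustrative assumptions; check the voice list for your own service instance.

```python
import os

# Example voice names; which voices are available depends on your service
# instance, so treat these as assumptions and swap in your own.
VOICES = {
    "en": "en-US_AllisonV3Voice",
    "fr": "fr-FR_ReneeV3Voice",
}


def build_client(api_key, service_url):
    """Create an authenticated Text to Speech client (hypothetical helper)."""
    # Imported lazily so the rest of this file works without the SDK installed.
    from ibm_watson import TextToSpeechV1
    from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

    client = TextToSpeechV1(authenticator=IAMAuthenticator(api_key))
    client.set_service_url(service_url)
    return client


def synthesize_to_file(client, text, out_path, language="en"):
    """Synthesize text as MP3 with the voice mapped to `language` and save it."""
    response = client.synthesize(
        text, accept="audio/mp3", voice=VOICES[language]
    ).get_result()
    with open(out_path, "wb") as audio_file:
        audio_file.write(response.content)
    return out_path


if __name__ == "__main__":
    # Hypothetical environment variable names for the credentials.
    api_key = os.environ.get("WATSON_TTS_APIKEY")
    url = os.environ.get("WATSON_TTS_URL")
    if api_key and url:
        tts = build_client(api_key, url)
        synthesize_to_file(tts, "Hello world!", "speech_en.mp3", "en")
        synthesize_to_file(tts, "Bonjour tout le monde !", "speech_fr.mp3", "fr")
    else:
        print("Set WATSON_TTS_APIKEY and WATSON_TTS_URL to run the live example.")
```

Switching language here is just a matter of picking a different voice, since each voice is tied to one language.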

GitHub Repo for the Project:

Want to learn more about it all:

Oh, and don't forget to connect with me!

Happy coding!
Nick

P.S. Let me know how you go and drop a comment if you need a hand!

Music by Lakey Inspired
Comments

Thank you for the video.
Any chance to make the speaker sound less robotic?

incrementis

Thank you so much, that was useful and super simple. Keep it up!!!

farahfekih

Great tutorial, but how can you change where the file goes? Right now it's saving to my desktop, but I want it to save to another folder. How can I do that?

Just-Relax.
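
On the question above: the audio ends up wherever the path passed to `open()` points, and a bare filename is resolved against the current working directory. A minimal sketch of building a path in another folder follows; the helper name `output_path` and the example folder are hypothetical.

```python
from pathlib import Path


def output_path(folder, filename="speech.mp3"):
    """Return folder/filename, creating the folder if it does not exist."""
    target_dir = Path(folder).expanduser()
    target_dir.mkdir(parents=True, exist_ok=True)
    return target_dir / filename


# Usage (the folder name is only an example):
# with open(output_path("~/Music/tts", "speech.mp3"), "wb") as audio_file:
#     audio_file.write(audio_bytes)  # audio_bytes: the MP3 data from the service
```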

Hi Nick, thank you for the helpful video. What if I wanted to make each line a separate audio file? (dividing up the paragraph)

TheDemolitionmech
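
One possible approach to the question above is to split the text into lines first and then synthesize each line into its own numbered file. The sketch below only covers the splitting and naming; both helper names are hypothetical, and the synthesis call itself is left as a comment because it depends on how your client is set up.

```python
def split_lines(text):
    """Return the non-empty lines of text with surrounding whitespace removed."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def numbered_filenames(count, prefix="line", ext="mp3"):
    """Return zero-padded filenames such as line_01.mp3, line_02.mp3, ..."""
    width = max(2, len(str(count)))
    return [f"{prefix}_{i:0{width}d}.{ext}" for i in range(1, count + 1)]


# Usage sketch:
# lines = split_lines(open("script.txt", encoding="utf-8").read())
# for line, name in zip(lines, numbered_filenames(len(lines))):
#     ...  # synthesize `line` and write the returned audio bytes to `name`
```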

Hi Nic, it was a nice tutorial. I've just tried the code and found this problem: "It is required that you pass in a value for the "algorithms" argument when calling decode()".

alvarosaez

You have to register with IBM to use Watson. To register you have to give them your credit card details, which I am not prepared to do.

tobys

How do I adjust the speech rate in Google Colab?

NewHorizon

Great job! Hi Nic, following this video, I successfully converted a text file with two sentences into an MP3 file. I want to have a pause (1 second or 1.5 seconds) between the two sentences. How do I do it? Do you have another video or sample for doing so? Many thanks.

kelvinfm
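
For the pause question above, one option is SSML: the standard `<break>` element inserts a silence of a given length, and the Watson service accepts SSML input in place of plain text. The sketch below only builds the SSML string; the helper name is hypothetical, and how the service renders the pause should be checked against your own output.

```python
from xml.sax.saxutils import escape


def join_with_pause(sentences, pause_seconds=1.0):
    """Wrap sentences in <speak> with a <break> of pause_seconds between them."""
    pause_ms = int(round(pause_seconds * 1000))
    break_tag = f'<break time="{pause_ms}ms"/>'
    # Escape &, < and > so the sentence text cannot break the SSML markup.
    body = break_tag.join(escape(s.strip()) for s in sentences)
    return f"<speak>{body}</speak>"


# Usage sketch: pass the returned string as the text to synthesize.
# ssml = join_with_pause(["First sentence.", "Second sentence."], 1.5)
```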

Thank you a lot, exactly what I was looking for!!!

blenderdad

Thanks for a super tutorial, Nicholas.
I am stuck with this error message and can't seem to resolve it with the resources available on Google.
Any help would be greatly appreciated.

testkitseurope

Wow, this was super helpful. Any idea on how to get around the maximum number of characters that can be sent to IBM Watson? Right now your solution works if the file contains a small amount of text, but if the file is larger than a certain number of characters, you get an error when using your method.

KnowFunOfficial
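
A common workaround for a per-request size limit like the one described above is to split the text into smaller chunks at sentence boundaries and synthesize each chunk separately. The sketch below shows only the chunking; the function name is hypothetical and the default `max_chars` value is an assumption, not the service's documented limit, so set it from the error you actually see.

```python
import re


def chunk_text(text, max_chars=3000):
    """Split text into chunks of at most max_chars, breaking after sentences."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
            continue
        if current:
            chunks.append(current)
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > max_chars:
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        current = sentence
    if current:
        chunks.append(current)
    return chunks


# Usage sketch:
# for i, chunk in enumerate(chunk_text(long_text), start=1):
#     ...  # synthesize `chunk` and save it as part_{i}.mp3
```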

That was a very useful tutorial. Thank you.

johanvandemerwe

Thanks, Nick. This video has been a great help. When I try to do TTS in Spanish, the Spanish text is not identified correctly, and as a result incorrect Spanish audio is generated.

santoshnaik

Thank you so much for this video! Is there a way to get the audio file link instead of the audio itself?

AdinanBrito

Thank you sooo much. I have a question: is there any way to integrate IBM TTS with the Apple Mac speech feature? I want to call the TTS to read the selected text on screen. Is that possible?

MonkeyDLuffy-cqlo

Thanks for the engaging video. Could you please help me with the error message at 2:47: "zsh:1: command not found: pip"?

gilsmadi

LOVE IT! Hello Nicholas, I'm new to NLP and ML/AI and just started learning it. I'm about to work on a project for a mobile app that translates text to sign language. Is there any API out there that can help me speed up the project, or any advice on how to approach it? With the little I know, I was planning on a sort of classification model from text to image, but I think it won't be the best thing to do if I also want to add the option to convert the image (sign language image) to text to make the translator more flexible.

Keep up the good work, this is really great content!

LpARTURO

Is there a way to have each sentence saved as its own MP3 file?

mikepierce

Can AI interpret numerical trends and then convert them to speech? E.g. video game or sports commentary at halftime.

Ricocase

Hey Nic, thanks for the video.
Please let me know if there's any way to play the audio file directly,
instead of saving it locally and then playing it.

saalemrafiq