Using high quality local Text to Speech in Python with Coqui TTS API

Tutorial showing you how to set up high-quality local text to speech in a Python script using the Coqui TTS API.

Please subscribe to my channel 😊.

00:00 Intro
00:50 Preparations
02:00 Create TTS HelloWorld script
05:00 Testing the script
06:30 More info on codebase
07:18 Outro

#texttospeech #python #api #privacy

---
Comments
Author

I can really recommend activating GPU computing. On my system it led to a 6x speedup!
tts = TTS(model_name="tts_models/de/thorsten/tacotron2-DDC", gpu=True)
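
A minimal sketch of how that tip could be wrapped so the same script also runs on machines without a GPU. The helper names `cuda_available` and `synthesize` and the output path `output.wav` are made up for illustration; the `gpu` keyword is the one used in the comment above, and newer Coqui TTS releases may prefer a different way of selecting the device.

```python
import importlib.util


def cuda_available() -> bool:
    """Return True only if PyTorch is installed and reports a usable CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the check also works without PyTorch
    return torch.cuda.is_available()


def synthesize(text: str,
               file_path: str = "output.wav",  # hypothetical output file
               model_name: str = "tts_models/de/thorsten/tacotron2-DDC") -> str:
    """Write `text` as speech to `file_path`, using the GPU only when one is present."""
    from TTS.api import TTS  # requires the Coqui TTS package to be installed
    tts = TTS(model_name=model_name, gpu=cuda_available())
    tts.tts_to_file(text=text, file_path=file_path)
    return file_path


# Example call (downloads the model on first use):
# synthesize("Hallo Welt, das ist ein Test.")
```

Falling back to the CPU keeps the script usable everywhere, at the cost of slower synthesis; the 6x figure above is one commenter's measurement and will vary by system.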

Talaxianer
Author

This was very helpful... it seems that ChatGPT doesn't know about Coqui. It's quite incredible when you compare this to 'say' on the Amiga or the talking program on the Atari ST. Well, it's quite incredible full stop, in all honesty.

PooperScooperTrooper
Author

Hello - thank you very much for this work and this video. I was working on a personal project and ran into some issues with another TTS package and I've been feeling pretty bummed but now I'm thinking this might work for me so I'm gonna give it a go and watch more of your videos. Thank you!

christopherwoods
Author

Finally something better than espeak 🙏
Waaay better

johnpaulvela
Author

@ThorstenMueller thanks, will definitely check it out! How much faster is Piper compared to Coqui TTS?

alancriaxyz
Author

I have another issue. I get this error:

raise Exception(" [!] No espeak backend found. Install espeak-ng or espeak to your system.")
Exception: [!] No espeak backend found. Install espeak-ng or espeak to your system.

I did install espeak-ng 1.0.2, but the error remained.
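
One possible cause, offered as an assumption rather than a diagnosis, is that the espeak executable is installed but not on the `PATH` seen by the Python process. A small standard-library check can confirm whether Python can find it; `find_espeak_backend` is a hypothetical helper name.

```python
import shutil


def find_espeak_backend():
    """Return (name, path) for the first espeak executable found on PATH, else None."""
    for name in ("espeak-ng", "espeak"):
        path = shutil.which(name)
        if path:
            return name, path
    return None


backend = find_espeak_backend()
if backend is None:
    print("No espeak executable is visible on PATH for this Python process.")
else:
    print(f"Found {backend[0]} at {backend[1]}")
```

If this prints that nothing was found, adding the espeak-ng install directory to `PATH` and restarting the terminal is a reasonable next thing to try.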

raphaelbird
Author

Beautiful way of doing it! Great! Thank you so much.

zerthura
Author

Good video. It would be nice to see what other extra stuff we can do with the API, or does it only do synthesis without other options? For example, Tortoise can do stuff like changing the mood of the speaker if you say "I am happy" in the prompt. I wonder if Coqui has modulation options.

AiEdgar
Author

I would kill for a 'Windows version tutorial for someone who doesn't know Python :)'

MaximBordyug.
Author

I have trained a custom model. How do I load it in the API?

weebprogrammer
Author

Is it possible to change the vocoder in this code example? Also, is a male voice available for English in TTS? Thanks for the answer.

entl_
Author

How do I get audio output at 44.1 kHz? I get bad-quality audio: at only 24 kHz it sounds like walkie-talkie radio speech.

PlayGameToday
Author

Amazing content @Thorsten-Voice. However, the processing time is 75 seconds to create a WAV file from two input sentences using a cloned voice created from a reference WAV file. Is there anything I can do to make it run faster? Otherwise it is not usable in a web application, for instance. I've seen your pinned gpu=True comment, but are there other ways too? Or is one of the models significantly faster?

florishol
Author

How can I change the tonality or modify aspects of the voice from Python?

AdrianFlores-dcvu
Author

Does this not work with newer versions of Python? I'm using 3.11.3, and it looks like you need 3.9.

raphaelbird
Author

How do I clone a specific voice for the Hindi language with the Tortoise or Coqui model?

shailendrarathore
Author

Hello,
your video is really well made and everything worked great. I have a question: is there a way to use a voice I created myself (you managed to use your own voice, after all)? In other words, to use my own tts_model. If so, how do you create such a model in the first place? I'm sure you can help.
Thanks for the video, and maybe for an answer too.

lolzoje
Author

Can you go into how umlauts work and whether, and if so how, inline English works? Thanks!

martinparidon
Author

Is it possible to run the TTS model we developed in Google Colab using Tacotron 2 with this?

mir_intizam
Author

Can I use my own dataset? If so, how do I create and connect it?

deprome