This free AI Text-to-Speech is insane! Add emotions & make podcasts

F5-TTS full tutorial, installation, testing. Free, open-source AI voice cloner with expressive voices. #ai #f5tts #rvc #aivoice #voicecloning

0:00 Intro
0:42 How it works
9:42 Installation
10:00 Git
10:50 Installation continued
12:29 Anaconda
15:22 Installation continued
21:25 FFmpeg
23:18 Installation continued
24:00 How to start
25:05 Text to speech
32:46 Adding emotions
38:25 Podcast generation
41:45 Other languages
Comments

This is wild! It’s crazy how little input audio it requires. Also I just wanted to say thanks. If it weren’t for you I would have never discovered my passion for creating AI voice models!

Dryesthalo

That mixing of Chinese and English is simply perfect. Any Chinese speaker, whether Mandarin or even Cantonese, talks just like that. The TTS shows no flaw in its voice, tone, or pronunciation; if I played that to my friends and family, they couldn't really spot the common AI characteristics in it.

vinching

I've followed your channel since the early days. I'm super happy for your growth and also super happy when you do content like this... for non-tech people to be able to try and have fun with AI. A dedicated video for everyone to follow. Keep up the good stuff!

HeRmEtIkA

That thumbnail... He knew what he was doing

liarus

This AI is really good...at sounding like a bad audiobook narrator! 😂 It nails those over-the-top emotions, but they don't sound very human. Maybe the problem is that it's trained on audiobooks, where the emotions are often exaggerated.
What if we used this "fake emotion" data to our advantage? First, train an AI to recognize those audiobook patterns. Then, train a second AI to spot real emotions in everyday speech from YouTube, podcasts, etc. The second AI could learn to tell the difference between fake and genuine, and we'd get an AI that truly understands how we express emotions! What do you guys think?

brianlink

Sounds good, but not good enough. I'll wait a bit longer for an upgrade.

froilen

The best part is we can use the existing XTTS set of tools to modify our own voices and create the emotional samples for the existing voices.

viddarkking

This version of the tool is astonishing! It is exactly what I have been looking for. Thank you!

jihe

I watch lots of tutorials on youtube. This one is among the best. Keep up the good work and thanks for sharing your know-how!

cippalippa

I was also very surprised by how well this works... Thanks!

SkylineAICreator

The thumbnail man 🤣 man of culture! like and sub!

sunnyhaoshiyu

Crazy stuff! I'm glad I found this channel.

adelite

Great but needs to support more languages.

mohamedzewail

Truly appreciate the detailed installation procedure, made my life much easier. Thanks!

VoicelessScream

Voice synthesis with emotions? That’s a next-level breakthrough for personalizing user experiences. Feels like we're inching closer to seamless AI-human conversations.

AdvantestInc

I love how you don't assume that I know what you know and bothered to explain the basics, and that you made timestamps so the more knowledgeable can skip ahead. Excellent, man!!!

So we can't train it properly on a larger audio file? (You can't pack enough vocal range into that for professional work.)

steve-gjb

I hope one day someone makes an open-source AI that makes songs like Suno or Udio.

bause

I don't know where to go without you. You don't know how important you are in my life. Saved for later as usual. <3

Anamontes-ow

I always wonder why the requirements are never listed first... xD (specs, VRAM/RAM requirements)

The Chinese is insane. It always sounds more than the original voice lol

sasuofficial

I'm glad that this is being developed, even if it's still at a point where I wouldn't even enable it if it was as easy as a toggle, let alone dig into code to get it working.
