Optimizing Settings in AI Voice Changer Client

Links referenced in the video:

Hardware for my PC:

Alternative prebuilds to my PC:

Cheapest PC recommended:

Come join The Learning Journey!

If you found anything helpful, please consider supporting me and the content I am trying to produce!
Comments

Great video. Thank you.
I'm one of the VCClient and RVC contributors. There are some additions to the content of the video.

Regarding the difference between the f0 estimator harvest and crepe, in addition to the sound quality, harvest uses a CPU and crepe uses a GPU. Crepe can improve latency if you have a good GPU.

In server mode you can choose the sound driver. VCClient measures latency within VCClient, but additional latency is added when connecting to other devices.
Besides MME, WASAPI and ASIO can be selected, so if you can use them, I recommend using them.

For the protect item in advanced options, if protect is set to less than 0.5, the ratio of retrieved features will be reduced in cases where f0 estimation is unsuccessful (silence or breath sounds).

nadare
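
The protect behavior nadare describes can be pictured with a small sketch. This is not the VCClient or RVC source; the function name, the list-of-lists feature layout, and the rule that a value of 0.5 or more switches the blending off are assumptions made for illustration. It only shows the idea that, on frames where f0 estimation fails, the retrieved features are weighted by protect and the remainder comes from the original features.

```python
def blend_with_protect(retrieved, original, f0, protect=0.33):
    """Blend retrieved features with the original ones, frame by frame.

    Hypothetical helper: each frame is a list of floats, and f0[i] <= 0
    marks a frame where pitch estimation failed (silence, breath sounds).
    """
    # Assumption: protect >= 0.5 means "no protection", so retrieved
    # features are returned unchanged.
    if protect >= 0.5:
        return [list(frame) for frame in retrieved]

    blended = []
    for r_frame, o_frame, pitch in zip(retrieved, original, f0):
        # Voiced frames keep the retrieved features in full; unvoiced
        # frames keep only a `protect` share of them.
        weight = 1.0 if pitch > 0 else protect
        blended.append(
            [weight * r + (1.0 - weight) * o for r, o in zip(r_frame, o_frame)]
        )
    return blended
```

With protect set to 0.25, a voiced frame is passed through as retrieved, while an unvoiced frame is a 25/75 mix of retrieved and original features.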

From a musician's experience: if you have a sound card that supports ASIO, use ASIO instead of MME. It reduces the delay added by the audio path (e.g. on my PC, the guitar/mic recording delay for a 1024-sample chunk is 180 ms with standard MME and 14 ms with ASIO). In theory WASAPI can also be fast, but I don't have WASAPI-capable hardware.

stnhndg
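
As a rough companion to the numbers above, the time a single buffer represents follows from its size and the sample rate. The sketch below assumes a 48 kHz default, which the comment does not state, and covers only one buffer; whatever the driver stack (MME, WASAPI, ASIO) adds on top is not modeled.

```python
def buffer_latency_ms(chunk_samples, sample_rate_hz=48000):
    """Return the duration of one audio buffer in milliseconds.

    The 48 kHz default is an assumption; pass the rate your device uses.
    """
    return 1000.0 * chunk_samples / sample_rate_hz
```

Under that assumption a 1024-sample buffer lasts about 21.3 ms, so a measured delay far above that points at buffering in the driver rather than at the chunk size itself.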

This worked so great on my first Voice Changing experience. Other videos on your channel are also great! Thank you very much.

wp

Thanks for all the help! You have responded to all of the comments and provided everything I've needed. Anyway, keep up the great work and keep doing what you are doing 👍

thatonecole

Thank you so much for making this video. I have an RTX 3090 and was wondering why the voice still sounded distorted no matter how much I messed with the Extra and Chunk settings. It was probably because I had the pitch too high, in a range the voice wasn't trained on (I was using above 20). I didn't know that most voices are trained no higher than 12. I'll mess around more with this program tomorrow with what I learned from this video.

erobot
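
To put the pitch values above in perspective, here is a small sketch that assumes the pitch setting is expressed in semitones of equal temperament (an assumption; the comment only gives the raw numbers). Under that assumption each step multiplies the fundamental frequency by the twelfth root of two.

```python
def semitone_ratio(semitones):
    """Frequency multiplier for a shift of `semitones` (equal temperament)."""
    return 2.0 ** (semitones / 12.0)
```

A shift of 12 doubles the frequency (one octave), while a shift of 20 multiplies it by roughly 3.17, which would place the target well above the range a shift of 12 reaches.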

Tech is getting so cool, great video!

OtterPirate

Great video as always, glad you went through everything with a good explanation! Keep up the great work, and I am excited for what will come after RVC!

PhoonG

Hearing Senchou speak English just feels weird. Not in a bad way, it just feels like I'm hearing something I was never supposed to hear in my entire life.

lollmaonice

Great! Can't wait to try this out to see if it improves performance. If it doesn't, I might just install Windows 11 to match your settings exactly.

tetragrammaton

Great video! I'm glad you showed what this program is capable of on a 4090. It seems we're not quite there yet with AI voices. I wonder if this is a small hurdle that will be overcome soon or an insurmountable mountain, like hands are for AI art.

bwowzah

Thanks, I really need to try this one.

MrtnX

I literally just upgraded from my 1050 Ti to a 4070 today just to use this and other AI tools. Love these tutorials.

CoronaBorealis

you're a legend!! insane video quality and tutorial, can't believe i found pure gold at 4 am.
i guess youtube can also be a chad and recommend really good content wow

humble

If anyone has issues exporting an ONNX file and gets an error message in the GUI (it usually just says "error message: no error message"), check the console: it shows that PyTorch tried to allocate VRAM and failed. A quick workaround that worked for me was switching the GUI to use my CPU instead of my GPU, after which the ONNX export worked. Afterwards you can change it back to your GPU.

Kyuubical
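
The manual workaround above (switch to CPU, export, switch back) follows a general try-then-fall-back pattern. The sketch below is not part of VCClient: `export_with_cpu_fallback` and the `export_fn` callable are hypothetical names, and it assumes the failure surfaces as a `RuntimeError`, which is how PyTorch typically reports CUDA out-of-memory conditions.

```python
def export_with_cpu_fallback(export_fn, devices=("cuda", "cpu")):
    """Try `export_fn(device)` on each device in order.

    `export_fn` is a hypothetical callable that performs the export on the
    named device and returns its result. Returns (device_used, result).
    """
    last_error = None
    for device in devices:
        try:
            return device, export_fn(device)
        except RuntimeError as err:
            # Assumption: allocation failures arrive as RuntimeError.
            last_error = err
    raise RuntimeError("export failed on every device") from last_error
```

The returned device name tells the caller whether the GPU attempt succeeded or the CPU path had to be used.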

Hearing Botan/Marine speak clear English is kind of weird and awesome at the same time XD

Ariacompany

Damnn, the intro transition was goood

Solo_Recapping

Question: how do I train? What does training do?

I followed all the previous tutorials and the voice output only sounds distorted and repetitive. I'm using a Ryzen 3600X with a GTX 1050 Ti and 16 GB of RAM.

kujii_

Using an RTX 3070 with:
Chunk 256
Extra 131072
Sounds perfect on these even with the half second delay!

Odyssey_ACNH

Very helpful 👌. Where can you get models, though?

FooLXawsome

I hope they will make it in VST3 format so I can just put it on the DAW track that my microphone is routed through. It would be so amazing, holyy

katbwoi