NVIDIA’s New AI: Stunning Voice Generator!


📝 The blog post and paper are available here:

📝 My paper on simulations that look almost like reality is available for free here:

Or this is the original Nature Physics link with clickable citations:

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Gaston Ingaramo, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Sundvall, Taras Bobrovytsky, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi.

#nvidia #fugatto
Comments

Create a sound of 100s of fellow scholars holding on to their papers.

Loctorak

The true Turing test: when your videos are 100% AI generated and your audience doesn't notice.

HansMilling

Let's just hope that Nvidia has the balls to actually release this

AGIzero

The demonstration of emotional nuance in synthesized speech is a game changer. Imagine the potential for storytelling and immersive experiences. AI is truly blurring boundaries here.

AdvantestInc

1:35 This is the most impressive part to me. An angry voice making a declarative statement like that with those emphasis pauses is super realistic and I'm impressed the AI replicated not only the sound, but the pacing as well. What a time to be alive!

3:54 I think the correct approach here is for Dr. Zsolnai-Fehér to release a hit single song of his own so he can use his song whenever he wants :D

HDL_CinC_Dragon

I can't wait for 1,000s of AI-generated song slop in my feed!!!

Rygarrr

That train one was absolutely beautiful omg

SnoopyDogg

As a video editor and sound designer... what a time to be alive!

Shimulahmed

Your videos always make me smile... so entertaining!!

galefraney

0:23 I don't know any of those models. Which one corresponds to Udio or Suno?

BobbyMasteria

The next needed step is for audio AI systems to handle spatial localization of sound, that is, to generate stereo or multichannel audio that is spatially coherent. So far all of them generate only mono audio.

HectorCenteno

To be honest, it also starts to get scary when it comes to jobs, dating, etc.

Perqd

Hello Bot, give me the sound of a joyous nerd like me, who knows more than I do about things, is amazing at teaching, has a deep accent who is super excited about the time to be alive!

SK-gcxv

Rafael Valle is an incredible personality in the audio AI domain.

ananthakrishnank

4:17 AI still doesn't know how cats wear headphones

richcolour

I am looking forward to real-time voice generation for virtual characters. When those characters are given general understanding and comprehension tools like those in modern LLMs, people will be able to talk naturally to any character and ask them questions, and voice generation will be key to enabling the characters to respond.

Slayerthegreat

Generalist being specialist! What a time to be A I

kaspernordlund

Christopher Walken: "Kids are. Talking. By the door!"

ordinator.

Basically beating the Adobe audio to audio paper

kirangouds

The funny thing is, you definitely could replace yourself with audio generation (if you're not already), and even if you got found out because it generates something funky, you can be like "woah, you found it out! Good work, I got away with skiving off on a beach for 6 months before you guys got suspicious! Isn't that incredible!?". No backlash from AI usage.

davescott