Speak Any Language With AI - Realtime Speech-to-Speech Translation & Voice Synthesis (w/Code)

Показать описание

In this video we dive into real time speech to speech translation, speaking in one language, and having your own voice speak in a different language!

Resources -

Chapters:
00:00 - Intro & Demonstration
00:46 - High Level Overview
01:06 - AssemblyAI For Speech to Text Streaming
02:30 - How to Use STT Streaming Output
03:48 - Using OpenAI as a Translation Service
04:51 - STT Streaming With Translation
05:51 - ElevenLabs Voice Cloning
07:01 - ElevenLabs Python Voice Synthesis
08:38 - Putting it All Together
09:00 - Outro

Рекомендации по теме

Комментарии

As a personal study, this is a great sharing, but AI phones such as iOS or Android will soon integrate relevant functions for real-time calls (phone calls or online meetings). Of course, privacy protection will be a constraint

xiaodongdong-lx

The problem is there is no East Africa Ethiopian Ahmaric language

simont

Thank you for the video. I am currently living in Osaka, Japan and I am very interested in Instant Translation with AI models. However, what I understand by "Instant Translation" is not: "I say a sentence - The model translates it after a few seconds and I can hear it - I say another sentence - The model translates it after a few seconds and I can hear it..." What I understand by Instant Translation is: "You are talking in Japanese and, while you are talking in Japanese (with a delay of a few senconds), I listen your speech in Spanish. No matter how long it is the speech. May be the Japanese speech is 10 minutes long and I can begin to listen to it after 5 seconds in Spanish and will end 5 seconds after finishing in Japanese". Basically it is like having a interpreteur by your side who doesn't have to wait until the end of the speech to begin translating. That way, the conversation gets more fluid.
I know this is not an easy task, as there are SOV and SVO languages. However, I think that Seamless m4t model is able to take this into account aswell.
Do you think is it possible to implement such a thing with this model?

MisionJapon

Hi @Adam ! I just messaged you on linkedin! Would love to chat.

CarasGFTK

Awesome project! Is it possible to use another service as translation rather than Chatgpt that doesn't require a subscription?

alejandroGTES

Hi, I am very interested in your script, but I can't seem to get it running. I don't understand where to input the API keys for each program, as there is no such section in your script. I am encountering a lot of errors. I really need your help.

deintez

God, the day we have this in real time with low latency for livestreams will be amazing. I understand English perfectly well but I don't feel confident streaming in another language lol.

JohnMaverick-wc

Hi Adam, thank you so much for sharing this video! This is exactly what I've been searching for. I'm actually looking for an AI developer to help me create an MVP app for my startup business in Japan in the beauty industry. Would you be open to discussing potential work opportunities, or is this more of a hobby for you?

ploylovespeach

Its huge latency… who said its realtime

smilebig

Hey Adam is there way to book a 1 on 1 to see if you can help me with this. I just need to get gpt + asembly Ai for the project I want.

dcleinad

Really impressive that combination of these 3. But to have a perfect loop how to deal with an input audio (voice) in real time before start speaking to respond ? And another question the generating audio at last could be an emulation of your microphone ?

nilamara

And it can be used to communicate in discord?

vvnter__

Speak Any Language With AI - Realtime Speech-to-Speech Translation & Voice Synthesis (w/Code)

What's the BEST AI For Language Learning? (CLEAR winner)

AI Video Translator Clones Your Voice & Syncs Lips in Seconds

Get fluent with AI - Use ChatGPT to learn and practice English

Speak English with AI (Spoiler Alert: That's Awesome!)

Translate Video into ANY Language with AI | Your Own Voice

A.I. - The END of Language Learning?

Introducing Talkpal - AI Language Tutor

Why AI doesn't speak every language

All AI speak German

HeyGen AI Translation Can Translate Video into ANY Language!

How to Translate Video into ANY Language with AI | Own Voice | FREE

I CAN SPEAK IN ANY LANGUAGE by using this AI

I Learned Portuguese 100% from AI Teacher, Brazilians Stunned

🔴 Use Google AI to Speak 32 Languages in Real Time

I Was FLOORED. Realtime AI Translation & Voice Cloning!

I Took Xiaoma's A.I. Language Tutor For 30 Days

Watch Lionel Messi Speaking 7 languages (with AI)!

How to use AI for practicing ENGLISH SPEAKING (for free)

BEST AI Dubbing | ElevenLabs

Use AI to Clone Voices & Speak OTHER LANGUAGES! - Elevenlabs + ChatGPT 4

The secrets of learning a new language | Lýdia Machová | TED

Multi-Language Text to Speech AI (FREE multi-language ai voice genration software)

Do We Still Need to Learn a Language in the AI Era?

I Can Speak 8 Languages! | Get My AI Voice Over | Insane AI Voice Generator