Low-latency AI voice chat in 60 lines of code using faster_whisper and elevenlabs input streaming.

A short proof-of-concept for a real-time AI companion. Note: this demo was recorded on a 10 Mbit/s connection, so actual performance may be even better on faster connections.
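The core latency trick in a setup like this is to start TTS before the LLM has finished replying: tokens are streamed out of the model, grouped into sentence-sized chunks, and each chunk is handed to the TTS as soon as it completes. A minimal sketch of such a chunker (the function name and delimiter set are my own, not taken from the repo):

```python
def sentence_chunks(token_stream, delimiters=".!?"):
    """Group an LLM token stream into sentence-sized chunks for streaming TTS."""
    buffer = ""
    for token in token_stream:
        buffer += token
        # Flush as soon as a sentence delimiter appears so TTS can start early.
        while any(d in buffer for d in delimiters):
            # Cut at the earliest delimiter present in the buffer.
            idx = min(buffer.index(d) for d in delimiters if d in buffer)
            yield buffer[: idx + 1].strip()
            buffer = buffer[idx + 1 :]
    if buffer.strip():
        yield buffer.strip()  # flush any trailing partial sentence
```

In the real app, a generator like this would be passed straight to the ElevenLabs streaming synthesis call, so audio for the first sentence plays while later sentences are still being generated.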

Comments

Incredible work!
Found your projects today and I cannot describe in words how impressive this all is. +1!

khacuu

Incredible! I was working on the same project and ran into the TTS latency issue: any cloud TTS service has latency that is too high for real-time use. Definitely going to implement your approach. Thanks!

alexandresajus

Great work! With a strong enough computer you could run a smaller 13B model with a fast local TTS for much lower latency.

aorusaki

It's impressive! Which GPU are you using?

sergitorrabadella

Very nice. Great job ❤
Out of curiosity, how would you handle back-to-back conversation with interruption handling, without using the space key?
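Not something the video shows, but one common answer to hands-free interruption is voice activity detection: keep reading microphone frames while the assistant is speaking, and stop TTS playback the moment speech energy is detected. A crude energy-threshold VAD sketch (threshold and frame format are assumptions, not from the project):

```python
import math

def is_speech(frame, threshold=0.02):
    """Crude energy-based VAD: True if the frame's RMS amplitude exceeds threshold.

    `frame` is a sequence of float samples in [-1.0, 1.0]. Real systems would
    use a proper VAD (e.g. WebRTC VAD or Silero) instead of a bare RMS gate.
    """
    rms = math.sqrt(sum(s * s for s in frame) / len(frame))
    return rms > threshold
```

In the playback loop you would call `is_speech` on each incoming mic frame and, when it fires, cancel the current TTS stream and hand the mic audio to the transcriber.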

NAE

Actually, you want about 100 ms of delay at the very least. We're human and take time to process information, and it would seem unnatural to have a conversation where you felt like someone was finishing your sentences for you all the time.

Smashachu

Please make a tutorial video for installing the AI; I tried following the guide but couldn't get it working.

Anonymos

Wow, this project is insane. Is it possible to swap OpenAI for a local LLM to have a 100% offline voice assistant?
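The video doesn't cover this, but one common route to a fully offline setup is serving a local model through llama-cpp-python, which exposes an OpenAI-style streaming chat API, so the rest of the pipeline can stay the same. A hedged sketch (the helper name and model path are placeholders of mine):

```python
def stream_local_reply(llm, user_text):
    """Yield response text chunks from a local llama.cpp model.

    `llm` is a llama_cpp.Llama instance, e.g.:
        from llama_cpp import Llama  # pip install llama-cpp-python
        llm = Llama(model_path="models/some-13b-chat.Q4_K_M.gguf")
    """
    for part in llm.create_chat_completion(
        messages=[{"role": "user", "content": user_text}],
        stream=True,
    ):
        # Streamed parts follow the OpenAI delta format.
        delta = part["choices"][0]["delta"]
        if "content" in delta:
            yield delta["content"]
```

The chunks this yields can be fed into the same sentence-chunking and streaming-TTS path as the OpenAI tokens, so going offline is mostly a drop-in swap (plus a local TTS if you also want to replace ElevenLabs).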

arsenlupin

Hi buddy!!
I'm trying this approach but getting an error. I have built a voice assistant using LangChain and GPT-3.5 Turbo with the ElevenLabs and OpenAI APIs, but latency is not coming down.

akashraut

Hey brother! When I run your program it shows a rate limit error. BTW, I am using the free tier of OpenAI.
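Free-tier OpenAI keys have very low requests-per-minute limits, so the usual fix is to retry with exponential backoff rather than change the program itself. A generic sketch (the helper name and parameters are mine, not from the repo):

```python
import random
import time

def with_backoff(fn, max_retries=5, base_delay=1.0, sleep=time.sleep):
    """Call fn(), retrying with exponential backoff plus jitter on failure.

    In a real app you would catch openai.RateLimitError specifically
    rather than a bare Exception.
    """
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries: let the caller see the error
            sleep(base_delay * 2 ** attempt + random.uniform(0, 0.5))
```

Usage would look like `reply = with_backoff(lambda: client.chat.completions.create(...))`; note that backoff only smooths over bursts, it cannot raise the tier's hard limit.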

preenanahnaf