Local and Open Source Speech to Speech Assistant

Показать описание

In this video, I'll walk you through how to set up a completely local voice assistant using my project, Verbi. We'll configure three local API endpoints: Fast Whisper for speech to text, OLAMMA for the language model, and Mello TTS for text to speech. Make sure to check out my previous videos for initial setup instructions and enjoy experimenting with state-of-the-art speech components!

LINKS:
Verbi Videos:

💻 RAG Beyond Basics Course:

Let's Connect:

Signup for Newsletter, localgpt:

TIMESTAMP:
00:00 Introduction to Verbi
01:15 Setting Up Local Models
02:56 Configuring Fast Whisper API
04:41 Installing Mello TTS
08:47 Running Verbi and Testing
12:36 Conclusion and Future Updates

All Interesting Videos:

Рекомендации по теме

Комментарии

I'm looking at the github repo and it's super easy to follow. The modular nature is nice. I had cycled between using speechrecognizer with vosk, then speechrecognizer with a local whisper, then realtime transcription with whisper. But I kept having to duct tape solutions for problems that are easily solved by separating it out. Verbi helped fill in some gaps I hadn't considered. I'll be sure to leave a note in my code's readme referencing your project.

mathiasgentech

wow that Local TTS sound so natural, really cool

RickySupriyadi

Great example. How can you integrate a RAG engine to accept word and excel files?

kryptonic

Is the project "verbi" still active?

bizmark

How hard would it be to add rag support to this?

preben

The TTS sound a bit robotic but I like ❤ Appreciate the hard work. Keep it 100 💪

brto

Have anyone of us here building the TTS on our own local languages beside the English?

SAINGSAB

Local and Open Source Speech to Speech Assistant

Local and Open Source Speech to Speech Assistant

100% Local AI Speech to Speech with RAG - Low Latency | Mistral 7B, Faster Whisper ++

Local Low Latency Speech to Speech - Mistral 7B + OpenVoice / Whisper | Open Source AI

My Top 5 Open Source Text to Speech Softwares Starting off in 2024

My Top 5 Open-Source AI Text-to-Speech Models

Open Source AI Beats ChatGPT, ElevenLabs & Manus!

Run Text-to-Speech Locally: Step-by-Step Guide

how to find the best free text-to-speech

Creating Low Latency Voice Agents - Open Source 🗣️🗣️🗣️

my local, AI Voice Assistant (I replaced Alexa!!)

Clone ANY Voice In SECONDS - AllTalkTTS Setup - Check Out The Guide! #ai #voice #technology

Free & Open-Source AI Voice Cloning - Check It Out! #ai #tech #voiceover

ChatTTS - Best Quality Open Source Text-to-Speech Model? | Tutorial + Ollama Setup

Make an Offline GPT Voice Assistant in Python

STOP PAYING for AI Voices! Run Kokoro TTS Locally or FREE on the Cloud !

open source TTS and voice cloning #waifu #ai #sillytavern

OpenAI Whisper? No! There Are Better Options

Open Source Speech Technology for Everyone | L3-AI 2021

Sesame CSM 1B Local Test & Install (A VERY Good Speech Model)

Local AI Voice Generators Are Here!

Can Zonos AI voice clones compete with ElevenLabs?

Run Free Text-to-Speech Locally on Open WebUI: Kokoro TTS Setup Guide (Windows)

How to Start a Speech THE RIGHT WAY #shorts

3 FREE Elevenlabs AI Alternatives (Best AI Speech Text to Voice)