Build a voice assistant with OpenAI Whisper and TTS (text to speech) in 5 minutes

🚀 Ever wanted to create your own voice-to-voice chat assistant? This video is your fast track to making it happen! I'll guide you through building a voice assistant using OpenAI's Whisper for voice recognition and TTS (Text to Speech) for voice output. And the best part? It's all done in NodeJS in just 5 minutes!

🕒 Chapters:
00:00 Introduction to Building a Voice Assistant
00:45 Live Demo
01:30 Diving into the Code
05:45 Implementing OpenAI Whisper API
06:50 Integrating OpenAI Chat Completion API
08:10 Adding OpenAI TTS (Text to Speech) API
10:40 Conclusion and Next Steps

#OpenAI #Whisper #TTS #tutorial
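
A rough sketch of the three API steps listed in the chapters (Whisper, Chat Completion, TTS) might look like the following. This is not the code from the video. It assumes `client` is an instance of the official `openai` npm package (v4-style API); the function name `voiceTurn`, the model names, the `alloy` voice, and the file paths are illustrative assumptions. Recording from the microphone and playing the result back are left out.

```javascript
// Sketch only: not the code from the video.
// Assumes `client` is an instance of the official `openai` npm package (v4 API).
const fs = require("fs");

async function voiceTurn(client, inputPath, outputPath, history = []) {
  // 1. Speech to text with Whisper.
  const transcription = await client.audio.transcriptions.create({
    file: fs.createReadStream(inputPath),
    model: "whisper-1",
  });

  // 2. Generate a reply with the Chat Completions API.
  //    The model name is an assumption; use whichever chat model you prefer.
  const messages = [...history, { role: "user", content: transcription.text }];
  const completion = await client.chat.completions.create({
    model: "gpt-3.5-turbo",
    messages,
  });
  const reply = completion.choices[0].message.content;

  // 3. Text to speech: write the synthesized audio to disk.
  const speech = await client.audio.speech.create({
    model: "tts-1",
    voice: "alloy",
    input: reply,
  });
  fs.writeFileSync(outputPath, Buffer.from(await speech.arrayBuffer()));

  // Return the updated history so the next turn keeps the conversation context.
  return {
    heard: transcription.text,
    reply,
    messages: [...messages, { role: "assistant", content: reply }],
  };
}
```

To use it, create the client with `new OpenAI()` (which reads `OPENAI_API_KEY` from the environment) and call `voiceTurn(client, "input.wav", "reply.mp3")`. Passing the returned `messages` back in as `history` on the next call carries the conversation forward.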
Comments

My word, I can't tell you how much I now look forward to your videos! Keep up the great work!

TestTalk

Just tried this, works great, thanks and I liked it too!

nabgilby

Hi, great video, well above my level, but I have a quick question: could you actually have a 'meaningful' conversation with it as you would with ChatGPT?

marcuscarter

Fantastic! Thanks so much for sharing, this is exactly what I was looking to do.

JoJoAcrylicArtwork

Immense value bro thanks for the informative videos!

mahtabalam

I just found your content and am glad you are making tutorials on this. Have you been able to mitigate the latency?

biancapietersz

Awesome, can't wait to try. Too bad GPT is all jacked lately. How would one do this using a wakeup word or other stimulation to get the program's attention?

AI_Escaped

Hi there, thanks for this great job.

Can you tell us how we can make this 2-in-1, meaning it also gives audio responses when users type their questions, not only when they speak them?

Thank you!

Edit:
Never mind, ChatGPT updated the code, and now it works via messages. Thanks.

crististanciu

You need more subscribers mate, 2.5k is a shame to be honest given the knowledge you are sharing, what is the YT algo up to?

mickelodiansurname

Pricing:

Google
Transcription: $0.024 / minute
TTS: $0.016 / 1K characters

OpenAI
Whisper: $0.006 / minute
TTS: $0.015 / 1K characters
TTS HD: $0.030 / 1K characters

greendsnow
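
As a quick illustration of what the OpenAI prices quoted in the comment above mean per exchange, here is a small estimate in Node.js. The function name `estimateSpeechCost` and the example numbers are made up for illustration, the prices are copied from the comment and may be out of date, and chat completion token costs are not included.

```javascript
// Prices copied from the comment above (USD); they may be out of date.
const PRICES = {
  whisperPerMinute: 0.006,
  ttsPer1kChars: 0.015,
  ttsHdPer1kChars: 0.030,
};

// Estimate the speech cost of one exchange: audio in (minutes) plus the
// spoken reply out (characters). Chat completion costs are not included.
function estimateSpeechCost(audioMinutes, replyChars, hd = false) {
  const ttsRate = hd ? PRICES.ttsHdPer1kChars : PRICES.ttsPer1kChars;
  return audioMinutes * PRICES.whisperPerMinute + (replyChars / 1000) * ttsRate;
}

// Example: 30 seconds of speech in, a 400-character reply out.
console.log(estimateSpeechCost(0.5, 400).toFixed(4)); // 0.0090
```

At these rates, the text-to-speech step costs more than the transcription step for a typical short exchange.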

Can we do the speech-to-text part with Whisper from OpenAI but get the actual response from some other model, like Gemini or any other local model endpoint, instead of ChatGPT?

firaunic

As I understand it, this is connected to the general GPT-3.5 model, not to a customized API Assistant? It would be cool to create the same voice-input, voice-output setup but with your own customized assistant, similar to what they did during the DevDay presentation :)

EL-tirol

Love this! Thank you! How would I swap out OpenAI TTS for Eleven Labs TTS model?

ventureaddict

Excellent content... I'm also having an issue with 'node install speaker'. Rosetta didn't seem to help. Any other ideas? Without speaker, the app otherwise seems to work but fails after hitting 'enter'

AndAllTravel

Hello, how can I attach the audio to an assistant using thread messages? Thank you

MariastellaALBARELLI

Is there a way to customise the voice?

pennychewer

Are there ways to tweak the output in terms of pacing and vocal intensity?

aranthos

Hey, great vid. Any way to add TTS as a function to the new GPT-4 preview OpenAI assistant? Thx

musumo

Hey, can I add this to a UI, and how can I do it? Can you advise me, please? Thank you

doston

Unfortunately, I got stuck with an error:

Press Enter when you're ready to start speaking.

Recording... Press Enter to stop

Recording stopped, processing audio...
Error: 400 - Bad Request

usumzkn