Build a Gemini Voice Assistant in Python

Показать описание

Build Your Own AI Voice Assistant with Python: Gemini + OpenAI TTS + Faster Whisper

In this in-depth tutorial, I'll show you how to create a powerful voice assistant using Python, the Gemini API, OpenAI's Text-to-Speech (TTS) engine, and the supercharged Whisper transcription model.

Key Skills You'll Master:

Gemini Integration: Learn to use Google's Gemini API for smooth, advanced conversation management with your AI.
OpenAI TTS: Give your assistant a realistic voice with OpenAI's high-quality text-to-speech streaming API.
Faster Whisper: Boost speech recognition accuracy and speed with the cutting-edge Whisper transcription model.
Python Coding: Follow my clear code explanations and build your project step-by-step.

#googlegemini #ai #python

Рекомендации по теме

Комментарии

Ai Austin you the man! Excellent content! Much respect! 🤜🏾🤛🏼

johnbarros

Thanks, I'm working on it tomorrow 😊

mbegangsylvain

if I could put my two cents on having or not the source code from discord or github. This video as it is, AI Austin is giving us the free source code of this amazing tutorial as always, we just have to be patient and following video and type away those line of codes // codes blocks and you got the full working snippet as always Thanks a lot AI Austin (Note: Copy / Paste really won't help you a lot to learn and understand any snippet explained regardless how good is the author explaining such as it is the case here. If one wants to learn about coding, python language and get 'first hand' experience, one must code, search, READ !!!, PRACTICE!!!, Golden rules try and error, you you stand up /// Error Crash --- Troubleshoot and fix "you stand up" Cheers!!! Happy coding !!!

dionisnavarro

Thnx but I am getting error module 'google.generativeai' has no attribute 'GenerativeModel' tried it in many versions of python and installed it many times.Please help

raunaksharma

I heavily modified your code and used it as a base, I integrated speech interupts, having it so there's a convosation loop until the language model classifies the convosation as dead or allp oints have been answered, I also went ahead and integrated functions, so it says a specific keyword then i use another script to monitor the output and it blocks the keyword from being said, then takes that and uses that keyword to trigger real actions, so far I've got it to turn my lights on and off

Nelson_Bazzard

Please show how to install it in Android as it would be more useful in Android, very good video thanks for the insights

rgspacelictics

i am a windows user can you give me the link of pyAudio binding for the portAudio library. help me i have to submit my ai project in 2 weeks
nice explaination
❤

Pro-edit-No.

Great tutorial man. I just wonder if running this on the cloud is possible? And what would be your recommendation?
Keep up the good work man!

DjTechDJ

8:39 this is so insane to me. That your actual code is a message directly to the AI. "Hey keep it short mr. It dont want any bullshit from you. I use you to talk and that's it"

Tomblom

This is a great tutorial! I was wondering if you could consider adding a vision component to it in the future will be 🔥

nehapant

looove this! thanks a lot! Please how do you get the mouth to move while you talk on the video? I have been trying to learn this! thanks again for that

Ronaldograxa

Newbie in coding here🙋‍♂. Hello from Mexico!! Hi Austin. For days now I have tried to follow tutorials to create my own AI assistant with the little understading I have about python. I was ALMOST succesfull to use Crewai but something always went wrong in the end so I got frustrated and ended up creating one in OPENAI in 2 minutes but i have to pay constantly for tokens.😔.

I came across your video and its amazing! Even someone like me can understand it. Thank you!!

Now, I have only 2 questions:
1. I didnt undersant in which of the code part should I include a "context" so I can give it a specific personality?
2. How does this code keep "memory" or keep track of the conversations so it keeps "learning" and being costumizable? (Aside from the pre-programmed context i have made assistants "learn" to analyze an attached file so it can understand better how to behave.)

Thank you in advance and I wish you all the best!

novadocnews

hello Austin everything was working so well until importing WhisperModel, then i got this error:

from av._core import time_base, library_versions
ModuleNotFoundError: No module named 'av._core'

I have been trying to solve it in a bunch of ways but still nothing T_T do you know how could i fix it? im working with windows wsl vscode thx you

anyways, is there an alternative library for achieving these same functions for the program?

patflc

Why did you use whisper instead of uploading the audio to Gemini since it handles it well natively with barely any downtime?

davidthiwa

This shit is too funny, I had to work my way because I did it in a conda env ( I do not recommand) and at the end not able to wake up Gemini (maybe my french canadian accent) I looked the end of the video to hear the prononciation and the video wake up my Gemini and wrote me a Dank rap song !!!!

julienduchesneau

How can you not like this style and learning something, you bring it as adventure story, very good use of GenAI, if you do this you can partner with Netflix

mulderbm

can we get the openai api without paying credits?

sowbharnikadevi

bro why are you not uploading the source code in github?
🙃

neoyt

With the google ai API key 3:49, How long I can use that API key? and is it completely free or do I need to pay any money? Anybody comment.

bosvikanimations

Bro why are suggesting paid audios like there are many free and open-source libraries for it like pyaudio

yunik_developer

Build a Gemini Voice Assistant in Python

Build a Gemini Voice Assistant in Python

Build a Google Gemini Powered Voice Assistant (chatbot) with Python

Hands-on Tutorial: Build a Google Gemini-Powered Voice Assistant on Raspberry Pi

Voice AI Assistant using Python Tutorial | Build JARVIS with Google Gemini

Google’s Gemini Live AI assistant is INSANE! #google #ai #tech

How to Use Google Gemini - Including New Prompts

Google Gemini vs Siri (Apple Intelligence) - 50 Commands Each

Talking to Google’s Gemini Live AI Voice Chat - Full Setup!

Virtual Assistant Upwork Bid Proposal | Personal Pitch Milestone for week 7 with Alx.

Google Demos 'Gemini Live' AI Android Voice Chatbot

Build a Multilingual Voice Assistant with Google Gemini API and Python

build a gemini voice assistant in python

How to Switch Back to Google Assistant from Gemini #gemini #google #ai

Google Gemini vs Google Assistant - Which One Is Better? (Tested On The Pixel 8 Pro)

Gemini Gems: Ultimate Guide to Creating Personalised AI Assistants!

Build a Vertex AI Gemini Powered Voice Assistant (chatbot) with Python

Gemini Live Is WILD | Hands-on with Google's new conversational AI

Google Gemini on Android: Full Review & Features

How to Use Gemini AI by Google ✦ Tutorial for Beginners

Google Assistant❌ Gemini AI✅ | AI in Smartphones

Build Your Voice AI Assistant Using Google Gemini AI For Phone Calls

Parallel Processing for Speeding-up a Voice Assistant with Google Gemini

How to Make the Most of Gemini: Your Guide to Using the AI Assistant

Bard is now Gemini! Get it on your Android device today! #Google #Gemini #bard