Using Python & OpenAI Realtime API: Live Audio Demo + Function Calling to Talk with Your local PC

preview_player
Показать описание
Discover how to set up Python with OpenAI’s Realtime API for an interactive, real-time experience using your PC! In this video, I’ll walk you through connecting a live audio demo using a local mic and speaker, plus showcase function calling capabilities that enable direct interactions with your PC—like opening Notepad on command. Learn the ins and outs of configuring Python for these powerful integrations, bringing conversational AI directly to your desktop environment.
What You’ll Learn:
Configuring Python for OpenAI’s Realtime API
Setting up audio input/output with a mic and speaker
Using function calling to communicate with your PC and control local apps (e.g., open Notepad)
Perfect for developers, AI enthusiasts, and anyone curious about AI-powered desktop interactions. Don’t forget to subscribe for more engaging tutorials!"

Рекомендации по теме
Комментарии
Автор

Thanks for taking time and sharing your knowledge with us

Pure_Science_and_Technology
Автор

Thankyou, needed to know how to function call

catalyst
Автор

You should get a lot more views, thank you!

marouanbelhaj
Автор

Thanks for the video ! Is there a pre-built method to stop the realtime api automatically when the user is not speaking ( no speech detected) like a timeout attribute or something similar ?
thanks in advance .

oussamajmaa
Автор

I tried this without headphones and the feedback from the speakers made the app think that i was trying to interrupt it.. it is weird cause the nodejs/browser example code doesn't do this. do you have any ideas?

vietnguyen
Автор

Thanks for this video. So when I play this on my laptop, then the assistant voice sort of interferes with my own voice, and so the experience sort of looks bad! And when I try to plug my earphones, then the voice is not audible in my earphones. Any help?
Also can you also include as how you can do maybe RAG in this- with/without function calling.

adityajindal
Автор

I run it without wearing a headphone and it keeps capturing the output audio, causing the echo. Is there anyway I can solve it in the software?

Jetdohhhh
Автор

hi, i'm getting this error on run.

Failed to connect to OpenAI: Error connecting to SOCKS5 proxy 127.0.0.1:10808: [WinError 10061] No connection could be made because the target machine actively refused it

danklad