SUPER Fast AI Real Time Speech to Text Transcribtion - Faster Whisper / Python

Показать описание

SUPER Fast AI Real Time Voice to Text Transcribtion - Faster Whisper / Python

👊 Become a member and get access to GitHub:

Get a FREE 45+ ChatGPT Prompts PDF here:
📧 Join the newsletter:

🌐 My website:

Faster-Whisperer:

I created a almost zero latency real time AI voice to text transcribtion using faster whisperer and python. We are gonna look at some use cases for the script and a preview of my upcoming video. Enjoy!

00:00 Intro
00:21 Real Time AI Transcribtion "Mr.Beast"
01:25 Setup / Python Code
03:33 Real Time AI Transcribtion "Sentiment Analysis"
05:51 Real Time AI Transcribtion "Secret Project"
08:14 Conclusion

Рекомендации по теме

Комментарии

Epic! - These videos are some of the best stuff on YouTube - love the idea with the image generation at the end

OliNorwell

Tips: You can transform your device's audio output into a "microphone" on Windows, so you don't need to place your headphones over your microphone.

1. Press Windows key + R -> type "mmsys.cpl"
2. In the Recording tab, enable the Stereo Mix option. Now, "Stereo Mix" is an available microphone option! You can select it as the audio input.

bim-techs

Pulling in people with a flashy thumbnail of a Python code that works and then trying to monetize your code based on a library that is already supposed to be open source is in my opinion bs. it is not fair for beginners that might not know Python or whisper very well. for that I give you a thumbs down!

filipphenderson

This is amazing and inspiring. I love the ending of the video and can’t wait for Wednesday. As a dyslexic person I think you unlocked a new use case for learning.

theraybae

5:51 Neutral = I'm gonna go troll now. Funny stuff, great video! Thanks

jaujud

There is a product for Live video Transcription there. Live text services are expensive and does not work on many current languages.. Set up a server/service that will ingest a RTMP video source, delay the video and overlay text on video in perfect sync. then offer RTMP output with burned in Live text. :) There is need for this service.

ReadyMedia-no

Good to see transcription and generate responses as audio in real-time for phone call

benscottbongiben

Fantastic !!! A bit fast in explaining and showing, but I can always pause!

ArmandoMenicacci

Hey man this is really cool! I'd like to know if you:
1) used the whisper v3 model? or the v2?
2) If you have seen the demos from gpt4, they also showed that gpt ASR is better than whisper v3, wonder if it will be open like whisper.

ferluisch

Amazing and inspiring work! Kris what about something less powerful but better accessible in terms of hardware?

HammerOnTheNet

I have tried to get this to run on M1 MacBook. No joy. The CPU maxes out even with the tiny model. But then I tried with the Whisper.cpp implementation which is compiled for apple silicon. I found a whisper-cpp-python wrapper for that library. That actually runs and is far less CPU bound. It has a bit of a stutter, it is not as clean, it misses words between the chunk processing but you can see that with just a little bit more power it could work.

svenborgers

wow !! great video !!! Thank you for being so generous and teaching this to us, this is epic stuff! I can already start see all kinds of use cases, I cant wait to get it running, I'm really looking forward to Wednesday's video . Thanks again from Canada

ryanjames

Interesting stuff on the image creation at the end while talking, not sure if you are taking into consideration puctuation in you sentences? Im pretty sure this would have to do with something cool, maby keeping an overview of all the text that has been moving out of the "buffer" for style ? Looks like something I could have a lot of fun with, do not have the GPU though :/ Colab however.

kimsteinhaug

Nice video!! thanks for your help in this topics!!

cristobalmunoz

Excellent! Thank you so much for sharing!

radudamianov

Hello and great to see this kind of contents.

I actually have a question about speech to text in another language and for example Swedish.. and passing it throw llama for correction, .. maybe for a meeting conference or something like that .. what do you suggest ?

JohannaKarlsson

I have been looking where to start, fantastic work, where can I have the code for testing

hjoseph

Thanks for sharing your knowledge/experience.
I'm bit perplexed. The description here mentions 45+ prompts in the PDF book, the newsletter website says 40+, and the PDF doc says 35+. Which number is correct?

t-dsai

This will be a good tool for language immersion chinese / japanese / indonesian along with the deepl clipboard tool, edge browsers tts engine.

aoeu

thanks this is great! Where can I find the actual code you have on your screen? Struggling to find it on the github

magnoliasphinkter

SUPER Fast AI Real Time Speech to Text Transcribtion - Faster Whisper / Python

SUPER Fast AI Real Time Speech to Text Transcribtion - Faster Whisper / Python

Best FREE Speech to Text AI - Super Whisper AI - Fast AI Real Time Speech to Text Transcription

OpenAI's Whisper Realtime Speech Recognition Chatbot Test

INCREDIBLE Fast AI Real Time Speech to Text Transcribtion - Build From Scratch

World’s Fastest Talking AI: Deepgram + Groq

THE FUTURE OF HUMANITY: A.I Predicts 400 Years In 3 Minutes (4K)

Harnessing the Power of AI | Avichal Garg

Microsoft's Magentic One: This FREE AI AGENT can CONTROL BROWSER, DO CODING & MORE!

Detect Objects in REAL-TIME with YOLOv7 on Your Phone!

BIG UPDATE: AI Agent Now Calls And Book Appointments - OpenAI Realtime API

IQ TEST

Midjourney and Leonardo.ai are NO longer NEEDED | How to make UNLIMITED high quality AI images

AI Robot caught on cam fighting back at humans

7 New AI Tools You Won't Believe Exist

How To Self Study AI FAST

FAST real-time AI videos in 5 minutes! [Python Windows CUDA Tutorial]

Blazing Fast AI Generations with SDXL Turbo + Local Live painting

PyTorch in 100 Seconds

Training an unbeatable AI in Trackmania

Insane AI Learned Minecraft - One Step Closer to Simulated Reality...

The Top 9 AI Breakthroughs of 2024 (You Won’t Believe Are Real)

POV: someone saw your character ai chats #characterai

Fast counting with GPT-4o

Groq - Fastest AI with real-time data (using Function Calling)