Build your own real-time voice command recognition model with TensorFlow

preview_player
Показать описание
In this TensorFlow Tutorial we build our own real-time voice command recognition model that can then control a game.

▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬

Timeline:
00:00 Intro
00:52 Build Model: Google Colab Walkthrough
05:09 Save & Download model
08:15 Add our preprocessing code
12:39 Code the final project with microphone input
18:44 Final project testing!!!

#MachineLearning #DeepLearning
Рекомендации по теме
Комментарии
Автор

This is fantastic. I’m a Newbie to Python and neural nets, but your explanations are great and pretty straightforward. Question - what additional steps would I take to run this on my own local device (pi 4)? And what else would I need to do to introduce new commands such as as trigger word and “turn off the lights”? Would I need to create my own audio samples, save them to new folders, and retrain to retrain the model? Thanks for any guidance! (if you couldn’t tell I’m DONE w Google Home latency, recreating my own. Ambitious! Need help!)

donahue
Автор

Thank you very much for the tranings. But I think there should be a more complex and more advanced voice recognition, voice classification and similar training series if you see fit. You know, trainings on sound are limited.

gokhanersoz
Автор

Good video, excellent explanation, I have a question, can the same program be trained to recognize only a specific voice? if so, could you explain it to me? I would be very grateful.

erickd
Автор

Can you do a video regarding the newer version? The run interface now has a different code

seanadin
Автор

The code on TensorFlow website was changed :(

nguyent
Автор

Can you please post building text to speech models from scratch?

geekyprogrammer
Автор

What did you do so the program does not picks up ambient noise or actually works with the commands given? it seems the model lacks ambient noise data sets and whenever ran it only keeps spamming the first command, but yours works perfectly, how to achieve this?

Cyka_Blyatus
Автор

This tutorial is great. I find that the key to build accurate model is gathering quality data a lot. And that sounds arduous work. didn't get good result with 200 examples.

Edit: I found the model's accuracy is the way poor than I expected. Maybe it's due to the microphone I'm using and it's needed to taken care of before predicting process.

oxydol
Автор

thankyou dude its a hundred percent work for me but after couple minutes it crashed :(

itsrairamones
Автор

On which Tensorflow version this was made? because Colab uses latest, but older one should work without problems.

MrIlvis
Автор

Can i get similar for English alphabets

swasthikk
Автор

They changed the Code. Could u you do a quick update?

danielbogemann
Автор

How can it be that in the video it gives nothing with out speaking. While if i run the code from github it predicts random stuff when im not speaking.

TheSaukkio
Автор

Got the error: "Could not import the PyAudio C module 'pyaudio._portaudio'."
And couldn't find the solution...
Macbook M1 Pro

tankado_ndakota
Автор

Please provide me the model i need argently I am stuck in it

sanjeetjha
Автор

Its a shame, you cant train your own model.

YvtqKn
visit shbcf.ru