Build a Custom ASR Model in TensorFlow: A Step-by-Step Tutorial

Показать описание

Learn the basics of speech recognition with TensorFlow and build practical applications with this tutorial. Discover the history of speech recognition and the challenges that come with dealing with human speech variability, similar-sounding words, and low-quality audio. Explore the various techniques used in speech recognition, such as machine learning algorithms like deep learning, Hidden Markov Models (HMM), Dynamic Time Warping (DTW), and phonetic-based approaches. Discover how transformers have transformed the field of speech recognition and how they can be used to recognize different languages, understand natural language, and distinguish between similar words. Follow along with the tutorial to build a basic speech recognition model using TensorFlow, combining a 2D convolutional neural network (CNN), recurrent neural network (RNN), and Connectionist Temporal Classification (CTC), and apply this knowledge to develop practical applications.

#machinelearning #python #tensorflow #opencv #ASR

Рекомендации по теме

Комментарии

A good presentation. Thank you for providing this information.

vkrts

did you use Mel frequency cepstral coefficients (MFCC) as feature extraction?
if no, what is the feature extraction used?

mariamjbani-amer

Thank you so much. Can you also provide a video for TCN model? I am struggling to get the result using TCN.

shrijanregmi

This is so nice. Thank you very much for sharing your knowledge.

omochi

thank you for efforts, after train and save model how i use to transcript other audio not the one i trained and exist on csv file ? please tell me ? another thing how i know train is good with curves.

space_x

That's great, thanks for your sharing.
After creating the model, can we use this model with openai whisper ?

tringuyen-ivyf

thank you for the nice tutorial I think you did it with CTC mode which is sequence to sequence. I want to do the same project by using my dataset by using Listen attend and spell model and there is no any tutorial done on that area can you help me on how to implement it??

GelanaAbdisa

can i use this for making a model for arabic language ?

pesworld

Thanks.. Fantastic work.. Please can I run it in my own CPU computer??

mustafaaa

Will there be a PyTorch version of this tutorial??? It would be great. Thanks for such helpful video.

kishanbangsi

why you select 1000 as epochs number ?

mariamjbani-amer

nice explaination but please can you add a method in which user can recognize his own voice by repeating dataset sentences

navyaanzaheen

when i try your code, on the output folder model I did not get model.onnx file
and when i test .h model i get error message said "model, onnx not found"
can you help me ?

mariamjbani-amer

I am looking for some resources to learn ASR but I couldnot find good resources so could you please share me some ASR resources. Thank You!

ishanpanta

Could you please make video on project converting text to speech ?

yashkewlani

can you provide your pretrained model for use as we cannot train on cpu

RoshanRawat-gv

why dont you put microphone on your model? i just wonder

melapobia

Build a Custom ASR Model in TensorFlow: A Step-by-Step Tutorial

Build a Custom ASR Model in TensorFlow: A Step-by-Step Tutorial

Speech Recognition in Python | finetune wav2vec2 model for a custom ASR model

Build your own real-time voice command recognition model with TensorFlow

I Built a Personal Speech Recognition System for my AI Assistant

build a custom asr model in tensorflow a step by step tutorial

Build a Speech Recognition System on a Raspberry Pi

Mia Chang - Running the First Automatic Speech Recognition (ASR) Model with HuggingFace

Getting Started With Hugging Face in 15 Minutes | Transformers, Pipeline, Tokenizer, Models

Build Speech Recognition for any Language with 🤗 Transformers - Finetune XLSR-Wav2Vec2 (Hindi)

Best FREE Speech to Text AI - Whisper AI

14. Building ASR model in Kaldi Toolkit using GPU

PyTorch or Tensorflow? Which Should YOU Learn!

PyTorch in 100 Seconds

Conformer-1: a new large scale/robust speech recognition model

Python Speech Recognition Tutorial – Full Course for Beginners

OpenAI Whisper Demo: Convert Speech to Text in Python

Best Fast Food Combo Meal

How to Tame the New Carcharodontosaurus in ARK #Syntac #Ark #ArkSurvival

My Latest Bike & What I Built It For - YETI Cycles ASR XC MTB Custom Build

Master Fine-Tuning OpenAI Whisper with PyTorch for Custom ASR Tasks || PART-1

Build and Deploy a Machine Learning App in 2 Minutes

Tutorial 2- Fine Tuning Pretrained Model On Custom Dataset Using 🤗 Transformer

Computer Science: Build Automatic Speech Recognition (ASR) from scratch

Building an AI-Powered ASR Engine for Vernacular Languages