Build a Custom OCR Model in TensorFlow: A Step-by-Step Tutorial

preview_player
Показать описание
In this tutorial, we will explore how to recognize text from images using TensorFlow and the CTC loss function in a neural network model. We will start with an introduction to text recognition and the different approaches used to extract text from images. We will then dive into the specifics of using TensorFlow and the CTC loss function to build our custom OCR system. The tutorial will also introduce a new open-source library called MLTU (Machine Learning Training Utilities) that can be used to store code for future projects. By the end of this tutorial, you will have a working OCR model that you can use to recognize text from images. This is the first part of a tutorial series, so stay tuned for more in-depth content on text recognition and other machine-learning topics.

#machinelearning #python #tensorflow #opencv #ocr
Рекомендации по теме
Комментарии
Автор

I have a project assignment. I am making a mobile application that will convert Braille to plain text. I will do the mobile part with Flutter. I don't know much about model training. Do you have any advice? How can I do this in the most correct way?

furkan
Автор

Hello, I have a quick question. I’m using a custom dataset. What’s the most ideal dataset size? My CER stays at 1.00. Is it because my dataset is too small?

nickmoreno
Автор

hi sir, just a primary question, what pre-knowledge do i need to fully understand the tutroial and the other ones?

Anas-nwmf
Автор

Hi I have a question. You trained your model with annotation_train.txt and annotation_test.txt. I am curious about what kind of things you wrote in those files. Because i am also trying to create my custom model. Thanks for your response in advance

AbduqayumRasulmuhamedov
Автор

Where can i find the dataset and the image folder?

winterx
Автор

which architecture is this model based on ?
Can you provide me a way for researching more in the state of the art of OCR especially for digital character recognition.

hamzaomari
Автор

Hi can I ask for your Dataset I really want to try to train the model again. Thank you!

midhauxgaming
Автор

Is there any tutorial u recomand for text detection please ?

bouchrasaidi
Автор

Hi Thanks for video. While I watching this, I saw that the WER is 1.000. what does it mean? why does the WER doesn't goes down?

astronaut
Автор

Thanks for the tutorial. Curious if there is a good tutorial on Text Detection you recommend?

maggiezhang
Автор

Hi, hello. I want to say that this tutorial is amazing!! Thank you very much.

alanferrari
Автор

On an M1 Max GPU training has been running for 3+ hours on first epoch still...I think maybe I've done something wrong but don't want to end the script at this point. I added many custom examples of alphanumeric sequences. I feel like the M1 Max should be able to handle a batch size of 1024. Do you think this sounds like a lower batch size is needed?

hovat
Автор

This was a great video! I’m trying to get an OCR model to work with Hebrew handwriting, what’s my best options for gathering a training set?

TeslaTube
Автор

Hey there I really like your videos but I got error at the beginning of the video, JUST when I start the package is not installed to my VS code please can I get help?

temepc
Автор

Hi, Thanks for all your Videos .Those Videos helped me a lot
you deserve more subs and views on your channel
Thanks Again

codewithme
Автор

First of all, well done for this video. It is very interesting. I just wanted to ask you a question. Is this possible to use this in a video instead of images? I am trying to train a model that reads number plates. However, number plates vary from a country to another and I am trying to train a model with my country numberplates format.

jongameshow
Автор

Hi thank you for posting ocr videos. can you please tell what is inference speed of the model while using CPU?

primalvision
Автор

what can I do if i'm getting the next error in training?: Failed to find data adapter that can handle input: <class 'mltu.dataProvider.DataProvider'>, <class 'NoneType'>

alexrobles
Автор

may i use this for business card extraction

_Jaisreekrishna
Автор

where can i get your complete notebook for the reference

pranay
join shbcf.ru