Step-By-Step Handwriting Words Recognition With PyTorch

preview_player
Показать описание
In this tutorial, we will extend the previous tutorial to build a custom PyTorch model using the IAM Dataset for recognizing handwritten text. This dataset is commonly used as a benchmark for OCR systems and can provide a valuable foundation for constructing your own OCR system. We will be using several machine learning libraries and techniques to preprocess the data, augment it, and train a deep learning model.

During this tutorial, we will cover the following:
- An overview of the IAM Dataset and handwritten text recognition;
- Code walkthrough for importing required modules and libraries;
- Downloading and extracting the dataset using the download_and_unzip function;
- Preprocessing the dataset, including data parsing, vocab set creation, and maximum label length;
- Data augmentation techniques to improve model performance;
- A deep dive into PyTorch model training with custom CTC loss function and callbacks;
- Evaluation metrics like CER and WER to monitor training progress;
- Saving and exporting the trained PyTorch model in ONNX format.

By the end of this tutorial, you will have a good understanding of how to train a custom PyTorch model for recognizing handwritten text using the IAM Dataset. Join me in this exciting journey of handwriting recognition with PyTorch!

#machinelearning #python #pytorch #ocr #tensorflow
Рекомендации по теме
Комментарии
Автор

Hello, my dear friend! Honestly, words can't express how impressed I am by the amazing content you create. Your way of explaining AI model development is truly inspiring and reflects your extensive expertise😍😍. I have a dream of designing a model that can extract handwritten Arabic text, but I feel a bit lost on where to start. Could you kindly guide me with your great advice or provide some initial steps to get started? I'd be so grateful for your support, and thank you for all that you share!

m_e
Автор

@PyLessons  when i try to execute fit method
I got error
UnboundLocalError : cannot access local variable 'loss_info' where it is not associated with a value

nareshmalviya
Автор

Thank you so much but please can you tell how I can use my inputs to test it I've already trained with a different dataset

ekchills
Автор

how can I use the model to use my camera to scan a handwritten word?

AyenTorres-wegp
Автор

The text version of the tutorial has a google drive link at the end containing the trained model but I am unable to get it running
can I get some help ?

amigohundred
Автор

Thank for the video, I wanna use your code, but I have a large word dataset should change anything to you code when training?

سيفالباشا-ذع
Автор

hey how can I use nougat to make it work more efficiently with maths and other things to any idea?

rishabh
Автор

this works well with the dataset images but if i pass some other word images not from the dataset then it cant predict. same thing happens with the tensorflow model as well. Am i doing something wrong?

ParikshitBarua
Автор

Hello. Thank you for the tutorial! I attempted to run the code on my end, but I get a 502 bad gateway for dataset link provided. Was the link changed?

mahmoudelsayed
Автор

Great video !
Question - What if we want to extract text from image, (Not hand written) ? Will the same model work ?

aspboss
Автор

Hello, I tell you that I should try to do the first thing, having the minimum required to start with the code. This is the import of the libraries but I get error after error, did you already have those libraries installed before? Or did you install them for this video?

jahstinarguedas
Автор

Hello Thank you very much for your content. Can I please know that can I use this code foridentify handwritten text in a full page

pasinduminiruwan
Автор

@PyLessons data_preprocessors=[ImageReader()],

TypeError: ImageReader.__init__() missing 1 required positional argument: 'image_class'
I got this error, how to solve this .I enable to find

SaiGaneshNeerumalla-so
Автор

Can you please tell me how can we take input from our side after training the model with datasets ??

yashkewlani
Автор

Hello👋 can you please attach the links of latest datasets that are available. It would be a great help because i have project deadline within a week😅

hollybollyentertainer
Автор

Thank you so much! very well explained. But I'm getting while trying to download dataset, it show error "HTTPerror: Bad Gateway"
Please help me in this regard if possible

science
Автор

This works if image only contains 1 word or sentence (like 1 in your tensorflow video), but what if I want to train it on document like form or invoice what should I do?

sahilpawar
Автор

Hi Pakistan i need Medical Handwritten doctor prescription dataset for Machine learning in
: Urdu

afrididanial
Автор

ModuleNotFoundError: No module named 'mltu.torch.losses'

I already install mltu==1.0.1, but still didn't work

arifzanko
Автор

How can i modify the code to process the data once?

ruckydelmoro