Text Extraction from Product Images Using State-of-the-Art Deep Learning Techniques

preview_player
Показать описание
Extracting texts of various sizes, shapes and orientations from images containing multiple objects is an important problem in many contexts, especially, in connection to e-commerce, augmented reality assistance system in a natural scene, content moderation in social media platform, etc. The text from the image can be a richer and more accurate source of data than human inputs which can be used in several applications like Attribute Extraction, Offensive Text Classification, Product Matching, Compliance use cases, etc. Extracting text is achieved in 2 stages. Text detection: The detector detects the character locations in an image and then combines all characters close to each other to form a word based on an affinity score which is also predicted by the network. Since the model is at a character level, it can detect in any orientation. Post this, the text is then sent through the Recognizer module. Text Recognition: Detected text regions are sent to the CRNN-CTC network to obtain the final text. CNN’s are incorporated to obtain image features that are then passed to the LSTM network as shown in the below figure. Connectionist Temporal Classification(CTC) decoder operation is then applied to the LSTM outputs for all the time steps to finally obtain the raw text from the image.

Key Takeaways:

1. Understanding the need for text extraction from Product Images.
2. Deep Learning Techniques for detecting highly oriented text.
3. End to End understanding of CRNN-CTC network for text recognition with TF 2.0
4. Need for CTC loss and theoretical understanding of the same.
5. Usage of Text Extraction in various fields/domains.

About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Connect with us:
Рекомендации по теме
Комментарии
Автор

bro can give or explain how to write the code for this application

suryatej
Автор

Hello, i am a software developer. I like your project, congratulations. I wanted to try your project, but I'm getting an "Unkown Layer: Functional" error. I could not find any solution. can you help me?
Thank you from now.

bengisutozoglu
Автор

Where can I find the source code for this project. Would love to recreate this to enhance its capabilities.

anupambanerjee
Автор

Great talk Rajesh.
Very intuitive..!!

AkshayBahadur
Автор

Can you give the model performance details like fps and accuracy

sahilsalimmakandar
Автор

the speaker should mention about the papers he is using to explain stuff, since this is a public platform, it is expected from the speaker.

suvarnadhiraj
welcome to shbcf.ru