Extract text from Images with pytesseract & OpenCV (2021) | Python | #pyguru

Показать описание

In this tutorial we will be learning about how can we apply OCR using python, pytesseract and OpenCV.

Here we will learn 3 things
* Reading & displaying images using OpenCV
* Extracting text from images using pytesseract
* Text localisation or drawing bounding boxes around text using Pytesseract & OpenCV

Then install these two packages from pip

pytesseract : pip install pytesseract
opencv : pip install opencv-python

*************************

OCR Description

Optical Character Recognition(OCR) has been a popular task in Computer Vision. The popularity is because of its wide range of applications. It can be used for Data Entry for Business, Number Plate Recognition, Automated Passport Recognition, Quick Document Verification, IoT Application, Task Automation, and many more. Basically, any application which has a need to extract text from an image.

Tesseract is the most open-source software available for OCR. It was initially developed by HP as a tool in C++. Since 2006 it is developed by Google. The original software is available as a command-line tool for windows. We are living in a python world. Because of its popularity. The tool is also available in python developed and maintained as an opensource project.

Python-tesseract is a wrapper for Google’s Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Pillow and Leptonica imaging libraries, including jpeg, png, gif, bmp, tiff, and others. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file.

#pytesseract #ocr #python #iot #pythonlearning #automation #sublimetext4
#pythonguru python guru