pytesseract

Python wrapper for Google's Tesseract-OCR (Python 3)

Install

All systems
curl cmd.cat/pytesseract.sh
image/svg+xml Kali Linux
apt-get install python3-pytesseract

python3-pytesseract

Python wrapper for Google's Tesseract-OCR (Python 3)

This package contains a an optical character recognition (OCR) tool for Python. It will recognize and "read" the text embedded in images. Python-tesseract is a wrapper for Google's Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including jpeg, png, gif, bmp, tiff, and others, whereas tesseract-ocr by default only supports tiff and bmp. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file. This package installs the library for Python 3.