dawg2wordlist

Raw OCR Engine

Install

All systems
curl cmd.cat/dawg2wordlist.sh
Debian Debian
apt-get install tesseract-ocr
Ubuntu
apt-get install tesseract-ocr
Alpine
apk add tesseract-ocr
Arch Arch Linux
pacman -S tesseract
image/svg+xml Kali Linux
apt-get install tesseract-ocr
Fedora
dnf install tesseract
Windows (WSL2)
sudo apt-get update sudo apt-get install tesseract-ocr
OS X
brew install tesseract
Raspbian
apt-get install tesseract-ocr
Docker
docker run cmd.cat/dawg2wordlist dawg2wordlist powered by Commando

tesseract

Raw OCR Engine

tesseract-ocr

Tesseract command line OCR tool

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but since then it has been improved extensively by Google and is probably one of the most accurate open source OCR engines available. It can read a wide variety of image formats and convert them to text in over 40 languages. This package includes the command line tool.

tesseract-tools

Training tools for tesseract