ocr4gamera
toolkit for building OCR systems
Install
- All systems
  curl cmd.cat/ocr4gamera.sh
- Debian
  apt-get install python-gamera.toolkits.ocr
- Ubuntu
  apt-get install python-gamera.toolkits.ocr
- Kali Linux
  apt-get install python-gamera.toolkits.ocr
- Windows (WSL2)
  sudo apt-get update
  sudo apt-get install python-gamera.toolkits.ocr
- Raspbian
  apt-get install python-gamera.toolkits.ocr
- Dockerfile
  dockerfile.run/ocr4gamera
python-gamera.toolkits.ocr
The Gamera OCR Toolkit is meant to help build optical character recognition (OCR) systems for standard text documents. Even though it can be used as is, it is specifically designed to make individual steps of the recognition system customizable and replaceable. It provides:
* a flexible mechanism for plugging in custom page segmentation algorithms
* heuristic rules for dealing with diacritics, and for disambiguating commonly confused Roman characters (like comma and apostrophe, or lower and upper case ‘W’)
* a ready-to-run script, ocr4gamera, which acts as a basic OCR system

Note that the toolkit does not include any training data.
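
To illustrate the kind of heuristic rule mentioned above for telling a comma from an apostrophe, here is a minimal sketch in plain Python. It does not use the Gamera API and is not the toolkit's actual rule: the Glyph class, the function name, and the "compare against the middle of the text line" criterion are all assumptions made for illustration. The idea is only that two marks with nearly identical shapes can be separated by where they sit vertically within their text line.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    """Hypothetical glyph record (not a Gamera type).

    top/bottom are pixel rows of the bounding box; y grows downward.
    """
    label: str
    top: int
    bottom: int


def disambiguate_comma_apostrophe(glyph, line_top, line_bottom):
    """Relabel a comma/apostrophe candidate by its vertical position.

    line_top and line_bottom are the pixel rows bounding the text line,
    assumed to be known from an earlier page segmentation step.
    Glyphs with any other label are returned unchanged.
    """
    if glyph.label not in (",", "'"):
        return glyph.label
    line_middle = (line_top + line_bottom) / 2.0
    glyph_center = (glyph.top + glyph.bottom) / 2.0
    # Smaller y means higher on the page: a mark centered above the
    # middle of the line is treated as an apostrophe, otherwise a comma.
    return "'" if glyph_center < line_middle else ","


if __name__ == "__main__":
    # A small mark near the top of a text line spanning rows 10..40.
    high_mark = Glyph(label=",", top=10, bottom=16)
    # A small mark hanging around the bottom of the same line.
    low_mark = Glyph(label="'", top=34, bottom=44)
    print(disambiguate_comma_apostrophe(high_mark, 10, 40))
    print(disambiguate_comma_apostrophe(low_mark, 10, 40))
```

A real system would likely measure position relative to an estimated baseline and x-height rather than the line's bounding box, but the structure (classifier label first, positional rule second) stays the same.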