ocrmypdf

Generate a searchable PDF or PDF/A from a scanned PDF or an image of text. More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.

Install

All systems
curl cmd.cat/ocrmypdf.sh
Debian Debian
apt-get install ocrmypdf
Ubuntu
apt-get install ocrmypdf
Alpine
apk add ocrmypdf
image/svg+xml Kali Linux
apt-get install ocrmypdf
Fedora
dnf install ocrmypdf
Windows (WSL2)
sudo apt-get update sudo apt-get install ocrmypdf
OS X
brew install ocrmypdf
Raspbian
apt-get install ocrmypdf
Docker
docker run cmd.cat/ocrmypdf ocrmypdf powered by Commando

Generate a searchable PDF or PDF/A from a scanned PDF or an image of text. More information: <https://ocrmypdf.readthedocs.io/en/latest/cookbook.html>.

  • Create a new searchable PDF/A file from a scanned PDF or image file:
    ocrmypdf path/to/input_file path/to/output.pdf
  • Replace a scanned PDF file with a searchable PDF file:
    ocrmypdf path/to/file.pdf path/to/file.pdf
  • Skip pages of a mixed-format input PDF file that already contain text:
    ocrmypdf --skip-text path/to/input.pdf path/to/output.pdf
  • Clean, de-skew, and rotate pages of a poor scan:
    ocrmypdf --clean --deskew --rotate-pages path/to/input_file path/to/output.pdf
  • Set the metadata of the searchable PDF file:
    ocrmypdf --title "title" --author "author" --subject "subject" --keywords "keyword; key phrase; ..." path/to/input_file path/to/output.pdf
  • Display help:
    ocrmypdf --help

© tl;dr; authors and contributors