pstotext
Extract text from PostScript and PDF files
Install
- All systems
-
curl cmd.cat/pstotext.sh
- Debian
-
apt-get install pstotext
- Ubuntu
-
apt-get install pstotext
- Arch Linux
-
pacman -S pstotext
- Windows (WSL2)
-
sudo apt-get update
sudo apt-get install pstotext
- Raspbian
-
apt-get install pstotext
- Dockerfile
- dockerfile.run/pstotext
pstotext
Extract text from PostScript and PDF files
pstotext extracts text (in the ISO 8859-1 character set) from a PostScript or PDF (Portable Document Format) file. Thus, pstotext is similar to the ps2ascii program that comes with ghostscript. The output of pstotext is however better than that of ps2ascii, because pstotext deals better with punctuation and ligatures.