catdoc

MS-Word to TeX or plain text converter

Install

All systems
curl cmd.cat/catdoc.sh
Debian Debian
apt-get install catdoc
Ubuntu
apt-get install catdoc
Arch Arch Linux
pacman -S catdoc
image/svg+xml Kali Linux
apt-get install catdoc
Fedora
dnf install catdoc
Windows (WSL2)
sudo apt-get update sudo apt-get install catdoc
Raspbian
apt-get install catdoc

catdoc

MS-Word to TeX or plain text converter

This program extracts text from MS-Word files, trying to preserve as many special printable characters as possible. catdoc supports everything up to Word-97. Also supported are MS Write documents and RTF files. It doesn't even try to preserve fancy Word formatting, because Word users usually don't care about document structure, and it is this very thing which is important to LaTeX users. Also provided is xls2csv, which extracts data from Excel spreadsheets and outputs it in comma-separated-value format and catppt, which extracts data from PowerPoint presentations. This package suggests tk because it also includes wordview, an optional Tk-based GUI for catdoc. The MIME config provided in this package will use wordview if X is running, or catdoc directly if it is not.