html2text

advanced HTML to text converter

Install

All systems
curl cmd.cat/html2text.sh
Debian Debian
apt-get install html2text
Ubuntu
apt-get install html2text
Alpine
apk add html2text
Arch Arch Linux
pacman -S html2text
image/svg+xml Kali Linux
apt-get install html2text
Fedora
dnf install html2text
Windows (WSL2)
sudo apt-get update sudo apt-get install html2text
OS X
brew install html2text
Raspbian
apt-get install html2text
Docker
docker run cmd.cat/html2text html2text powered by Commando

html2text

advanced HTML to text converter

html2text is a converter from HTML to plain text. html2text reads HTML documents supplied in the command line (or from standard input), converts each of them into a stream of plain text characters and writes output to the file or the terminal. Debian version also can recognize encoding of documents, do on-fly input and output recoding. html2text was written because the author wasn't happy with the output of "lynx -dump" and so he wrote something better.

python3-html2text

Python module for converting HTML to Markdown text (Python3 version)

html2text is a Python module that converts a page of HTML into clean, easy-to-read plain ASCII text. Better yet, that ASCII also happens to be valid Markdown (a text-to-HTML format). This package contains the Python 3 version of the library, and a script, html2markdown.

python-html2text

Python module for converting HTML to Markdown text

html2text is a Python module that converts a page of HTML into clean, easy-to-read plain ASCII text. Better yet, that ASCII also happens to be valid Markdown (a text-to-HTML format). This package contains the Python 2 version of the library, and a script, html2markdown.