hxclean

HTML and XML manipulation utilities

Install

All systems
curl cmd.cat/hxclean.sh
Debian Debian
apt-get install html-xml-utils
Ubuntu
apt-get install html-xml-utils
Alpine
apk add html-xml-utils
image/svg+xml Kali Linux
apt-get install html-xml-utils
Windows (WSL2)
sudo apt-get update sudo apt-get install html-xml-utils
OS X
brew install html-xml-utils
Raspbian
apt-get install html-xml-utils
Docker
docker run cmd.cat/hxclean hxclean powered by Commando

html-xml-utils

HTML and XML manipulation utilities

HTML-XML-utils provides a number of simple utilities for manipulating and converting HTML and XML files in various ways. The suite consists of the following tools: asc2xml - convert from UTF-8 to &#nnn; entities xml2asc - convert from &#nnn; entities to UTF-8 hxaddid - add IDs to selected elements hxcite - replace bibliographic references by hyperlinks hxcite-mkbib - expand references and create bibliography hxclean - apply heuristics to correct an HTML file hxcopy - copy an HTML file while preserving relative links hxcount - count elements and attributes in HTML or XML files hxextract - extract selected elements hxincl - expand included HTML or XML files hxindex - create an alphabetically sorted index hxmkbib - create bibliography from a template hxmultitoc - create a table of contents for a set of HTML files hxname2id - move some ID= or NAME= from A elements to their parents hxnormalize - pretty-print an HTML file hxnsxml - convert output of hxxmlns back to normal XML hxnum - number section headings in an HTML file hxpipe - convert XML to a format easier to parse with Perl or AWK hxprintlinks - number links & add table of URLs at end of an HTML file hxprune - remove marked elements from an HTML file hxref - generate cross-references hxselect - extract elements that match a (CSS) selector hxtoc - insert a table of contents in an HTML file hxuncdata - replace CDATA sections by character entities hxunent - replace HTML predefined character entities to UTF-8 hxunpipe - convert output of pipe back to XML format hxunxmlns - replace "global names" by XML Namespace prefixes hxwls - list links in an HTML file hxxmlns - replace XML Namespace prefixes by "global names"