Table of Contents

Archiving

Not to confuse with Compression.

Webpages

online tools

convert HTML to TXT

Use a text mode browser like lynx, links2 or w3m and dump its output:

lynx -dump -display_charset UTF-8 input.html > output.txt
w3m -dump -o display_charset=UTF-8 input.html > output.txt

(source)