Short: Strip junk from HTML document
Author: Thomas Aglassinger
Uploader: Thomas Aglassinger
Type: comm/www
Version: 1.1 (5.1.99)
Requires: util/rexx/rexxdossupport.lha
Architecture: m68k-amigaos
Kurz: Entfernt Ranz von HTML-Dokument
TITLE
RaiskaaHTML
VERSION
1.1
AUTHOR
Thomas Aglassinger
DESCRIPTION
RaiskaaHTML strips various crap from HTML documents, making them use
less storage space and forces them to look like specified in your
browser settings instead of conforming to the personal taste of some
brain-dead web author.
It can remove annoying colors, invisible meta tags, useless scripts,
bloated comments, redundant white space and more.
It is intended to be used by people who often download HTML
documents from the web and store them on their local system to have
more reliable and quicker access to them.
FEATURES
- Removes colors, font faces and alignments
- Removes redundant white space like empty lines
- Removes inline scripts like JavaScript
- Removed invisible tags like and
- Removes SGML comments and DOCTYPE declaration
- Writes to new document or replaces original
- Works on single files or whole directories
- Reports parsing errors in SAS/c message browser
- Consists of a fast C program for doing the conversion and an ARexx
script to deal with user handling
NEW FEATURES
- Improved user control over documents with faulty HTML
- Improved CR/LF handling
- Fixed a couple of minor bugs
SPECIAL REQUIREMENTS
ARexx installed and running.
Rexxdossupport.library, Copyright by Hartmut Goebel. Available from
aminet:util/rexx/rexxdossupport.lha
AVAILABILITY
aminet:comm/www/RaiskaaHTML.lha
PRICE
Freeware.
DISTRIBUTABILITY
Freely distributable as long no files are added or removed from the
archive.