HTML to text converter
Version 1.0, January 22, 1997

Introduction

This is a HTML to text converter. It removes all tags from HTML files, substitutes HTML symbols for ASCII symbols, and removes extraneous blank lines.

This converter is freely available on the Internet, and is provided free of charge. It may be freely distributed with all files intact, but I would like to be informed if it is distributed on any packages. Users must NOT be charged for this converter.

It will hopefully be available for Linux (It is compiled using Red Hat release 3.0.3 (Picasso)), OS/2 Warp 3.0, DOS, and Windows 3.1 (95 and NT). Download the latest version from your favourite FTP site, or check at my home page.


Installation and use

For Linux, ask your System Administrator to put the executable html2txt int the /usr/bin directory.

For OS/2, DOS, and Windows, place the executable HTML2TXT.EXE in any directory, but preferably in your path.

For OS/2, you may like to create a program object for the executable. Then drop any HTML file onto this object, and it will convert it for you. (You might need the emx.dll runtime library.)

For Windows 3.1/95/NT, create an icon (or a shadow) pointing to the executable. You may need to use the icon's arguments to set the file to convert, or using the File Manager, drop the HTML file icon onto the executable's icon.


Usage modes

Usage is simple. Type

html2txt HTML file

Where the first argument, HTML file, is the file to convert. A new file, result.out, contains the resultant text file. Edit it to suit your needs.

Alternatively, type

html2txt -l HTML file

An additonal file, html.out, lists all HTML tags and symbols detected in HTML file, and the lines numbers where they were found.


Licensing

Everybody may freely use and distribute HTML2TXT, as long as all files are intact. PLEASE register by informing the author by email or post. A nice postcard would be cool. If you have suggestions for improvement, problems, complaints, or a job (!) for me, I would love to hear from you!

Mr Antonino Iannella
6 Bolingbroke Avenue
DEVON PARK SA 5008
AUSTRALIA

Email antonino@usa.net or nettuno@light.iinet.net.au.

Download latest versions from http://members.tripod.com/~antonino.


Version history

Version 1 - Released, no major bugs.