HTML parser in perl

Jim Davis (davis@dri.cornell.edu)
Sat, 3 Sep 1994 15:16:08 -0400


I have a new version of the HTML parser in perl.  It fixes no bugs,
but adds suggestions people have sent in
 * if stack does not end up empty, stack is printed to STDERR not STDOUT
 * new routine html_begin_doc and html_end_doc allow for user-specified
   initialization and cleanup code.
 * new interface parse_html_stream

The latter two are useful when you want to parse a series of HTML files
or objects, and they either are not in files or you don't want to use
the file interface for some reason.

ftp://dri.cornell.edu/pub/davis/html-parser.tar.Z