Re: HTML::Parser (HTML::LinkExtor) Broken?

Ed (ed@chronos.net)
Thu, 11 Mar 1999 11:57:44 +0000 (WET)


On Sat, 6 Mar 1999, Matthew Keller wrote:
[...]
> 	This returns all of the AREA elements marked with 'SHAPE="circle"', but
> none of the 'rect' areas. This page has a total of *27* link elements,
> but only *7* are returned, because 20 of them are AREA elements with
> 'SHAPE="rect"'
[...]
> -- Begin HTML Snippet --
> <AREA SHAPE="circle" COORDS="582,149,51"
> HREF="http://mattwork.potsdam.edu/friends.htm" ALT="My Friends">
> <AREA SHAPE="rect" COORDS="3,401,198,440",
> HREF="http://mattwork.potsdam.edu/Me/" ALT="Me Stuff">
> -- End HTML Snippet --

I don't think the breakage is what you think it is. The problem isn't
SHAPE="rect" as such, but the stray ',' after the closing quote of the
COORDS= attribute in the second element: delete that comma and it
starts working. I'm not sure how tolerant HTML::Parser is supposed to
be of such bad SGML, but that comma is the root of the problem.
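
If you can't fix the page itself, one possible workaround is to clean
the markup before handing it to the parser. What follows is only a
rough sketch under my own assumptions: strip_stray_commas is a
hypothetical helper (not part of HTML::Parser or HTML::LinkExtor), and
the regex only handles a comma that directly follows a double-quoted
attribute value. It would also touch text outside tags that happens to
look like ="...", so treat it as a stopgap.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of any module): remove a comma that
# directly follows a double-quoted attribute value, e.g. COORDS="1,2",
# Commas inside the quotes are left alone.
sub strip_stray_commas {
    my ($html) = @_;
    $html =~ s/(=\s*"[^"<>]*")\s*,/$1/g;
    return $html;
}

# The snippet from the original message, stray comma included.
my $html = <<'HTML';
<AREA SHAPE="circle" COORDS="582,149,51"
HREF="http://mattwork.potsdam.edu/friends.htm" ALT="My Friends">
<AREA SHAPE="rect" COORDS="3,401,198,440",
HREF="http://mattwork.potsdam.edu/Me/" ALT="Me Stuff">
HTML

# Print the cleaned markup; the comma after the COORDS value is gone.
print strip_stray_commas($html);
```

The cleaned string could then be passed to the extractor's parse()
method in place of the raw page.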
-- 
Ed          |                         | mailto:ed@chronos.net
Chronos.Net | http://www.chronos.net/ | mailto:info@chronos.net