Re: problems with extract_links on a frame tag

Luuk de Boer (luuk_de_boer@pi.net)
Mon, 6 Jan 1997 07:18:31 +1.00


On  3 Jan 97 at 9:33, Martijn Koster wrote:

> At 1:38 AM 1/3/97, Luuk de Boer wrote:
> 
> > I have a problem with extract_links and LinkExtor.pm.
> > ... So I don't know if [...] nobody care's.
> 
> He, I can't let a fellow countryman think that, now can I :-)

thanks :-)
> 
> >here's my test page which I use: (index3.htm)
> >here is my test perl code (problem.pl):
> >here's the output of problem.pl (problem.txt):
> 
> Did everyone make producing reproducable test cases their
> new-years resolution? Wonderful! :-)
> 
> 
> HTML:: is definately Gisle's turf, and I only had a quick look, but
> I found that adding frame and frameset to HTML::TreeBuilder.pm's
> %isBodyelement made your scripts return the missing links. Make sure
> you resolve them to relative to the base URL, sub extract_links
> seems to give them back raw:
> 
> link = menu.htm
> link = main.htm
> 
> Hope this is of use/interest...
> 

Yep you fixed it. Thanks. I looked a little further and saw also that 
embed isn't in the list. So there are some html tags difference 
between LinkExtor.pm and TreeBuilder.pm/Element.pm. I hope Gisle can 
make this correct in the next release so everything get the same 
result.

1000 x thanks.

Greetz...

Luuk
 ______________________________________________________________
| Luuk de Boer, luuk@pi.net                                    |
| Handleidingen voor het installeren van Shareware programma's |
| voor Planet Internet: http://www.pi.net/~luuk/               |
|--------------------------------------------------------------|