Re: problems with extract_links on a frame tag
Luuk de Boer (luuk_de_boer@pi.net)
Mon, 6 Jan 1997 07:18:31 +1.00
On 3 Jan 97 at 9:33, Martijn Koster wrote:
> At 1:38 AM 1/3/97, Luuk de Boer wrote:
>
> > I have a problem with extract_links and LinkExtor.pm.
> > ... So I don't know if [...] nobody care's.
>
> He, I can't let a fellow countryman think that, now can I :-)
thanks :-)
>
> >here's my test page which I use: (index3.htm)
> >here is my test perl code (problem.pl):
> >here's the output of problem.pl (problem.txt):
>
> Did everyone make producing reproducable test cases their
> new-years resolution? Wonderful! :-)
>
>
> HTML:: is definately Gisle's turf, and I only had a quick look, but
> I found that adding frame and frameset to HTML::TreeBuilder.pm's
> %isBodyelement made your scripts return the missing links. Make sure
> you resolve them to relative to the base URL, sub extract_links
> seems to give them back raw:
>
> link = menu.htm
> link = main.htm
>
> Hope this is of use/interest...
>
Yep you fixed it. Thanks. I looked a little further and saw also that
embed isn't in the list. So there are some html tags difference
between LinkExtor.pm and TreeBuilder.pm/Element.pm. I hope Gisle can
make this correct in the next release so everything get the same
result.
1000 x thanks.
Greetz...
Luuk
______________________________________________________________
| Luuk de Boer, luuk@pi.net |
| Handleidingen voor het installeren van Shareware programma's |
| voor Planet Internet: http://www.pi.net/~luuk/ |
|--------------------------------------------------------------|