Re: Annoying URL...

Marvin Simkin (Marvin.Simkin@aexp.com)
12 Apr 2000 07:04:02 -0700


Works fine for me using old libwww-perl-0.40.

Content-location is not a redirect, so don't try to follow it. It merely
tells you where the content was found, since you didn't specify a filename
on the URL.
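
To make the distinction concrete, here is a minimal sketch in Python (not Perl, and not LWP's actual logic) of how a link checker might decide whether a response sends it somewhere else. The names next_url and REDIRECT_STATUSES are hypothetical, made up for illustration; the only assumption is that a redirect means a 3xx status together with a Location header.

```python
from urllib.parse import urljoin

# Hypothetical helper for illustration; status codes that mean "go elsewhere".
REDIRECT_STATUSES = {301, 302, 303, 307, 308}


def next_url(request_url, status, headers):
    """Return the URL to fetch next, or None if this response is final."""
    # Header names are case-insensitive, so normalise them before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    if status in REDIRECT_STATUSES and "location" in lowered:
        # Only a 3xx status plus Location asks the client to move on.
        return urljoin(request_url, lowered["location"])
    # Content-Location on a 200 merely names the resource that was served.
    return None


if __name__ == "__main__":
    headers = {
        "Content-location": "http://www.svenskakyrkan.se/stift/harnosand/Index.htm",
        "Content-type": "text/html",
    }
    print(next_url("http://www.svenskakyrkan.se/stift/harnosand/", 200, headers))
```

With the headers from the response below, the function returns None: the 200 is already the final answer, and the link can be counted as good.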


GET http://www.svenskakyrkan.se/stift/harnosand/ HTTP/1.0
Host: www.svenskakyrkan.se
Pragma: no-cache
User-Agent: Mozilla/4.03 [en] (MOMspider)

HTTP/1.1 200 OK
Server: Microsoft-IIS/4.0
Content-location: http://www.svenskakyrkan.se/stift/harnosand/Index.htm
Set-cookie: SITESERVER=ID=77f490b5a0d0451693378d1a9e791d8f; expires=Monday, 01-Jan-2035 00:00:00 GMT; path=/; domain=.svenskakyrkan.se
Date: Wed, 12 Apr 2000 13:58:33 GMT
Content-type: text/html
Accept-ranges: bytes
Last-modified: Thu, 06 Apr 2000 10:58:35 GMT
Etag: "104754eb79fbf1:11695"
Content-length: 828

Härnösands stift på Internet <body> <p>På den här sidan används ramar som inte stöds av din webbläsare.</p> </body>




From:	jlpoutre%corp.nl.home.com@Internet on 2000-04-12 06:08 AM
To:	Mattias.Borell%lub.lu.se@Internet
cc:	libwww%perl.org@Internet (bcc: Marvin Simkin)
Subject:	Re: Annoying URL...

>Hi all.
>
>Within a local perl-based project we needed some link-checking to be done, and
>naturally we used LWP for the job, and it works in its usual charming way.
>However, we've stumbled across a specific URL that defies fetching with our
>code, or HEAD/GET as supplied with LWP.
>
>When you try to reach it with Netscape or *shudder* HotJava, it loads as it
>should. Lynx just freezes after loading a little data...
>
>So, could anyone shed any light on what's going on at a protocol (HTTP) level
>here? I can't seem to debug this properly...
>
> URL: http://www.svenskakyrkan.se/stift/harnosand/
>
>And yes, it's an IIS server, we've found that much out. :->
>

Weird, the first try with HEAD got me a "500 read timeout" error, but at
the second try this response:

bash-2.01$ HEAD http://www.svenskakyrkan.se/stift/harnosand/
200 OK
Date: Wed, 12 Apr 2000 13:01:51 GMT
Accept-Ranges: bytes
Server: Microsoft-IIS/4.0
Content-Length: 828
Content-Location: http://www.svenskakyrkan.se/stift/harnosand/Index.htm
Content-Type: text/html
ETag: "104754eb79fbf1:11695"
Last-Modified: Thu, 06 Apr 2000 10:58:35 GMT
Client-Date: Wed, 12 Apr 2000 13:02:51 GMT
Client-Peer: 195.17.98.104:80

Seems you need to follow the "Content-Location:" URL; typical of NT to have
that capitalized Index.htm file...

Hope this helps!

Regards,

Johannes la Poutre
Content Software Engineer

--
@Home Benelux BV
Gyroscoopweg 90-92
1042 AX  Amsterdam
The Netherlands
Tel. +31(0)20 88 555 68
Fax. +31(0)20 88 555 22
Mobile: +31(0)6 218 555 03
http://www.home.nl