Re: URI::Heuristic
acassin@cs.mu.oz.au
Tue, 28 Oct 1997 11:29:48 +1100 (EST)
Folks,
I just noticed an article in the latest comp.risks digest which is
relevant for URI::Heuristic (and its users).
Andrew Cassin
acassin@cs.mu.oz.au
--- from comp.risks 19.42 ---
Date: Thu, 23 Oct 1997 17:03:22 +0100 (BST)
From: dom@inta.net
Subject: When taking a guess isn't so smart
Having in the past given up with the domain name private.org due to
the amount of irrelevant e-mail I received to it, you would have
thought I would know better.
I own the domain name HTTP.ORG. The web site for that domain has
always received a fair amount of seemingly random hits, presumably
from people mistyping URLs and having their domain suffix search
order including ORG.
Recently however I have noticed a dramatic increase in the number
of hits to www.http.org, so I decided to investigate further. Looking
at the access logs, it did appear that the recent upsurge in hits
could also attributed to mistypes. But why the very recent increase ?
The common link is Lynx web browser 2.7.1.
It would appear that the latest version of Lynx likes to pretend to
be clever and guesses at a URL if it doesn't receive a response from
the host that you typed. Unfortunately this means that a request for
any of the following typos:
lynx http//:www.somedomain.com
lynx http//www.somedomain.com
lynx http/:www.somedomain.com
lynx http/www.somedomain.com
Will result in Lynx trying...
Looking up 'http' first.
Looking up 'www.http.com', guessing...
Looking up 'www.http.edu', guessing...
Looking up 'www.http.net', guessing...
Looking up 'www.http.org', guessing...
Getting http://www.http.org//www.somedomain.com
At which point my site is queried.
The risks are obvious. Confusion reigns and I receive a constant flow of
hate e-mail from users all around the world who think that I have hijacked
the web sites they are trying to reach. *sigh*
Dominic J. Hulewicz - mailto:dom@inta.net - http://www.intanet.com/dom
[Dominic CC:ed a lynx development list on this contribution.
I have carefully removed that address here, but am now getting
copied on requests to be removed from that list! I never sausage lynx
before. PGN]