Re: URL->abs bug.

Axel Ramge (axel@ramge.de)
Tue, 5 Aug 1997 09:27:36 +0200


Randal Schwartz writes:
 > >>>>> "Gisle" == Gisle Aas <aas@bergen.sn.no> writes:
 > 
 > Gisle> The reason URI::URL does this is that RFC1738 says:
 > 
 > Gisle>    In some cases, extra whitespace (spaces, linebreaks, tabs, etc.) may
 > Gisle>    need to be added to break long URLs across lines.  The whitespace
 > Gisle>    should be ignored when extracting the URL.
 > 
 > Gisle> Whitespace should not really appear in URLs.  It should have been
 > Gisle> encoded as '%20'.
 > 
 > Gisle> Does anybody think we should get rid of the whitespace remover
 > Gisle> in URI::URL?
 > 
 > I'm an RFC-kinda-guy.  So, count me as liking the current behavior.

Following RFC 1738, whitespace *has* to be removed, because
(white-)space is considered an unsafe character which *must* be
encoded. So if there really is literal whitespace, it should be
removed. But this is probably a little too strict.

Maybe the most pragmatic solution would be to remove whitespace only
`in context', i.e. at the end (+beginning?) of a line: s/\s*$//smg
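
A minimal sketch of the `in context' idea, spelled out a bit further than
the one-liner. The helper name strip_wrap_whitespace and the example.com
URLs are made up for illustration; this is not what URI::URL does. It
assumes that whitespace at either end of the string or next to a line
break comes from wrapping, and it leaves any other embedded space alone
so the caller can still decide whether to encode it as '%20'.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of URI::URL: remove whitespace only where
# a long URL was probably wrapped, and keep embedded spaces untouched.
sub strip_wrap_whitespace {
    my ($url) = @_;
    $url =~ s/^\s+//;                  # whitespace before the URL
    $url =~ s/\s+$//;                  # whitespace after the URL
    $url =~ s/[ \t]*[\r\n]+[ \t]*//g;  # line breaks plus adjacent blanks
    return $url;
}

# A URL broken across two lines is joined again.
print strip_wrap_whitespace("  http://www.example.com/a/\n    b.html  \n"), "\n";

# A space inside a single line is kept as it is.
print strip_wrap_whitespace("http://www.example.com/my file.html"), "\n";
```

Doing the line-break step with [ \t] rather than \s keeps the pattern from
swallowing more than one wrapped position at a time; whether the leftover
embedded space should then be escaped or rejected is a separate question.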

Axel