Re: URL->abs bug.
Axel Ramge (axel@ramge.de)
Tue, 5 Aug 1997 09:27:36 +0200
Randal Schwartz writes:
> >>>>> "Gisle" == Gisle Aas <aas@bergen.sn.no> writes:
>
> Gisle> The reason URI::URL does this is that RFC1738 says:
>
> Gisle> In some cases, extra whitespace (spaces, linebreaks, tabs, etc.) may
> Gisle> need to be added to break long URLs across lines. The whitespace
> Gisle> should be ignored when extracting the URL.
>
> Gisle> Whitespace should not really appear in URLs. It should have been
> Gisle> encoded as '%20'.
>
> Gisle> Does anybody think we should get rid of the whitespace remover
> Gisle> in URI::URL?
>
> I'm an RFC-kinda-guy. So, count me as liking the current behavior.
Following rfc1738 whitespace *has* to be removed, because
(white-)space is considered to be an unsafe character which *must* be
encoded. So if there really is a literal whitespace it should be
removed. But probably this is a little bit to strict.
May be the most pragmatic solution would be to remove whitespace only
`in context', i.e. at the end (+beginning?) of a line: s/\s*$//smg
Axel