Re: Bug in HTML::Parse

Gisle Aas (aas@bergen.sn.no)
Tue, 01 Oct 1996 07:03:45 +0100


In message <199609301202.NAA27468@fma10.fma.com>, Charlie Stross writes:
> Yep, there _is_ a problem in HTML::Parse. I fixed my difficulty with
> the PICS headers by calling HTML::Entity::Decode in the right place,
> only to discover that the parser loses information about the _type_
> of quoting being used in HTML elements. For example:
> 
> <META NAME="foo" CONTENT='This content contains "embedded information"'>
> 
> turns into:
> 
> <META NAME="foo" CONTENT="This content contains "embedded information"">
> 
> ... which is obviously wrong.

Yes, obviously.

> (NB: Using LWP 5.01).

Upgrade to LWP 5.03 and you should be OK.  As of 5.03 we use
single quotes when the attribute value contains more double quotes
than single quotes.
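
Roughly, the rule can be sketched like this. This is only an
illustration of the heuristic described above, not the code that ships
in LWP: the name quote_attr_value is made up for this example, and
escaping whichever quote character ends up as the delimiter is an
assumption on top of the rule.

```perl
use strict;
use warnings;

# Hypothetical helper, not part of LWP: pick the quote character for
# an attribute value by counting the quotes it contains.
sub quote_attr_value {
    my ($value) = @_;

    # tr/// with an empty replacement list only counts matches.
    my $double = ($value =~ tr/"//);
    my $single = ($value =~ tr/'//);

    if ($double > $single) {
        # More double quotes than single quotes: delimit with single
        # quotes and escape any single quotes left in the value
        # (the escaping is an assumption, see above).
        (my $escaped = $value) =~ s/'/&#39;/g;
        return "'$escaped'";
    }

    # Otherwise keep the usual double quotes and escape embedded ones.
    (my $escaped = $value) =~ s/"/&quot;/g;
    return qq("$escaped");
}

my $content = 'This content contains "embedded information"';
print '<META NAME=', quote_attr_value('foo'),
      ' CONTENT=', quote_attr_value($content), ">\n";
# prints: <META NAME="foo" CONTENT='This content contains "embedded information"'>
```

Ampersands and other entities are left alone here to keep the sketch
short.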

Regards,
Gisle.