Re: Libwww suggestions

Martijn Koster (m.koster@nexor.co.uk)
Fri, 26 Aug 1994 17:34:52 +0100


Marc wrote:

> Also, does anyone know of perl code for converting HTML entities
> to their 8bit text equivalents in iso-8859-1? Or maybe their appropriate
> if crude ASCII renditions?  And possibly back the other way (at least
> with the 8bit stuff)?  It seems that would be a useful addition to wwwhtml.

I did hack a script to read ISOLat1 and produce a table, from
which you can parse some 'crude' renditions.  See
http://web.nexor.co.uk/mak/misc/charset.html

If you do 'www -n http://web/mak/misc/charset.html | grep '; \[.\]' and pipe 
it to a file, then you can parse that to get the iso8859 codes.

Then just manually fill in any blanks, shove them into a number of
%'s, and you're set.

-- Martijn
__________
Internet: m.koster@nexor.co.uk
X-400: C=GB; A= ; P=Nexor; O=Nexor; S=koster; I=M
X-500: c=GB@o=NEXOR Ltd@cn=Martijn Koster
WWW: http://web.nexor.co.uk/mak/mak.html