Re: Libwww suggestions
Martijn Koster (m.koster@nexor.co.uk)
Fri, 26 Aug 1994 17:34:52 +0100
Marc wrote:
> Also, does anyone know of perl code for converting HTML entities
> to their 8bit text equivalents in iso-8859-1? Or maybe their appropriate
> if crude ASCII renditions? And possibly back the other way (at least
> with the 8bit stuff)? It seems that would be a useful addition to wwwhtml.
I did hack a script to read ISOLat1 and produce a table, from
which you can parse some 'crude' renditions. See
http://web.nexor.co.uk/mak/misc/charset.html
If you do 'www -n http://web/mak/misc/charset.html | grep '; \[.\]' and pipe
it to a file, then you can parse that to get the iso8859 codes.
Then just manually fill in any blanks, shove them into a number of
%'s, and you're set.
-- Martijn
__________
Internet: m.koster@nexor.co.uk
X-400: C=GB; A= ; P=Nexor; O=Nexor; S=koster; I=M
X-500: c=GB@o=NEXOR Ltd@cn=Martijn Koster
WWW: http://web.nexor.co.uk/mak/mak.html