Re: Accept-Charset support
Chris Lilley (Chris.Lilley@sophia.inria.fr)
Wed, 18 Dec 1996 01:21:58 +0100 (MET)
On Dec 18, 12:03am, Keld Jørn Simonsen wrote:
[in amongst generally good stuff ]
> Of course it complicates matters
> with yet another parameter, but it could help in choosing an
> appropriate font, and then it is the right concept.
Sequences of bytes, sequences of characters, and sequences of glyphs
are not the same thing. The charset does not necessarily map 1:1 to
the font encoding vector. The characters might not even be displayed
visually - they might be spoken, for example.
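
To make the byte/character distinction concrete, here is a small sketch in
Python. The sample strings and variable names are my own illustration, not
anything from the thread. Glyph counts depend on the font and renderer, so
the last step only shows two characters that are usually drawn as one glyph.

```python
import unicodedata

# One sequence of characters...
text = "Jørn"

# ...becomes different byte sequences under different charsets.
utf8_bytes = text.encode("utf-8")         # 'ø' takes two bytes here
latin1_bytes = text.encode("iso-8859-1")  # 'ø' takes one byte here

print(len(text), len(utf8_bytes), len(latin1_bytes))

# Characters and glyphs differ too: 'e' followed by a combining acute
# accent is two characters, typically rendered as a single glyph.
decomposed = unicodedata.normalize("NFD", "é")
print([hex(ord(c)) for c in decomposed])
```

The charset only says how to get from the bytes to the characters; nothing
in it says how, or whether, those characters end up as glyphs.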
> I note that a MIME charset identifies a repertoire,
Not necessarily. It identifies a mapping from sequences of (one or more)
bytes to characters. In the particular case of HTML it does not identify
a repertoire at all. It is possible to write a document that contains the
entire character repertoire of ISO 10646 and have it correctly labelled as
US-ASCII, just by using numeric character references.
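
As a rough illustration of that last point, the sketch below uses Python's
standard html module. The markup fragment and variable names are invented
for the example. Every byte on the wire is plain US-ASCII, yet the
characters the document denotes lie well outside the ASCII range.

```python
import html

# A fragment that uses only numeric character references for non-ASCII text.
markup = "<p>&#948;&#20013;&#x263A;</p>"

# Encoding as US-ASCII succeeds, so the charset label is correct.
raw = markup.encode("us-ascii")

# Decoding the bytes gives back the markup; resolving the references
# then yields characters far beyond the US-ASCII range.
decoded = html.unescape(raw.decode("us-ascii"))

print(max(raw) < 0x80)                  # every byte is 7-bit
print([hex(ord(c)) for c in decoded[3:6]])
```

So the charset label describes the byte-to-character mapping of the
transmitted entity, not the set of characters the document can contain.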
--
Chris Lilley, W3C [ http://www.w3.org/ ]
Graphics and Fonts Guy The World Wide Web Consortium
http://www.w3.org/people/chris/ INRIA, Projet W3C
chris@w3.org 2004 Rt des Lucioles / BP 93
+33 (0)4 93 65 79 87 06902 Sophia Antipolis Cedex, France