users@jaxb.java.net

Re: UTF-8 encoding broken in JAXB 1.0 marshaller

From: Malachi de AElfweald <malachi_at_EOTI.ORG>
Date: Thu, 06 Mar 2003 09:25:34 -0700

On Thu, 6 Mar 2003 07:38:44 -0700, Bernhard Mandl <bmandl_at_ITV.GLOBALREFUND.COM> wrote:
>For example the character "?" (o with two dots above it) has the code 246 decimal or 0xF6 in the ANSI codepage 1252 (Windows Latin 1)

Actually, this info is wrong. Most of Windows uses Cp1252, which is their extension to 8859-1. However, the DOS prompt is ANSI, Cp437. The Unicode character for "Latin Capital Letter U with Diaeresis" is 00D6. The ANSI character for it is 153. For the ansi character 246 that you mentioned, it is actually the symbol "Division Sign", Unicode character 00F7.

Malachi