Oracle® Database Globalization Support Guide
10g Release 1 (10.1)

Part Number B10749-02

A Locale Data

This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server.

You can obtain information about character sets, languages, territories, and sorting orders by querying the V$NLS_VALID_VALUES dynamic performance view.


See Also:

Oracle Database Reference for more information about the data that can be returned by this view
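As a rough illustration of such a query from a client program, the sketch below wraps it in a small Python helper. It assumes a DB-API 2.0 style cursor that accepts named bind variables (as Oracle drivers for Python typically do); the helper name valid_nls_values and the choice of driver are assumptions made for this example, not part of this guide. The view returns one row per valid value, with the kind of value in the PARAMETER column and its name in the VALUE column.

```python
# Hypothetical helper: read V$NLS_VALID_VALUES through a DB-API 2.0 cursor.
# The named bind style (:param) is an assumption about the driver in use.
QUERY = (
    "SELECT value FROM v$nls_valid_values "
    "WHERE parameter = :param ORDER BY value"
)

# PARAMETER values for the four kinds of locale data named in the text.
PARAMETERS = ("LANGUAGE", "TERRITORY", "CHARACTERSET", "SORT")


def valid_nls_values(cursor, parameter):
    """Return the valid values for one kind of locale data as a list."""
    parameter = parameter.upper()
    if parameter not in PARAMETERS:
        raise ValueError("unknown parameter: " + parameter)
    cursor.execute(QUERY, {"param": parameter})
    return [row[0] for row in cursor.fetchall()]
```

With a live connection, valid_nls_values(cursor, "SORT") would return the sort names known to the server that the cursor is connected to.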

Languages

The languages in Table A-1 provide support for locale-sensitive information, such as the default sort order shown for each language.

By using Unicode databases and datatypes, you can store, process, and retrieve data for almost all contemporary languages, including many that do not appear in Table A-1.

Table A-1 Oracle Supported Languages

Language Name Language Abbreviation Default Sort
AMERICAN us binary
ARABIC ar ARABIC
ASSAMESE as binary
AZERBAIJANI az AZERBAIJANI
BANGLA bn binary
BRAZILIAN PORTUGUESE ptb WEST_EUROPEAN
BULGARIAN bg BULGARIAN
CANADIAN FRENCH frc CANADIAN FRENCH
CATALAN ca CATALAN
CROATIAN hr CROATIAN
CYRILLIC KAZAKH ckk GENERIC_M
CYRILLIC SERBIAN csr GENERIC_M
CYRILLIC UZBEK cuz GENERIC_M
CZECH cs CZECH
DANISH dk DANISH
DUTCH nl DUTCH
EGYPTIAN eg ARABIC
ENGLISH gb binary
ESTONIAN et ESTONIAN
FINNISH sf FINNISH
FRENCH f FRENCH
GERMAN DIN din GERMAN
GERMAN d GERMAN
GREEK el GREEK
GUJARATI gu binary
HEBREW iw HEBREW
HINDI hi binary
HUNGARIAN hu HUNGARIAN
ICELANDIC is ICELANDIC
INDONESIAN in INDONESIAN
ITALIAN i WEST_EUROPEAN
JAPANESE ja binary
KANNADA kn binary
KOREAN ko binary
LATIN AMERICAN SPANISH esa SPANISH
LATIN SERBIAN lsr binary
LATIN UZBEK luz GENERIC_M
LATVIAN lv LATVIAN
LITHUANIAN lt LITHUANIAN
MACEDONIAN mk binary
MALAY ms MALAY
MALAYALAM ml binary
MARATHI mr binary
MEXICAN SPANISH esm WEST_EUROPEAN
NORWEGIAN n NORWEGIAN
ORIYA or binary
POLISH pl POLISH
PORTUGUESE pt WEST_EUROPEAN
PUNJABI pa binary
ROMANIAN ro ROMANIAN
RUSSIAN ru RUSSIAN
SIMPLIFIED CHINESE zhs binary
SLOVAK sk SLOVAK
SLOVENIAN sl SLOVENIAN
SPANISH e SPANISH
SWEDISH s SWEDISH
TAMIL ta binary
TELUGU te binary
THAI th THAI_DICTIONARY
TRADITIONAL CHINESE zht binary
TURKISH tr TURKISH
UKRAINIAN uk UKRAINIAN
VIETNAMESE vn VIETNAMESE

Translated Messages

Oracle error messages have been translated into the languages listed in Table A-2.

Table A-2 Oracle Supported Messages

Name Abbreviation
ARABIC ar
BRAZILIAN PORTUGUESE ptb
CATALAN ca
CZECH cs
DANISH dk
DUTCH nl
FINNISH sf
FRENCH f
GERMAN d
GREEK el
HEBREW iw
HUNGARIAN hu
ITALIAN i
JAPANESE ja
KOREAN ko
NORWEGIAN n
POLISH pl
PORTUGUESE pt
ROMANIAN ro
RUSSIAN ru
SIMPLIFIED CHINESE zhs
SLOVAK sk
SPANISH e
SWEDISH s
THAI th
TRADITIONAL CHINESE zht
TURKISH tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories

Name Name Name
ALGERIA HONG KONG PERU
- - PHILIPPINES
AMERICA HUNGARY POLAND
AUSTRALIA ICELAND PORTUGAL
AUSTRIA INDIA PUERTO RICO
BAHRAIN INDONESIA QATAR
BANGLADESH IRAQ ROMANIA
- - RUSSIA
BELGIUM IRELAND SAUDI ARABIA
- - SERBIA AND MONTENEGRO
BRAZIL ISRAEL SINGAPORE
BULGARIA ITALY SLOVAKIA
CANADA JAPAN SLOVENIA
CATALONIA JORDAN SOMALIA
CHILE KAZAKHSTAN SOUTH AFRICA
CHINA KOREA SPAIN
ARGENTINA - -
AZERBAIJAN KUWAIT SUDAN
COLOMBIA LATVIA SWEDEN
COSTA RICA LEBANON SWITZERLAND
CROATIA LIBYA SYRIA
CYPRUS LITHUANIA TAIWAN
CZECH REPUBLIC LUXEMBOURG THAILAND
DENMARK - THE NETHERLANDS
DJIBOUTI MALAYSIA TUNISIA
ECUADOR - -
EGYPT MAURITANIA TURKEY
EL SALVADOR MEXICO UKRAINE
ESTONIA MOROCCO UNITED ARAB EMIRATES
FINLAND NEW ZEALAND UNITED KINGDOM
FRANCE NICARAGUA UZBEKISTAN
FYR MACEDONIA - VENEZUELA
GUATEMALA NORWAY VIETNAM
GERMANY OMAN YEMEN
GREECE PANAMA -

Character Sets

Oracle-supported character sets are listed in the following sections according to three broad language groups: Asian, European, and Middle Eastern.

In addition, common subset/superset combinations are listed.

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

The comment section indicates the type of encoding used:

SB = Single-byte encoding
MB = Multibyte encoding
FIXED = Fixed-width multibyte encoding

As mentioned in Chapter 3, "Setting Up a Globalization Support Environment", the type of encoding affects performance, so use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain datatypes. For instance, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For example, the information includes whether the character set supports the euro currency symbol, whether user-defined characters are supported, and whether the character set is a strict superset of ASCII. (You can use the ALTER DATABASE CHARACTER SET statement to migrate from the US7ASCII character set to a character set that is a strict superset of ASCII.)

The following is the key for the comment column of the character set tables:

EURO: euro symbol supported
UDC: user-defined characters supported
ASCII: strict superset of ASCII
MB: multibyte
SB: single-byte
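The ASCII and EURO flags can be approximated outside the database. The following sketch uses Python's built-in codecs as stand-ins for Oracle character sets (for example cp1252 for WE8MSWIN1252 and cp037 for WE8EBCDIC37). That pairing and both function names are assumptions for illustration, and the checks are a simplified reading of the flags rather than Oracle's own definition.

```python
def is_strict_ascii_superset(codec):
    """True if each 7-bit byte decodes to the ASCII character with the same value."""
    for byte in range(0x80):
        try:
            if bytes([byte]).decode(codec) != chr(byte):
                return False
        except UnicodeDecodeError:
            # A single byte is not a complete character in this encoding.
            return False
    return True


def supports_euro(codec):
    """True if the euro sign (U+20AC) can be encoded in the codec."""
    try:
        "\u20ac".encode(codec)
        return True
    except UnicodeEncodeError:
        return False


for name in ("cp1252", "latin-1", "iso8859-15", "cp037", "utf-8"):
    print(name, is_strict_ascii_superset(name), supports_euro(name))
```

In this sketch the EBCDIC codec fails the ASCII check, and latin-1 fails the euro check while iso8859-15 and cp1252 pass it, which mirrors the flags shown for WE8ISO8859P1, WE8ISO8859P15, and WE8MSWIN1252 in Table A-5.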

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you can use Oracle Locale Builder. Otherwise, you should refer to the actual national, international, or vendor-specific standards.

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages. The list is alphabetical by description.

Table A-4 Asian Language Character Sets

Name Description Comments
BN8BSCII Bangladesh National Code 8-bit BSCII SB, ASCII
ZHT16BIG5 BIG5 16-bit Traditional Chinese MB, ASCII
ZHS16CGB231280 CGB2312-80 16-bit Simplified Chinese MB, ASCII
JA16EUC EUC 24-bit Japanese MB, ASCII
JA16EUCTILDE The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode. MB, ASCII
JA16EUCYEN EUC 24-bit Japanese with '\' mapped to the Japanese yen character MB
ZHT32EUC EUC 32-bit Traditional Chinese MB, ASCII
ZHS16GBK GBK 16-bit Simplified Chinese MB, ASCII, UDC
ZHS32GB18030 GB18030-2000 MB, ASCII, EURO
ZHT16CCDC HP CCDC 16-bit Traditional Chinese MB, ASCII
JA16DBCS IBM EBCDIC 16-bit Japanese MB, UDC
JA16EBCDIC930 IBM DBCS Code Page 290 16-bit Japanese MB, UDC
KO16DBCS IBM EBCDIC 16-bit Korean MB, UDC
ZHS16DBCS IBM EBCDIC 16-bit Simplified Chinese MB, UDC
ZHT16DBCS IBM EBCDIC 16-bit Traditional Chinese MB, UDC
JA16VMS JVMS 16-bit Japanese MB, ASCII
KO16KSC5601 KSC5601 16-bit Korean MB, ASCII
KO16KSCCS KSCCS 16-bit Korean MB, ASCII
ZHS16MACCGB231280 Mac client CGB2312-80 16-bit Simplified Chinese MB
JA16MACSJIS Mac client Shift-JIS 16-bit Japanese MB
TH8MACTHAI Mac Client 8-bit Latin/Thai SB
TH8MACTHAIS Mac Server 8-bit Latin/Thai SB, ASCII
TH8TISEBCDICS Thai Industrial Standard 620-2533-EBCDIC Server 8-bit SB
KO16MSWIN949 MS Windows Code Page 949 Korean MB, ASCII, UDC
ZHT16HKSCS MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.0) MB, ASCII, EURO
ZHT16MSWIN950 MS Windows Code Page 950 Traditional Chinese MB, ASCII, UDC
VN8MSWIN1258 MS Windows Code Page 1258 8-bit Vietnamese SB, ASCII, EURO
IN8ISCII Multiple-Script Indian Standard 8-bit Latin/Indian Languages SB, ASCII
JA16SJIS Shift-JIS 16-bit Japanese MB, ASCII, UDC
JA16SJISTILDE The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode. MB, ASCII, UDC
JA16SJISYEN Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character MB, UDC
ZHT32SOPS SOPS 32-bit Traditional Chinese MB, ASCII
ZHT16DBT Taiwan Taxation 16-bit Traditional Chinese MB, ASCII
TH8TISASCII Thai Industrial Standard 620-2533 - ASCII 8-bit SB, ASCII, EURO
TH8TISEBCDIC Thai Industrial Standard 620-2533 - EBCDIC 8-bit SB
ZHT32TRIS TRIS 32-bit Traditional Chinese MB, ASCII
AL16UTF16 See "Universal Character Sets" for details MB, EURO, FIXED
AL32UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTFE See "Universal Character Sets" for details MB, EURO
VN8VN3 VN3 8-bit Vietnamese SB, ASCII

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages. The list is alphabetical by description.

Table A-5 European Language Character Sets

Name Description Comments
US7ASCII ASCII 7-bit American SB, ASCII
SF7ASCII ASCII 7-bit Finnish SB
YUG7ASCII ASCII 7-bit Yugoslavian SB
RU8BESTA BESTA 8-bit Latin/Cyrillic SB, ASCII
EL8GCOS7 Bull EBCDIC GCOS7 8-bit Greek SB
WE8GCOS7 Bull EBCDIC GCOS7 8-bit West European SB
EL8DEC DEC 8-bit Latin/Greek SB
TR7DEC DEC VT100 7-bit Turkish SB
TR8DEC DEC 8-bit Turkish SB, ASCII
WE8DEC DEC 8-bit West European SB, ASCII
D7DEC DEC VT100 7-bit German SB
F7DEC DEC VT100 7-bit French SB
S7DEC DEC VT100 7-bit Swedish SB
E7DEC DEC VT100 7-bit Spanish SB
NDK7DEC DEC VT100 7-bit Norwegian/Danish SB
I7DEC DEC VT100 7-bit Italian SB
NL7DEC DEC VT100 7-bit Dutch SB
CH7DEC DEC VT100 7-bit Swiss (German/French) SB
SF7DEC DEC VT100 7-bit Finnish SB
WE8DG DG 8-bit West European SB, ASCII
WE8EBCDIC37C EBCDIC Code Page 37 8-bit Oracle/c SB
WE8EBCDIC37 EBCDIC Code Page 37 8-bit West European SB
D8EBCDIC273 EBCDIC Code Page 273/1 8-bit Austrian German SB
DK8EBCDIC277 EBCDIC Code Page 277/1 8-bit Danish SB
S8EBCDIC278 EBCDIC Code Page 278/1 8-bit Swedish SB
I8EBCDIC280 EBCDIC Code Page 280/1 8-bit Italian SB
WE8EBCDIC284 EBCDIC Code Page 284 8-bit Latin American/Spanish SB
WE8EBCDIC285 EBCDIC Code Page 285 8-bit West European SB
WE8EBCDIC1047 EBCDIC Code Page 1047 8-bit West European SB
WE8EBCDIC1140 EBCDIC Code Page 1140 8-bit West European SB, EURO
WE8EBCDIC1140C EBCDIC Code Page 1140 Client 8-bit West European SB, EURO
WE8EBCDIC1145 EBCDIC Code Page 1145 8-bit West European SB, EURO
WE8EBCDIC1146 EBCDIC Code Page 1146 8-bit West European SB, EURO
WE8EBCDIC1148 EBCDIC Code Page 1148 8-bit West European SB, EURO
WE8EBCDIC1148C EBCDIC Code Page 1148 Client 8-bit West European SB, EURO
F8EBCDIC297 EBCDIC Code Page 297 8-bit French SB
WE8EBCDIC500C EBCDIC Code Page 500 8-bit Oracle/c SB
WE8EBCDIC500 EBCDIC Code Page 500 8-bit West European SB
EE8EBCDIC870 EBCDIC Code Page 870 8-bit East European SB
EE8EBCDIC870C EBCDIC Code Page 870 Client 8-bit East European SB
EE8EBCDIC870S EBCDIC Code Page 870 Server 8-bit East European SB
WE8EBCDIC871 EBCDIC Code Page 871 8-bit Icelandic SB
EL8EBCDIC875 EBCDIC Code Page 875 8-bit Greek SB
EL8EBCDIC875R EBCDIC Code Page 875 Server 8-bit Greek SB
CL8EBCDIC1025 EBCDIC Code Page 1025 8-bit Cyrillic SB
CL8EBCDIC1025C EBCDIC Code Page 1025 Client 8-bit Cyrillic SB
CL8EBCDIC1025R EBCDIC Code Page 1025 Server 8-bit Cyrillic SB
CL8EBCDIC1025S EBCDIC Code Page 1025 Server 8-bit Cyrillic SB
CL8EBCDIC1025X EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic SB
TR8EBCDIC1026 EBCDIC Code Page 1026 8-bit Turkish SB
TR8EBCDIC1026S EBCDIC Code Page 1026 Server 8-bit Turkish SB
BLT8EBCDIC1112 EBCDIC Code Page 1112 8-bit Baltic Multilingual SB
BLT8EBCDIC1112S EBCDIC Code Page 1112 8-bit Server Baltic Multilingual SB
D8EBCDIC1141 EBCDIC Code Page 1141 8-bit Austrian German SB, EURO
DK8EBCDIC1142 EBCDIC Code Page 1142 8-bit Danish SB, EURO
S8EBCDIC1143 EBCDIC Code Page 1143 8-bit Swedish SB, EURO
I8EBCDIC1144 EBCDIC Code Page 1144 8-bit Italian SB, EURO
F8EBCDIC1147 EBCDIC Code Page 1147 8-bit French SB, EURO
CL8EBCDIC1158 EBCDIC Code Page 1158 8-bit Cyrillic SB
CL8EBCDIC1158R EBCDIC Code Page 1158 Server 8-bit Cyrillic SB
EEC8EUROASCI EEC Targon 35 ASCI West European/Greek SB
EEC8EUROPA3 EEC EUROPA3 8-bit West European/Greek SB
LA8PASSPORT German Government Printer 8-bit All-European Latin SB, ASCII
WE8HP HP LaserJet 8-bit West European SB
WE8ROMAN8 HP Roman8 8-bit West European SB, ASCII
HU8CWI2 Hungarian 8-bit CWI-2 SB, ASCII
HU8ABMOD Hungarian 8-bit Special AB Mod SB, ASCII
EL8EBCDIC423R IBM EBCDIC Code Page 423 for RDBMS server-side SB
LV8RST104090 IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic) SB, ASCII
US8PC437 IBM-PC Code Page 437 8-bit American SB, ASCII
BG8PC437S IBM-PC Code Page 437 8-bit (Bulgarian Modification) SB, ASCII
EL8PC437S IBM-PC Code Page 437 8-bit (Greek modification) SB, ASCII
EL8PC737 IBM-PC Code Page 737 8-bit Greek/Latin SB
LT8PC772 IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic) SB, ASCII
LT8PC774 IBM-PC Code Page 774 8-bit Lithuanian (Latin) SB, ASCII
BLT8PC775 IBM-PC Code Page 775 8-bit Baltic SB, ASCII
WE8PC850 IBM-PC Code Page 850 8-bit West European SB, ASCII
EL8PC851 IBM-PC Code Page 851 8-bit Greek/Latin SB, ASCII
EE8PC852 IBM-PC Code Page 852 8-bit East European SB, ASCII
RU8PC855 IBM-PC Code Page 855 8-bit Latin/Cyrillic SB, ASCII
TR8PC857 IBM-PC Code Page 857 8-bit Turkish SB, ASCII
WE8PC858 IBM-PC Code Page 858 8-bit West European SB, ASCII, EURO
WE8PC860 IBM-PC Code Page 860 8-bit West European SB, ASCII
IS8PC861 IBM-PC Code Page 861 8-bit Icelandic SB, ASCII
CDN8PC863 IBM-PC Code Page 863 8-bit Canadian French SB, ASCII
N8PC865 IBM-PC Code Page 865 8-bit Norwegian SB, ASCII
RU8PC866 IBM-PC Code Page 866 8-bit Latin/Cyrillic SB, ASCII
EL8PC869 IBM-PC Code Page 869 8-bit Greek/Latin SB, ASCII
LV8PC1117 IBM-PC Code Page 1117 8-bit Latvian SB, ASCII
US8ICL ICL EBCDIC 8-bit American SB
WE8ICL ICL EBCDIC 8-bit West European SB
WE8ISOICLUK ICL special version ISO8859-1 SB
WE8ISO8859P1 ISO 8859-1 West European SB, ASCII
EE8ISO8859P2 ISO 8859-2 East European SB, ASCII
SE8ISO8859P3 ISO 8859-3 South European SB, ASCII
NEE8ISO8859P4 ISO 8859-4 North and North-East European SB, ASCII
CL8ISO8859P5 ISO 8859-5 Latin/Cyrillic SB, ASCII
AR8ISO8859P6 ISO 8859-6 Latin/Arabic SB, ASCII
EL8ISO8859P7 ISO 8859-7 Latin/Greek SB, ASCII, EURO
IW8ISO8859P8 ISO 8859-8 Latin/Hebrew SB, ASCII
NE8ISO8859P10 ISO 8859-10 North European SB, ASCII
BLT8ISO8859P13 ISO 8859-13 Baltic SB, ASCII
CEL8ISO8859P14 ISO 8859-14 Celtic SB, ASCII
WE8ISO8859P15 ISO 8859-15 West European SB, ASCII, EURO
LA8ISO6937 ISO 6937 8-bit Coded Character Set for Text Communication SB, ASCII
CL8ISOIR111 ISOIR111 Cyrillic SB
IW7IS960 Israeli Standard 960 7-bit Latin/Hebrew SB
CL8KOI8U KOI8 Ukrainian Cyrillic SB
WE8EBCDIC1047E Latin 1/Open Systems 1047 SB, EBCDIC, EURO
WE8EBCDIC924 Latin 9 EBCDIC 924 SB, EBCDIC
BLT8CP921 Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic SB, ASCII
LV8PC8LR Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic SB, ASCII
AR8ARABICMAC Mac Client 8-bit Latin/Arabic SB
EE8MACCE Mac Client 8-bit Central European SB
EE8MACCROATIAN Mac Client 8-bit Croatian SB
WE8MACROMAN8 Mac Client 8-bit Extended Roman8 West European SB
EL8MACGREEK Mac Client 8-bit Greek SB
IS8MACICELANDIC Mac Client 8-bit Icelandic SB
CL8MACCYRILLIC Mac Client 8-bit Latin/Cyrillic SB
TR8MACTURKISH Mac Client 8-bit Turkish SB
TR8MACTURKISHS Mac Server 8-bit Turkish SB, ASCII
AR8ARABICMACS Mac Server 8-bit Latin/Arabic SB, ASCII
EE8MACCES Mac Server 8-bit Central European SB, ASCII
EE8MACCROATIANS Mac Server 8-bit Croatian SB, ASCII
WE8MACROMAN8S Mac Server 8-bit Extended Roman8 West European SB, ASCII
CL8MACCYRILLICS Mac Server 8-bit Latin/Cyrillic SB, ASCII
EL8MACGREEKS Mac Server 8-bit Greek SB, ASCII
IS8MACICELANDICS Mac Server 8-bit Icelandic SB
BG8MSWIN MS Windows 8-bit Bulgarian Cyrillic SB, ASCII
LT8MSWIN921 MS Windows Code Page 921 8-bit Lithuanian SB, ASCII
ET8MSWIN923 MS Windows Code Page 923 8-bit Estonian SB, ASCII
EE8MSWIN1250 MS Windows Code Page 1250 8-bit East European SB, ASCII, EURO
CL8MSWIN1251 MS Windows Code Page 1251 8-bit Latin/Cyrillic SB, ASCII, EURO
WE8MSWIN1252 MS Windows Code Page 1252 8-bit West European SB, ASCII, EURO
EL8MSWIN1253 MS Windows Code Page 1253 8-bit Latin/Greek SB, ASCII, EURO
TR8MSWIN1254 MS Windows Code Page 1254 8-bit Turkish SB, ASCII, EURO
BLT8MSWIN1257 MS Windows Code Page 1257 8-bit Baltic SB, ASCII, EURO
WE8NCR4970 NCR 4970 8-bit West European SB, ASCII
WE8NEXTSTEP NeXTSTEP PostScript 8-bit West European SB, ASCII
CL8KOI8R RELCOM Internet Standard 8-bit Latin/Cyrillic SB, ASCII
US8BS2000 Siemens 9750-62 EBCDIC 8-bit American SB
DK8BS2000 Siemens 9750-62 EBCDIC 8-bit Danish SB
F8BS2000 Siemens 9750-62 EBCDIC 8-bit French SB
D8BS2000 Siemens 9750-62 EBCDIC 8-bit German SB
E8BS2000 Siemens 9750-62 EBCDIC 8-bit Spanish SB
S8BS2000 Siemens 9750-62 EBCDIC 8-bit Swedish SB
DK7SIEMENS9780X Siemens 97801/97808 7-bit Danish SB
F7SIEMENS9780X Siemens 97801/97808 7-bit French SB
D7SIEMENS9780X Siemens 97801/97808 7-bit German SB
I7SIEMENS9780X Siemens 97801/97808 7-bit Italian SB
N7SIEMENS9780X Siemens 97801/97808 7-bit Norwegian SB
E7SIEMENS9780X Siemens 97801/97808 7-bit Spanish SB
S7SIEMENS9780X Siemens 97801/97808 7-bit Swedish SB
EE8BS2000 Siemens EBCDIC.DF.04 8-bit East European SB
WE8BS2000 Siemens EBCDIC.DF.04 8-bit West European SB
WE8BS2000E Siemens EBCDIC.DF.04 8-bit West European SB, EURO
CL8BS2000 Siemens EBCDIC.EHC.LC 8-bit Cyrillic SB
WE8BS2000L5 Siemens EBCDIC.DF.L5 8-bit West European/Turkish SB
CE8BS2000 Siemens EBCDIC.DF.04 8-bit Celtic SB
AL16UTF16 See "Universal Character Sets" for details MB, EURO, FIXED
AL32UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTFE See "Universal Character Sets" for details MB, EURO

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages. The list is alphabetical by description.

Table A-6 Middle Eastern Character Sets

Name Description Comments
AR8APTEC715 APTEC 715 Server 8-bit Latin/Arabic SB, ASCII
AR8APTEC715T APTEC 715 8-bit Latin/Arabic SB
AR8ASMO708PLUS ASMO 708 Plus 8-bit Latin/Arabic SB, ASCII
AR8ASMO8X ASMO Extended 708 8-bit Latin/Arabic SB, ASCII
AR8ADOS710 Arabic MS-DOS 710 Server 8-bit Latin/Arabic SB, ASCII
AR8ADOS710T Arabic MS-DOS 710 8-bit Latin/Arabic SB
AR8ADOS720 Arabic MS-DOS 720 Server 8-bit Latin/Arabic SB, ASCII
AR8ADOS720T Arabic MS-DOS 720 8-bit Latin/Arabic SB
TR7DEC DEC VT100 7-bit Turkish SB
TR8DEC DEC 8-bit Turkish SB, ASCII
WE8EBCDIC37C EBCDIC Code Page 37 8-bit Oracle/c SB
IW8EBCDIC424 EBCDIC Code Page 424 8-bit Latin/Hebrew SB
IW8EBCDIC424S EBCDIC Code Page 424 Server 8-bit Latin/Hebrew SB
WE8EBCDIC500C EBCDIC Code Page 500 8-bit Oracle/c SB
IW8EBCDIC1086 EBCDIC Code Page 1086 8-bit Hebrew SB
AR8EBCDIC420S EBCDIC Code Page 420 Server 8-bit Latin/Arabic SB
AR8EBCDICX EBCDIC XBASIC Server 8-bit Latin/Arabic SB
TR8EBCDIC1026 EBCDIC Code Page 1026 8-bit Turkish SB
TR8EBCDIC1026S EBCDIC Code Page 1026 Server 8-bit Turkish SB
AR8HPARABIC8T HP 8-bit Latin/Arabic SB
TR8PC857 IBM-PC Code Page 857 8-bit Turkish SB, ASCII
IW8PC1507 IBM-PC Code Page 1507/862 8-bit Latin/Hebrew SB, ASCII
AR8ISO8859P6 ISO 8859-6 Latin/Arabic SB, ASCII
IW8ISO8859P8 ISO 8859-8 Latin/Hebrew SB, ASCII
AZ8ISO8859PE ISO 8859-9 Latin Azerbaijani SB, ASCII
WE8ISO8859P9 ISO 8859-9 West European & Turkish SB, ASCII
LA8ISO6937 ISO 6937 8-bit Coded Character Set for Text Communication SB, ASCII
IW7IS960 Israeli Standard 960 7-bit Latin/Hebrew SB
IW8MACHEBREW Mac Client 8-bit Hebrew SB
AR8ARABICMAC Mac Client 8-bit Latin/Arabic SB
AR8ARABICMACT Mac 8-bit Latin/Arabic SB
TR8MACTURKISH Mac Client 8-bit Turkish SB
IW8MACHEBREWS Mac Server 8-bit Hebrew SB, ASCII
AR8ARABICMACS Mac Server 8-bit Latin/Arabic SB, ASCII
TR8MACTURKISHS Mac Server 8-bit Turkish SB, ASCII
TR8MSWIN1254 MS Windows Code Page 1254 8-bit Turkish SB, ASCII, EURO
IW8MSWIN1255 MS Windows Code Page 1255 8-bit Latin/Hebrew SB, ASCII, EURO
AR8MSWIN1256 MS Windows Code Page 1256 8-bit Latin/Arabic SB, ASCII, EURO
IN8ISCII Multiple-Script Indian Standard 8-bit Latin/Indian Languages SB, ASCII
AR8MUSSAD768 Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic SB, ASCII
AR8MUSSAD768T Mussa'd Alarabi/2 768 8-bit Latin/Arabic SB
AR8NAFITHA711 Nafitha Enhanced 711 Server 8-bit Latin/Arabic SB, ASCII
AR8NAFITHA711T Nafitha Enhanced 711 8-bit Latin/Arabic SB
AR8NAFITHA721 Nafitha International 721 Server 8-bit Latin/Arabic SB, ASCII
AR8NAFITHA721T Nafitha International 721 8-bit Latin/Arabic SB
AR8SAKHR706 SAKHR 706 Server 8-bit Latin/Arabic SB, ASCII
AR8SAKHR707 SAKHR 707 Server 8-bit Latin/Arabic SB, ASCII
AR8SAKHR707T SAKHR 707 8-bit Latin/Arabic SB
AR8XBASIC XBASIC 8-bit Latin/Arabic SB
WE8BS2000L5 Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish SB
AL16UTF16 See "Universal Character Sets" for details MB, EURO, FIXED
AL32UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTF8 See "Universal Character Sets" for details MB, ASCII, EURO
UTFE See "Universal Character Sets" for details MB, EURO

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets

Name Description Comments
AL16UTF16 Unicode 3.2 UTF-16 Universal character set MB, EURO, FIXED
AL32UTF8 Unicode 3.2 UTF-8 Universal character set MB, ASCII, EURO
UTF8 Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant MB, ASCII, EURO
UTFE EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC) MB, EURO

Note:

CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.
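The difference described in this note can be made concrete with a short sketch. The function below is an illustrative Python implementation of the CESU-8 transformation as described above (it is not Oracle code, and the name cesu8_encode is an assumption): characters in the Basic Multilingual Plane are encoded exactly as in UTF-8, while each supplementary character is first split into its two UTF-16 surrogate code units, and each unit is then encoded as a three-byte sequence.

```python
def cesu8_encode(text):
    """Encode text as CESU-8: UTF-8 for the BMP, surrogate pairs otherwise."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 0x10000:
            # BMP character: identical to UTF-8 (one to three bytes).
            out += ch.encode("utf-8", "surrogatepass")
        else:
            # Supplementary character: derive the UTF-16 surrogate pair,
            # then encode each surrogate code unit as three bytes.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            for unit in (high, low):
                out += chr(unit).encode("utf-8", "surrogatepass")
    return bytes(out)


supplementary = "\U00010400"
print(supplementary.encode("utf-8").hex())   # f0909080 (4 bytes in UTF-8)
print(cesu8_encode(supplementary).hex())     # eda081edb080 (6 bytes in CESU-8)
```

This is why a CESU-8 compliant character set such as UTF8 stores a supplementary character in six bytes, whereas AL32UTF8 stores it in four.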

Character Set Conversion Support

The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:

AL16UTF16LE
ISO2022-CN
ISO2022-JP
ISO2022-KR
HZ-GB-2312

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.
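To give a feel for what a conversion between such encodings involves, here is a client-side analogue in Python. It is only a sketch: the mapping from Oracle character set names to Python codec names is an assumption made for this example, ISO2022-CN is omitted because the Python standard library has no codec for it, and the function merely mirrors the argument order of CONVERT (data, destination, source) without calling the database.

```python
# Assumed correspondence between Oracle names and Python codec names.
PYTHON_CODECS = {
    "AL16UTF16LE": "utf-16-le",
    "ISO2022-JP": "iso2022_jp",
    "ISO2022-KR": "iso2022_kr",
    "HZ-GB-2312": "hz",
    "AL32UTF8": "utf-8",
}


def convert(data, dest_char_set, source_char_set):
    """Re-encode bytes from the source character set to the destination."""
    text = data.decode(PYTHON_CODECS[source_char_set])
    return text.encode(PYTHON_CODECS[dest_char_set])


utf8_bytes = "\u65e5\u672c\u8a9e".encode("utf-8")      # Japanese sample text
jis_bytes = convert(utf8_bytes, "ISO2022-JP", "AL32UTF8")
round_trip = convert(jis_bytes, "AL32UTF8", "ISO2022-JP")
print(round_trip == utf8_bytes)
```

The ISO 2022 encodings are stateful (they switch character sets with escape sequences), which suggests why they are practical as conversion endpoints but are not offered as storage character sets.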


Subsets and Supersets

Table A-8 lists common subset/superset relationships.

Table A-8 Subset-Superset Pairs

Subset Superset
AR8ADOS710 AR8ADOS710T
AR8ADOS720 AR8ADOS720T
AR8ADOS720T AR8ADOS720
AR8APTEC715 AR8APTEC715T
AR8ARABICMACT AR8ARABICMAC
AR8ISO8859P6 AR8ASMO708PLUS
AR8ISO8859P6 AR8ASMO8X
AR8MUSSAD768 AR8MUSSAD768T
AR8MUSSAD768T AR8MUSSAD768
AR8NAFITHA711 AR8NAFITHA711T
AR8NAFITHA721 AR8NAFITHA721T
AR8SAKHR707 AR8SAKHR707T
AR8SAKHR707T AR8SAKHR707
BLT8CP921 BLT8ISO8859P13
BLT8CP921 LT8MSWIN921
D7DEC D7SIEMENS9780X
D7SIEMENS9780X D7DEC
DK7SIEMENS9780X N7SIEMENS9780X
I7DEC I7SIEMENS9780X
I7SIEMENS9780X IW8EBCDIC424
IW8EBCDIC424 IW8EBCDIC1086
KO16KSC5601 KO16MSWIN949
LT8MSWIN921 BLT8ISO8859P13
LT8MSWIN921 BLT8CP921
N7SIEMENS9780X DK7SIEMENS9780X
US7ASCII See Table A-9, "US7ASCII Supersets".
UTF8 AL32UTF8
WE8DEC TR8DEC
WE8DEC WE8NCR4970
WE8ISO8859P1 WE8MSWIN1252
WE8ISO8859P9 TR8MSWIN1254
WE8NCR4970 TR8DEC
WE8NCR4970 WE8DEC
WE8PC850 WE8PC858

US7ASCII is a special case because so many other character sets are supersets of it. Table A-9 lists supersets for US7ASCII.

Table A-9 US7ASCII Supersets

Supersets Supersets Supersets
- EE8MACCES NEE8ISO8859P4
AL32UTF8 EE8MACCROATIANS RU8BESTA
AR8ADOS710 EE8MSWIN1250 RU8PC855
AR8ADOS710T EE8PC852 RU8PC866
AR8ADOS720 EL8DEC SE8ISO8859P3
AR8ADOS720T EL8ISO8859P7 TH8MACTHAIS
AR8APTEC715 EL8MACGREEKS TH8TISASCII
AR8APTEC715T EL8MSWIN1253 TR8DEC
AR8ARABICMACS EL8PC437S TR8MACTURKISHS
AR8ASMO708PLUS EL8PC851 TR8MSWIN1254
AR8ASMO8X EL8PC869 TR8PC857
AR8HPARABIC8T ET8MSWIN923 US8PC437
AR8ISO8859P6 HU8ABMOD UTF8
AR8MSWIN1256 HU8CWI2 VN8MSWIN1258
AR8MUSSAD768 IN8ISCII VN8VN3
AR8MUSSAD768T IS8PC861 WE8DEC
AR8NAFITHA711 IW8ISO8859P8 WE8DG
AR8NAFITHA711T IW8MACHEBREWS WE8ISO8859P1
AR8NAFITHA721 IW8MSWIN1255 WE8ISO8859P15
AR8NAFITHA721T IW8PC1507 WE8ISO8859P9
AR8SAKHR706 JA16EUC WE8MACROMAN8S
AR8SAKHR707 JA16SJIS WE8MSWIN1252
AR8SAKHR707T - WE8NCR4970
AZ8ISO8859PE - -
BG8MSWIN - WE8NEXTSTEP
BG8PC437S JA16VMS WE8PC850
BLT8CP921 KO16KSC5601 WE8PC858
BLT8ISO8859P13 KO16KSCCS WE8PC860
BLT8MSWIN1257 KO16MSWIN949 WE8ROMAN8
BLT8PC775 - ZHS16CGB231280
BN8BSCII LA8ISO6937 ZHS16GBK
CDN8PC863 LA8PASSPORT ZHT16BIG5
CEL8ISO8859P14 LT8MSWIN921 ZHT16CCDC
CL8ISO8859P5 LT8PC772 ZHT16DBT
CL8KOI8R LT8PC774 ZHT16HKSCS
CL8KOI8U LV8PC1117 ZHT16MSWIN950
CL8ISOIR111 LV8PC8LR ZHT32EUC
CL8MACCYRILLICS LV8RST104090 ZHT32SOPS
CL8MSWIN1251 N8PC865 ZHT32TRIS
EE8ISO8859P2 NE8ISO8859P10 ZHS32GB18030

Language and Character Set Detection Support

Table A-10 displays the languages and character sets that are supported by the language and character set detection in the Character Set Scanner utilities (CSSCAN and LCSSCAN) and the Globalization Development Kit (GDK).

Each language has several character sets that can be detected.

When the binary values for a language match two or more encodings that have a subset/superset relationship, the subset character set is returned. For example, if the language is German and all characters are 7-bit, then US7ASCII is returned instead of WE8MSWIN1252, WE8ISO8859P15, or WE8ISO8859P1.

When the character set is determined to be UTF-8, the Oracle character set UTF8 is returned by default unless 4-byte characters (supplementary characters) are detected within the text. If 4-byte characters are detected, then the character set is reported as AL32UTF8.
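The two reporting rules above can be sketched as follows. This is a simplified illustration in Python, not the detection algorithm used by CSSCAN, LCSSCAN, or the GDK; the function name report_character_set is an assumption, and only the 7-bit and UTF-8 cases are handled.

```python
def report_character_set(data):
    """Apply the subset rule and the UTF8/AL32UTF8 rule to a byte string."""
    # Subset rule: pure 7-bit data is reported as the smallest character set.
    if all(byte < 0x80 for byte in data):
        return "US7ASCII"
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return None  # not UTF-8; other encodings are outside this sketch
    # In well-formed UTF-8, only the lead byte of a 4-byte sequence
    # (a supplementary character) has a value of 0xF0 or higher.
    if any(byte >= 0xF0 for byte in data):
        return "AL32UTF8"
    return "UTF8"


print(report_character_set(b"Guten Tag"))
print(report_character_set("Gr\u00fc\u00dfe".encode("utf-8")))
print(report_character_set("\U00010400".encode("utf-8")))
```

The three calls print US7ASCII, UTF8, and AL32UTF8 respectively.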

Table A-10 Languages and Character Sets Supported by CSSCAN, LCSSCAN, and GDK

Language Character Sets
Arabic AL16UTF16, AL32UTF8, AR8ISO8859P6, AR8MSWIN1256, UTF8
Bulgarian AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8MSWIN1251, UTF8
Catalan AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Croatian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Czech AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Danish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Dutch AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
English AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Estonian AL16UTF16, AL32UTF8, NEE8ISO8859P4, UTF8
Finnish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
French AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
German AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Greek AL16UTF16, AL32UTF8, EL8ISO8859P7, EL8MSWIN1253, UTF8
Hebrew AL16UTF16, AL32UTF8, IW8ISO8859P8, IW8MSWIN1255, UTF8
Hungarian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Italian AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Japanese AL16UTF16, AL32UTF8, ISO2022-JP, JA16EUC, JA16SJIS, UTF8
Korean AL16UTF16, AL32UTF8, ISO2022-KR, KO16KSC5601, KO16MSWIN949, UTF8
Malay AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Norwegian AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Polish AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Portuguese AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Romanian AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Russian AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8KOI8R, CL8MSWIN1251, UTF8
Simplified Chinese AL16UTF16, AL32UTF8, HZ-GB-2312, UTF8, ZHS16GBK, ZHS16CGB231280
Slovak AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8
Spanish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Swedish AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252
Thai AL16UTF16, AL32UTF8, TH8TISASCII, UTF8
Traditional Chinese AL16UTF16, AL32UTF8, UTF8, ZHT16MSWIN950
Turkish AL16UTF16, AL32UTF8, TR8MSWIN1254, UTF8, WE8ISO8859P9

Linguistic Sorts

Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters are sorted differently from their ASCII values. For example, ch and ll are treated as a single character in XSPANISH.
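To illustrate what treating ch and ll as single characters means for ordering, the following Python sketch builds a sort key in which ch is a letter that follows c and ll is a letter that follows l. The alphabet order is the traditional Spanish one and is an assumption of this example; it is not taken from Oracle's XSPANISH definition, and the name xspanish_key is hypothetical.

```python
import re

# Traditional Spanish alphabet with ch and ll as letters (assumed order).
ALPHABET = [
    "a", "b", "c", "ch", "d", "e", "f", "g", "h", "i", "j", "k", "l", "ll",
    "m", "n", "\u00f1", "o", "p", "q", "r", "s", "t", "u", "v", "w", "x",
    "y", "z",
]
RANK = {letter: index for index, letter in enumerate(ALPHABET)}

# Match the two-letter units first, then fall back to single characters.
TOKEN = re.compile("ch|ll|.", re.DOTALL)


def xspanish_key(word):
    """Sort key that ranks ch and ll as single letters."""
    letters = TOKEN.findall(word.lower())
    # Characters outside the alphabet sort after every known letter.
    return [RANK.get(letter, len(ALPHABET) + ord(letter[0])) for letter in letters]


words = ["cuna", "chico", "dama", "luz", "llama"]
print(sorted(words))                    # ordering by code point values
print(sorted(words, key=xspanish_key))  # ch after c, ll after l
```

The first call places chico before cuna and llama before luz; the second keeps cuna before chico and luz before llama.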

All of the linguistic sorts can also be performed as case-insensitive or accent-insensitive by appending _CI or _AI to the linguistic sort name.
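The following Python sketch shows how such a name is formed and gives an informal picture of what the two kinds of comparison ignore. The helper names are hypothetical, the comparison keys rely on Unicode normalization from the Python standard library rather than on Oracle's collation rules, and the accent-insensitive key in this sketch also folds case, which is an assumption of the example.

```python
import unicodedata


def with_suffix(sort_name, suffix):
    """Build a name such as GERMAN_CI or FRENCH_AI from a linguistic sort name."""
    if suffix not in ("CI", "AI"):
        raise ValueError("suffix must be CI or AI")
    return sort_name + "_" + suffix


def ci_key(text):
    """Comparison key that ignores case but keeps accents."""
    return text.casefold()


def ai_key(text):
    """Comparison key that ignores accents (and, in this sketch, case)."""
    decomposed = unicodedata.normalize("NFD", text)
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return base.casefold()


print(with_suffix("GERMAN", "CI"))
print(ci_key("R\u00e9sum\u00e9") == ci_key("R\u00c9SUM\u00c9"))
print(ai_key("resume") == ai_key("R\u00c9SUM\u00c9"))
```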

Table A-11 lists the monolingual linguistic sorts supported by the Oracle server.


See Also:

Table A-1, "Oracle Supported Languages" for a list of the default sort for each language

Table A-11 Monolingual Linguistic Sorts

Basic Name Extended Name Special Cases
ARABIC - -
ARABIC_MATCH - -
ARABIC_ABJ_SORT - -
ARABIC_ABJ_MATCH - -
ASCII7 - -
AZERBAIJANI XAZERBAIJANI i, I, lowercase i without dot, uppercase I with dot
BENGALI - -
BIG5 - -
BINARY - -
BULGARIAN - -
CATALAN XCATALAN æ, AE, ß
CROATIAN XCROATIAN D, L, N, d, l, n, ß
CZECH XCZECH ch, CH, Ch, ß
CZECH_PUNCTUATION XCZECH_PUNCTUATION ch, CH, Ch, ß
DANISH XDANISH A, ß, Å, å
DUTCH XDUTCH ij, IJ
EBCDIC - -
EEC_EURO - -
EEC_EUROPA3 - -
ESTONIAN - -
FINNISH - -
FRENCH XFRENCH -
GERMAN XGERMAN ß
GERMAN_DIN XGERMAN_DIN ß, ä, ö, ü, Ä, Ö, Ü
GBK - -
GREEK - -
HEBREW - -
HKSCS - -
HUNGARIAN XHUNGARIAN cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs
ICELANDIC - -
INDONESIAN - -
ITALIAN - -
LATIN - -
LATVIAN - -
LITHUANIAN - -
MALAY - -
NORWEGIAN - -
POLISH - -
PUNCTUATION XPUNCTUATION -
ROMANIAN - -
RUSSIAN - -
SLOVAK XSLOVAK dz, DZ, Dz, ß (caron)
SLOVENIAN XSLOVENIAN ß
SPANISH XSPANISH ch, ll, CH, Ch, LL, Ll
SWEDISH - -
SWISS XSWISS ß
TURKISH XTURKISH æ, AE, ß
UKRAINIAN - -
UNICODE_BINARY - -
VIETNAMESE - -
WEST_EUROPEAN XWEST_EUROPEAN ß

Table A-12 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M sorts Korean and Latin-based characters, but it does not collate Chinese, Thai, or Japanese characters.

Table A-12 Multilingual Linguistic Sorts

Sort Name Description
CANADIAN_M Canadian French sort supports reverse secondary, special expanding characters
DANISH_M Danish sort supports sorting lowercase characters before uppercase characters
FRENCH_M French sort supports reverse sort for secondary
GENERIC_M Generic sorting order that is based on ISO14651 and Unicode canonical equivalence rules, excluding compatibility equivalence rules
JAPANESE_M Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS
KOREAN_M Korean sort: Hangul characters are based on Unicode binary order. Hanja characters are based on pronunciation order. All Hangul characters sort before Hanja characters
SPANISH_M Traditional Spanish sort supports special contracting characters
THAI_M Thai sort supports swap characters for some vowels and consonants
SCHINESE_RADICAL_M Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order
SCHINESE_STROKE_M Simplified Chinese sort uses number of strokes as primary order and radical as secondary order
SCHINESE_PINYIN_M Simplified Chinese PinYin sorting order
TCHINESE_RADICAL_M Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order
TCHINESE_STROKE_M Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters.

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-13 lists the other calendar systems supported by the Oracle server.

Table A-13 Supported Calendar Systems

Name Default Date Format Character Set Used For Default Date Format
Japanese Imperial EEYYMMDD JA16EUC
ROC Official EEyymmdd ZHT32EUC
Thai Buddha dd month EE yyyy TH8TISASCII
Persian DD Month YYYY AR8ASMO8X
Arabic Hijrah DD Month YYYY AR8ISO8859P6
English Hijrah DD Month YYYY AR8ISO8859P6
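
As a rough illustration of how the era-based years in Table A-13 relate to Gregorian years, the following Python sketch computes the era year for three of the listed calendars. The era offsets and the Heisei start and end dates are general calendar facts, not values taken from this guide, and the function names are hypothetical. The sketch does not reproduce Oracle date formatting.

```python
from datetime import date

# Assumptions (not from this guide): general calendar facts.
HEISEI_START = date(1989, 1, 8)    # first day of the Heisei era
HEISEI_END = date(2019, 4, 30)     # last day of the Heisei era
ROC_EPOCH_YEAR = 1911              # ROC year 1 corresponds to 1912
THAI_BUDDHIST_OFFSET = 543         # Buddhist Era year = Gregorian year + 543


def heisei_year(d):
    """Return the Japanese Imperial (Heisei era) year for a Gregorian date."""
    if not (HEISEI_START <= d <= HEISEI_END):
        raise ValueError("date is outside the Heisei era")
    return d.year - 1988


def roc_year(d):
    """Return the ROC Official year for a Gregorian date."""
    if d.year <= ROC_EPOCH_YEAR:
        raise ValueError("date precedes ROC year 1")
    return d.year - ROC_EPOCH_YEAR


def thai_buddhist_year(d):
    """Return the Thai Buddha (Buddhist Era) year for a Gregorian date."""
    return d.year + THAI_BUDDHIST_OFFSET
```

For March 27, 1998, the date used in Figure A-1, the sketch yields Heisei year 10, ROC year 87, and Thai Buddhist year 2541.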

Figure A-1 shows how March 27, 1998 appears in Japanese Imperial.

Figure A-1 Japanese Imperial Example


Time Zone Names

Table A-14 shows the time zone names in the default time zone file that is supplied with the Oracle Database. The default time zone file is $ORACLE_HOME/oracore/zoneinfo/timezlrg.dat. Oracle also supplies a smaller time zone file, $ORACLE_HOME/oracore/zoneinfo/timezone.dat. See Chapter 4, "Datetime Datatypes and Time Zone Support" for more information regarding time zone files.

Table A-14 Time Zone Names

Time Zone Name Is It in the Smaller Time Zone File? Time Zone Name Is It in the Smaller Time Zone File?
Africa/Algiers No Australia/Perth Yes
Africa/Cairo Yes Australia/Queensland Yes
Africa/Casablanca No Australia/South Yes
Africa/Ceuta No Australia/Sydney Yes
Africa/Djibouti No Australia/Tasmania Yes
Africa/Freetown No Australia/Victoria Yes
Africa/Johannesburg No Australia/West Yes
Africa/Khartoum No Australia/Yancowinna Yes
Africa/Mogadishu No Brazil/Acre Yes
Africa/Nairobi No Brazil/DeNoronha Yes
Africa/Nouakchott No Brazil/East Yes
Africa/Tripoli Yes Brazil/West Yes
Africa/Tunis No CET Yes
Africa/Windhoek No CST Yes
America/Adak Yes CST6CDT Yes
America/Anchorage Yes Canada/Atlantic Yes
America/Anguilla No Canada/Central Yes
America/Araguaina No Canada/East-Saskatchewan Yes
America/Aruba No Canada/Eastern Yes
America/Asuncion No Canada/Mountain Yes
America/Atka Yes Canada/Newfoundland Yes
America/Belem No Canada/Pacific Yes
America/Boa_Vista No Canada/Saskatchewan Yes
America/Bogota No Canada/Yukon Yes
America/Boise No Chile/Continental Yes
America/Buenos_Aires No Chile/EasterIsland Yes
America/Cambridge_Bay No Cuba Yes
America/Cancun No EET Yes
America/Caracas No EST Yes
America/Cayenne No EST5EDT Yes
America/Cayman No Egypt Yes
America/Chicago Yes Eire Yes
America/Chihuahua No Etc/GMT Yes
America/Costa_Rica No Etc/GMT+0 Yes
America/Cuiaba No Etc/GMT+1 Yes
America/Curacao No Etc/GMT+10 Yes
America/Dawson No Etc/GMT+11 Yes
America/Dawson_Creek No Etc/GMT+12 Yes
America/Denver Yes Etc/GMT+2 Yes
America/Detroit Yes Etc/GMT+3 Yes
America/Edmonton Yes Etc/GMT+4 Yes
America/El_Salvador No Etc/GMT+5 Yes
America/Ensenada Yes Etc/GMT+6 Yes
America/Fort_Wayne Yes Etc/GMT+7 Yes
America/Fortaleza No Etc/GMT+8 Yes
America/Godthab No Etc/GMT+9 Yes
America/Goose_Bay No Etc/GMT-0 Yes
America/Grand_Turk No Etc/GMT-1 Yes
America/Guadeloupe No Etc/GMT-10 Yes
America/Guatemala No Etc/GMT-11 Yes
America/Guayaquil No - -
America/Halifax Yes Etc/GMT-12 Yes
America/Havana Yes Etc/GMT-13 Yes
America/Indiana/Indianapolis Yes Etc/GMT-2 Yes
America/Indiana/Knox No Etc/GMT-3 Yes
America/Indiana/Marengo No Etc/GMT-4 Yes
America/Indiana/Vevay No Etc/GMT-5 Yes
America/Indianapolis Yes Etc/GMT-6 Yes
America/Inuvik No Etc/GMT-7 Yes
America/Iqaluit No Etc/GMT-8 Yes
America/Jamaica Yes Etc/GMT-9 Yes
America/Juneau No Etc/GMT0 Yes
America/Knox_IN No Etc/Greenwich Yes
America/La_Paz No Europe/Amsterdam No
America/Lima No Europe/Athens No
America/Los_Angeles Yes Europe/Belfast No
America/Louisville No Europe/Belgrade No
America/Maceio No Europe/Berlin No
America/Managua No Europe/Bratislava No
America/Manaus Yes Europe/Brussels No
America/Martinique No Europe/Bucharest No
America/Mazatlan Yes Europe/Budapest No
America/Mexico_City Yes Europe/Copenhagen No
America/Miquelon No Europe/Dublin Yes
America/Montevideo No Europe/Gibraltar No
America/Montreal Yes Europe/Helsinki No
America/Montserrat No Europe/Istanbul Yes
America/New_York Yes Europe/Kaliningrad No
America/Nome No Europe/Kiev No
America/Noronha Yes Europe/Lisbon Yes
America/Panama No Europe/Ljubljana No
America/Phoenix Yes Europe/London Yes
America/Porto_Acre No Europe/Luxembourg No
America/Porto_Velho No Europe/Madrid No
America/Puerto_Rico No Europe/Minsk No
America/Rankin_Inlet No Europe/Monaco No
America/Regina Yes Europe/Moscow Yes
America/Rio_Branco Yes - -
America/Santiago Yes Europe/Oslo No
America/Sao_Paulo Yes Europe/Paris No
America/Scoresbysund No Europe/Prague No
America/Shiprock Yes Europe/Riga No
America/St_Johns Yes Europe/Rome No
America/St_Thomas No Europe/Samara No
America/Swift_Current No Europe/San_Marino No
America/Tegucigalpa No Europe/Sarajevo No
America/Thule No Europe/Simferopol No
America/Thunder_Bay No Europe/Skopje No
America/Tijuana Yes Europe/Sofia No
America/Tortola No Europe/Stockholm No
America/Vancouver Yes Europe/Tallinn No
America/Virgin No Europe/Tirane No
America/Whitehorse Yes Europe/Vatican No
America/Winnipeg Yes Europe/Vienna No
America/Yellowknife No Europe/Vilnius No
Arctic/Longyearbyen No Europe/Warsaw Yes
Asia/Aden No Europe/Zagreb No
Asia/Almaty No Europe/Zurich No
Asia/Amman No GB Yes
Asia/Anadyr No GB-Eire Yes
Asia/Aqtau No GMT Yes
Asia/Aqtobe No GMT+0 Yes
Asia/Baghdad No GMT-0 Yes
Asia/Bahrain No GMT0 Yes
Asia/Baku No Greenwich Yes
Asia/Bangkok No HST Yes
Asia/Beirut No Hongkong Yes
Asia/Bishkek No Iceland Yes
Asia/Calcutta Yes Indian/Chagos No
Asia/Chongqing No - -
Asia/Chungking No Indian/Christmas No
Asia/Dacca No Indian/Cocos No
Asia/Damascus No Indian/Mayotte No
Asia/Dhaka No - -
Asia/Dubai No Indian/Reunion No
Asia/Gaza No Iran Yes
Asia/Harbin No Israel Yes
Asia/Hong_Kong Yes Jamaica Yes
Asia/Irkutsk No Japan Yes
Asia/Istanbul Yes Kwajalein Yes
Asia/Jakarta No Libya Yes
Asia/Jayapura No MET Yes
Asia/Jerusalem Yes MST Yes
Asia/Kabul No MST7MDT Yes
Asia/Kamchatka No Mexico/BajaNorte Yes
Asia/Karachi No Mexico/BajaSur Yes
Asia/Kashgar No Mexico/General Yes
Asia/Krasnoyarsk No NZ Yes
Asia/Kuala_Lumpur No NZ-CHAT Yes
Asia/Kuching No Navajo Yes
Asia/Kuwait No PRC Yes
Asia/Macao No PST Yes
Asia/Macau No - -
Asia/Magadan No PST8PDT Yes
Asia/Manila No Pacific/Auckland Yes
Asia/Muscat No Pacific/Chatham Yes
Asia/Nicosia No Pacific/Easter Yes
Asia/Novosibirsk No Pacific/Fakaofo No
Asia/Omsk No Pacific/Fiji No
Asia/Qatar No Pacific/Gambier No
Asia/Rangoon No Pacific/Guam No
Asia/Riyadh Yes Pacific/Honolulu Yes
Asia/Saigon No Pacific/Johnston No
Asia/Seoul Yes Pacific/Kiritimati No
Asia/Shanghai Yes Pacific/Kwajalein Yes
Asia/Singapore Yes Pacific/Marquesas No
Asia/Taipei Yes Pacific/Midway No
Asia/Tashkent No Pacific/Niue No
Asia/Tbilisi No Pacific/Norfolk No
Asia/Tehran Yes Pacific/Noumea No
Asia/Tel_Aviv Yes Pacific/Pago_Pago Yes
Asia/Tokyo Yes Pacific/Pitcairn No
Asia/Ujung_Pandang No Pacific/Rarotonga No
Asia/Urumqi No Pacific/Saipan No
Asia/Vladivostok No Pacific/Samoa Yes
Asia/Yakutsk No Pacific/Tahiti No
Asia/Yekaterinburg No Pacific/Tongatapu No
Asia/Yerevan No Pacific/Wake No
Atlantic/Azores No Pacific/Wallis No
Atlantic/Bermuda No Poland Yes
Atlantic/Canary No Portugal Yes
Atlantic/Faeroe No ROC Yes
Atlantic/Madeira No ROK Yes
Atlantic/Reykjavik Yes Singapore Yes
Atlantic/St_Helena No Turkey Yes
Atlantic/Stanley No US/Alaska Yes
Australia/ACT Yes US/Aleutian Yes
Australia/Adelaide Yes US/Arizona Yes
Australia/Brisbane Yes US/Central Yes
Australia/Broken_Hill Yes US/East-Indiana Yes
Australia/Canberra Yes US/Eastern Yes
Australia/Darwin Yes US/Hawaii Yes
Australia/Hobart Yes US/Indiana-Starke No
Australia/LHI Yes US/Michigan Yes
Australia/Lindeman Yes US/Mountain Yes
Australia/Lord_Howe Yes US/Pacific Yes
Australia/Melbourne Yes US/Pacific-New Yes
Australia/NSW Yes US/Samoa Yes
Australia/North Yes UTC No
- - W-SU Yes
- - WET Yes
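
When deciding whether the smaller time zone file is sufficient, it can help to check the time zone names that an application uses against Table A-14. The following Python sketch does this for a short excerpt of the table. The dictionary holds only a handful of rows copied from Table A-14, and the function name is hypothetical; a real check would load the complete table.

```python
# Excerpt of Table A-14: True means the name is in the smaller time zone file.
IN_SMALLER_FILE = {
    "Africa/Cairo": True,
    "America/New_York": True,
    "Asia/Bangkok": False,
    "Asia/Tokyo": True,
    "Europe/London": True,
    "Europe/Paris": False,
    "UTC": False,
}


def zones_needing_large_file(zone_names):
    """Return the names that appear only in the default (larger) file."""
    missing = []
    for name in zone_names:
        if name not in IN_SMALLER_FILE:
            # The excerpt is incomplete, so unknown names are reported loudly.
            raise KeyError("not in this excerpt of Table A-14: " + name)
        if not IN_SMALLER_FILE[name]:
            missing.append(name)
    return missing
```

If the returned list is empty for every name the application uses, the excerpt gives no reason to prefer the larger file.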

Obsolete Locale Data

This section contains information about obsolete linguistic sorts, character sets, languages, and territories.

Updates to the Oracle Language and Territory Definition Files

Changes have been made to the content in some of the language and territory definition files in Oracle Database 10g. These updates correct legacy definitions that no longer meet the local conventions in some of the Oracle supported languages and territories. The changes include modifications to currency symbols, month names, and group separators. For example, the local currency symbol for Brazil has been updated from Cr$ to R$ in Oracle Database 10g.

Please refer to the "Oracle Language and Territory definition changes" table documented in the $ORACLE_HOME/nls/data/old/data_changes.html file for a detailed list of the changes.

Oracle Database 10g customers should review their existing application code to make sure that the correct cultural conventions that are defined in Oracle Database 10g are being used. For customers who may not be able to make the necessary code changes to support their applications, Oracle offers Oracle9i locale definition files with Oracle Database 10g.

To revert to the Oracle9i language and territory behavior, perform the following steps:

  1. Shut down the database.

  2. Run the script cr9idata.pl from the $ORACLE_HOME/nls/data/old directory.

  3. Set the ORA_NLS10 environment variable to the newly created $ORACLE_HOME/nls/data/9idata directory.

  4. Restart the database.

Repeat steps 2 and 3 on every Oracle Database 10g client that needs to revert to the Oracle9i definition files.
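
Steps 2 and 3 can be sketched in Python as follows. This is a minimal sketch, not an Oracle-supplied tool: the function names are hypothetical, and running the script through a `perl` executable found on the PATH is an assumption. Only the script name and the two directories come from the steps above.

```python
import os
import subprocess
from pathlib import Path


def oracle9i_locale_paths(oracle_home):
    """Return the script and directory locations named in steps 2 and 3."""
    data_dir = Path(oracle_home) / "nls" / "data"
    return {
        "script": data_dir / "old" / "cr9idata.pl",
        "ora_nls10": data_dir / "9idata",
    }


def environment_with_9i_locale(oracle_home, base_env=None):
    """Return a copy of the environment with ORA_NLS10 set as in step 3."""
    env = dict(os.environ if base_env is None else base_env)
    env["ORA_NLS10"] = str(oracle9i_locale_paths(oracle_home)["ora_nls10"])
    return env


def run_cr9idata(oracle_home):
    """Run the script from step 2 (assumes perl is available on the PATH)."""
    script = oracle9i_locale_paths(oracle_home)["script"]
    subprocess.run(["perl", str(script)], cwd=str(script.parent), check=True)
```

The environment returned by `environment_with_9i_locale` applies only to processes started with it. The database still has to be shut down beforehand and restarted afterward, as in steps 1 and 4.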

Oracle Corporation strongly recommends that customers use the Oracle Database 10g locale definition files; Oracle9i locale definition files will be desupported in a future release.

Obsolete Linguistic Sorts

Table A-15 contains linguistic sorts that have been desupported in Oracle Database 10g.

Table A-15 Obsolete Linguistic Sorts in Oracle Database 10g

Obsolete Sort Name Replacement Sort
THAI_TELEPHONE THAI_M
THAI_DICTIONARY THAI_M
CANADIAN_FRENCH CANADIAN_M
JAPANESE JAPANESE_M
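
An application that stores or accepts sort names can translate the obsolete names in Table A-15 before using them. The following Python sketch shows one way to do this. The function names are hypothetical, and the generated `ALTER SESSION SET NLS_SORT` text is only an illustration of where the replacement name would be used.

```python
# Obsolete sort name -> replacement sort, copied from Table A-15.
OBSOLETE_SORTS = {
    "THAI_TELEPHONE": "THAI_M",
    "THAI_DICTIONARY": "THAI_M",
    "CANADIAN_FRENCH": "CANADIAN_M",
    "JAPANESE": "JAPANESE_M",
}


def replacement_sort(sort_name):
    """Return the replacement for an obsolete sort, or the name unchanged."""
    key = sort_name.strip().upper()
    return OBSOLETE_SORTS.get(key, key)


def alter_session_statement(sort_name):
    """Build statement text that sets NLS_SORT to a supported sort name."""
    return "ALTER SESSION SET NLS_SORT = " + replacement_sort(sort_name)
```

Names that do not appear in Table A-15 pass through unchanged (apart from uppercasing), so the helper can be applied to every configured sort name.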

CIS Is No Longer the Default Territory When the Language Is RUSSIAN

RUSSIA has been added as a territory in Oracle Database 10g. It is the default territory when the language is RUSSIAN. The CIS territory, which was the default territory in previous releases when the language was RUSSIAN, is supported for backward compatibility.

YUGOSLAVIA Is a Deprecated Territory

YUGOSLAVIA is a deprecated territory in Oracle Database 10g. Use SERBIA AND MONTENEGRO instead.

New Names for Obsolete Character Sets

Before Oracle release 7.2, when a character set was renamed, the old name was usually supported along with the new name for several releases after the change. Beginning with release 7.2, the old names are no longer supported.

Table A-16 lists the affected character sets. If you reference any of these character sets in your code, then replace them with their new names.

Table A-16 New Names for Obsolete Character Sets

Old Name New Name
AL24UTFFSS UTF8, AL32UTF8
AR8MSAWIN AR8MSWIN1256
CL8EBCDIC875S CL8EBCDIC875R
EL8EBCDIC875S EL8EBCDIC875R
JVMS JA16VMS
JEUC JA16EUC
SJIS JA16SJIS
JDBCS JA16DBCS
KSC5601 KO16KSC5601
KDBCS KO16DBCS
CGB2312-80 ZHS16CGB231280
CNS 11643-86 ZHT32EUC
JA16EUCFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHS32EUCFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHS16GBKFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
JA16DBCSFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
KO16DBCSFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHS16DBCSFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHS16CGB231280FIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHT16DBCSFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
KO16KSC5601FIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
JA16SJISFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHT16BIG5FIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.
ZHT32TRISFIXED None. Replaced by the new national character sets UTF8 and AL16UTF16.

Character set CL8MSWINDOW31 has been desupported. The newer character set CL8MSWIN1251 duplicates CL8MSWINDOW31 and also includes some characters that were omitted from the earlier version. Replace any use of CL8MSWINDOW31 with CL8MSWIN1251.
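
To find references to renamed character sets in scripts or configuration files, a simple text scan is often enough. The following Python sketch searches text for the old names and reports the suggested replacement. It covers only the rows of Table A-16 that have a single new name, plus CL8MSWINDOW31; the rows with two new names or with no replacement are left out because they need a manual decision. The function name and the matching rule (old names must not be embedded in a longer identifier) are assumptions of this sketch.

```python
import re

# Old name -> new name, copied from Table A-16 (single-replacement rows only),
# plus CL8MSWINDOW31 from the paragraph that follows the table.
RENAMED_CHARSETS = {
    "AR8MSAWIN": "AR8MSWIN1256",
    "CL8EBCDIC875S": "CL8EBCDIC875R",
    "EL8EBCDIC875S": "EL8EBCDIC875R",
    "JVMS": "JA16VMS",
    "JEUC": "JA16EUC",
    "SJIS": "JA16SJIS",
    "JDBCS": "JA16DBCS",
    "KSC5601": "KO16KSC5601",
    "KDBCS": "KO16DBCS",
    "CGB2312-80": "ZHS16CGB231280",
    "CNS 11643-86": "ZHT32EUC",
    "CL8MSWINDOW31": "CL8MSWIN1251",
}

# Match an old name only when it is not part of a longer identifier,
# so that SJIS inside JA16SJIS is not reported.
_NAMES = sorted(RENAMED_CHARSETS, key=len, reverse=True)
_PATTERN = re.compile(
    r"(?<![A-Za-z0-9_])("
    + "|".join(re.escape(name) for name in _NAMES)
    + r")(?![A-Za-z0-9_])",
    re.IGNORECASE,
)


def find_obsolete_charsets(text):
    """Return (line_number, old_name, new_name) for each old name found."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in _PATTERN.finditer(line):
            old = match.group(1).upper()
            hits.append((lineno, old, RENAMED_CHARSETS[old]))
    return hits
```

The scan only reports candidates. Each hit should be reviewed before the name is changed, because a short name such as SJIS can also appear in comments or unrelated strings.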

AL24UTFFSS Character Set Desupported

The Unicode character set AL24UTFFSS was desupported in Oracle9i. AL24UTFFSS was introduced in version 7 as the Unicode character set supporting the UTF-8 encoding scheme, based on Unicode standard 1.1, which is now obsolete. In Oracle Database 10g, Oracle offers the Unicode database character set AL32UTF8, which is based on Unicode 3.2, and UTF8, which is based on Unicode 3.0.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

Bengali Language Definition Deprecated

Oracle's Bengali language definition is not compatible with Unicode standards. Oracle recommends that customers use the Bangla language definition instead. Bangla was introduced in Oracle9i Database Release 1 (9.0.1).

The Bengali language definition is supported in Oracle Database 10g Release 1 (10.1), but it may be desupported in a future release.

Czechoslovakia Territory Definition Deprecated

Oracle recommends that customers use either the Czech Republic or the Slovakia territory definition in Oracle Database 10g Release 1 (10.1). The Czechoslovakia territory definition is supported in Oracle Database 10g Release 1 (10.1), but it may be desupported in a future release.