#====================================================================#
#-- You can always find the latest version of this file at:
#-- http://mimersbrunn.sourceforge.net
#-- FILENAME -----: tk_charset
#-- DESCRIPTION --: Mapping of Tcl-encoding names to IANA-list
#-- The encoding names are from Tcl 8.3
#-- AUTHOR -------: Veronica Loell
#-- EMAIL --------: info@nakawe.se
#-- FILE CREATED -: 2002-02-24 14:44
#-- LAST CHANGED -: 2002-02-24 14:44
#-- COPYRIGHT ----: Veronica Loell
# This document is released under:
# GNU Free Documentation License Version 1.1, March 2000
#====================================================================#

References:
[http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html] (Rosette 3.0)
[http://www.eki.ee/letter/] (viewed at: 2002-02-24 14:19)
[http://www.iana.org/assignments/character-sets] (last updated 2001 August 23)
===================================================================

cp860
    Name: IBM860 [RFC1345,KXS2]
    MIBenum: 2048
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp860
    Alias: 860
    Alias: csIBM860

cp861
    Name: IBM861 [RFC1345,KXS2]
    MIBenum: 2049
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp861
    Alias: 861
    Alias: cp-is
    Alias: csIBM861

cp862
    Name: IBM862 [RFC1345,KXS2]
    MIBenum: 2013
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp862
    Alias: 862
    Alias: csPC862LatinHebrew

cp863
    Name: IBM863 [RFC1345,KXS2]
    MIBenum: 2050
    Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991
    Alias: cp863
    Alias: 863
    Alias: csIBM863

cp864
    Name: IBM864 [RFC1345,KXS2]
    MIBenum: 2051
    Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991
    Alias: cp864
    Alias: csIBM864

cp865
    Name: IBM865 [RFC1345,KXS2]
    MIBenum: 2052
    Source: IBM DOS 3.3 Ref (Abridged), 94X9575 (Feb 1987)
    Alias: cp865
    Alias: 865
    Alias: csIBM865

cp866
    Name: IBM866 [Pond]
    MIBenum: 2086
    Source: IBM NLDG Volume 2 (SE09-8002-03) August 1994
    Alias: cp866
    Alias: 866
    Alias: csIBM866

cp869
    Name: IBM869 [RFC1345,KXS2]
    MIBenum: 2054
    Source: IBM Keyboard layouts and code pages, PN 07G4586 June 1991
    Alias: cp869
    Alias: 869
    Alias: cp-gr
    Alias: csIBM869

cp737
    DOS Greek
    source: http://www.eki.ee/letter/

cp949
    Korean
    Microsoft & IBM
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

cp950
    Chinese, Traditional
    Microsoft & IBM
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

gb12345
    Chinese, Traditional
    International or National Standard
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

dingbats
    Adobe-Zapf-Dingbats-Encoding
    Symbol (used in PS printers)
    Adobe
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

ksc5601
    Name: KS_C_5601-1987 [RFC1345,KXS2]
    MIBenum: 36
    Source: ECMA registry
    Alias: iso-ir-149
    Alias: KS_C_5601-1989
    Alias: KSC_5601
    Alias: korean
    Alias: csKSC56011987

macCentEuro
    ?

cp874
    Thai
    source: http://www.eki.ee/letter/

macUkraine
    Ukrainian
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

jis0201
    Name: JIS_C6220-1969-jp [RFC1345,KXS2]
    MIBenum: 41
    Source: ECMA registry
    Alias: JIS_C6220-1969
    Alias: iso-ir-13
    Alias: katakana
    Alias: x0201-7
    Alias: csISO13JISC6220jp

gb2312
    Name: GB_2312-80 [RFC1345,KXS2]
    MIBenum: 57
    Source: ECMA registry
    Alias: iso-ir-58
    Alias: chinese
    Alias: csISO58GB231280

euc-cn
    Name: GB2312 (preferred MIME name)
    MIBenum: 2025
    Source: Chinese for People's Republic of China (PRC) mixed one byte,
            two byte set:
              20-7E = one byte ASCII
              A1-FE = two byte PRC Kanji
            See GB 2312-80
            PCL Symbol Set Id: 18C
    Alias: csGB2312

euc-jp
    Name: Extended_UNIX_Code_Packed_Format_for_Japanese
    MIBenum: 18
    Source: Standardized by OSF, UNIX International, and UNIX Systems
            Laboratories Pacific.
            Uses ISO 2022 rules to select
              code set 0: US-ASCII (a single 7-bit byte set)
              code set 1: JIS X0208-1990 (a double 8-bit byte set)
                          restricted to A0-FF in both bytes
              code set 2: Half Width Katakana (a single 7-bit byte set)
                          requiring SS2 as the character prefix
              code set 3: JIS X0212-1990 (a double 7-bit byte set)
                          restricted to A0-FF in both bytes
                          requiring SS3 as the character prefix
    Alias: csEUCPkdFmtJapanese
    Alias: EUC-JP (preferred MIME name)

macThai
    Thai
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

jis0208
    Name: JIS_C6226-1983 [RFC1345,KXS2]
    MIBenum: 63
    Source: ECMA registry
    Alias: iso-ir-87
    Alias: x0208
    Alias: JIS_X0208-1983
    Alias: csISO87JISX0208

iso2022-jp
    Name: ISO-2022-JP (preferred MIME name) [RFC1468,Murai]
    MIBenum: 39
    Source: RFC-1468 (see also RFC-2237)
    Alias: csISO2022JP

macIceland
    Icelandic
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

iso2022
    ?

jis0212
    Name: JIS_X0212-1990 [RFC1345,KXS2]
    MIBenum: 98
    Source: ECMA registry
    Alias: x0212
    Alias: iso-ir-159
    Alias: csISO159JISX02121990

big5
    Name: Big5 (preferred MIME name)
    MIBenum: 2026
    Source: Chinese for Taiwan Multi-byte set.
            PCL Symbol Set Id: 18T
    Alias: csBig5

euc-kr
    Name: EUC-KR (preferred MIME name) [RFC1557,Choi]
    MIBenum: 38
    Source: RFC-1557 (see also KS_C_5861-1992)
    Alias: csEUCKR

macRomania
    MacRomanian
    Romanian
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

macTurkish
    Turkish
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

gb1988
    Name: GB_1988-80 [RFC1345,KXS2]
    MIBenum: 56
    Source: ECMA registry
    Alias: iso-ir-57
    Alias: cn
    Alias: ISO646-CN
    Alias: csISO57GB1988

iso2022-kr
    Name: ISO-2022-KR (preferred MIME name) [RFC1557,Choi]
    MIBenum: 37
    Source: RFC-1557 (see also KS_C_5601-1987)
    Alias: csISO2022KR

macGreek
    Greek
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

ascii
    Name: ANSI_X3.4-1968 [RFC1345,KXS2]
    MIBenum: 3
    Source: ECMA registry
    Alias: iso-ir-6
    Alias: ANSI_X3.4-1986
    Alias: ISO_646.irv:1991
    Alias: ASCII
    Alias: ISO646-US
    Alias: US-ASCII (preferred MIME name)
    Alias: us
    Alias: IBM367
    Alias: cp367
    Alias: csASCII

cp437
    Name: IBM437 [RFC1345,KXS2]
    MIBenum: 2011
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp437
    Alias: 437
    Alias: csPC8CodePage437

macRoman
    Latin
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

iso8859-1
    Name: ISO_8859-1:1987 [RFC1345,KXS2]
    MIBenum: 4
    Source: ECMA registry
    Alias: iso-ir-100
    Alias: ISO_8859-1
    Alias: ISO-8859-1 (preferred MIME name)
    Alias: latin1
    Alias: l1
    Alias: IBM819
    Alias: CP819
    Alias: csISOLatin1

iso8859-2
    Name: ISO_8859-2:1987 [RFC1345,KXS2]
    MIBenum: 5
    Source: ECMA registry
    Alias: iso-ir-101
    Alias: ISO_8859-2
    Alias: ISO-8859-2 (preferred MIME name)
    Alias: latin2
    Alias: l2
    Alias: csISOLatin2

iso8859-3
    Name: ISO_8859-3:1988 [RFC1345,KXS2]
    MIBenum: 6
    Source: ECMA registry
    Alias: iso-ir-109
    Alias: ISO_8859-3
    Alias: ISO-8859-3 (preferred MIME name)
    Alias: latin3
    Alias: l3
    Alias: csISOLatin3

macCroatian
    Croatian
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

koi8-r
    Name: KOI8-R (preferred MIME name) [RFC1489]
    MIBenum: 2084
    Source: RFC 1489, based on GOST-19768-74, ISO-6937/8, INIS-Cyrillic,
            ISO-5427.
    Alias: csKOI8R

iso8859-4
    Name: ISO_8859-4:1988 [RFC1345,KXS2]
    MIBenum: 7
    Source: ECMA registry
    Alias: iso-ir-110
    Alias: ISO_8859-4
    Alias: ISO-8859-4 (preferred MIME name)
    Alias: latin4
    Alias: l4
    Alias: csISOLatin4

iso8859-5
    Name: ISO_8859-5:1988 [RFC1345,KXS2]
    MIBenum: 8
    Source: ECMA registry
    Alias: iso-ir-144
    Alias: ISO_8859-5
    Alias: ISO-8859-5 (preferred MIME name)
    Alias: cyrillic
    Alias: csISOLatinCyrillic

cp1250
    Name: windows-1250
    MIBenum: 2250
    Source: Microsoft (see ../character-set-info/windows-1250) [Lazhintseva]
    Alias: None

macCyrillic
    Cyrillic
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

iso8859-6
    Name: ISO_8859-6:1987 [RFC1345,KXS2]
    MIBenum: 9
    Source: ECMA registry
    Alias: iso-ir-127
    Alias: ISO_8859-6
    Alias: ISO-8859-6 (preferred MIME name)
    Alias: ECMA-114
    Alias: ASMO-708
    Alias: arabic
    Alias: csISOLatinArabic

cp1251
    Name: windows-1251
    MIBenum: 2251
    Source: Microsoft (see ../character-set-info/windows-1251) [Lazhintseva]
    Alias: None

macDingbats
    Symbol
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

iso8859-7
    Name: ISO_8859-7:1987 [RFC1947,RFC1345,KXS2]
    MIBenum: 10
    Source: ECMA registry
    Alias: iso-ir-126
    Alias: ISO_8859-7
    Alias: ISO-8859-7 (preferred MIME name)
    Alias: ELOT_928
    Alias: ECMA-118
    Alias: greek
    Alias: greek8
    Alias: csISOLatinGreek

cp1252
    Name: windows-1252
    MIBenum: 2252
    Source: Microsoft (see ../character-set-info/windows-1252) [Wendt]
    Alias: None

iso8859-8
    Name: ISO_8859-8:1988 [RFC1345,KXS2]
    MIBenum: 11
    Source: ECMA registry
    Alias: iso-ir-138
    Alias: ISO_8859-8
    Alias: ISO-8859-8 (preferred MIME name)
    Alias: hebrew
    Alias: csISOLatinHebrew

cp1253
    Name: windows-1253
    MIBenum: 2253
    Source: Microsoft (see ../character-set-info/windows-1253) [Lazhintseva]
    Alias: None

iso8859-9
    Name: ISO_8859-9:1989 [RFC1345,KXS2]
    MIBenum: 12
    Source: ECMA registry
    Alias: iso-ir-148
    Alias: ISO_8859-9
    Alias: ISO-8859-9 (preferred MIME name)
    Alias: latin5
    Alias: l5
    Alias: csISOLatin5

cp1254
    Name: windows-1254
    MIBenum: 2254
    Source: Microsoft (see ../character-set-info/windows-1254) [Lazhintseva]
    Alias: None

cp1255
    Name: windows-1255
    MIBenum: 2255
    Source: Microsoft (see ../character-set-info/windows-1255) [Lazhintseva]
    Alias: None

cp850
    Name: IBM850 [RFC1345,KXS2]
    MIBenum: 2009
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp850
    Alias: 850
    Alias: csPC850Multilingual

cp1256
    Name: windows-1256
    MIBenum: 2256
    Source: Microsoft (see ../character-set-info/windows-1256) [Lazhintseva]
    Alias: None

cp932
    Shift-JISMS
    Japanese
    MS_Kanji, CP932
    Microsoft & IBM
    Shift-JIS, SJIS
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

identity
    ?

cp1257
    Name: windows-1257
    MIBenum: 2257
    Source: Microsoft (see ../character-set-info/windows-1257) [Lazhintseva]
    Alias: None

cp852
    Name: IBM852 [RFC1345,KXS2]
    MIBenum: 2010
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp852
    Alias: 852
    Alias: csPCp852

macJapan
    MacJapanese
    Japanese
    Macintosh
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

cp1258
    Name: windows-1258
    MIBenum: 2258
    Source: Microsoft (see ../character-set-info/windows-1258) [Lazhintseva]
    Alias: None

shiftjis
    Name: Shift_JIS (preferred MIME name)
    MIBenum: 17
    Source: This charset is an extension of csHalfWidthKatakana by
            adding graphic characters in JIS X 0208. The CCS's are
            JIS X0201:1997 and JIS X0208:1997. The complete
            definition is shown in Appendix 1 of JIS X0208:1997.
            This charset can be used for the top-level media type "text".
    Alias: MS_Kanji
    Alias: csShiftJIS

utf-8
    Name: UTF-8 [RFC2279]
    MIBenum: 106
    Source: RFC 2279
    Alias: None

cp855
    Name: IBM855 [RFC1345,KXS2]
    MIBenum: 2046
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp855
    Alias: 855
    Alias: csIBM855

cp936
    Script: Chinese, Simplified
    Alias: GBK
    Vendor: Microsoft & IBM
    source: http://rosette.basistech.com/api3-0/Uniconv/currentencodings.html

symbol
    Name: Adobe-Symbol-Encoding [Adobe]
    MIBenum: 2020
    Source: PostScript Language Reference Manual
            PCL Symbol Set id: 5M
    Alias: csHPPSMath

cp775
    Name: IBM775 [HP-PCL5]
    MIBenum: 2087
    Source: HP PCL 5 Comparison Guide (P/N 5021-0329) pp B-13, 1996
    Alias: cp775
    Alias: csPC775Baltic

unicode
    Name: ISO-10646-UCS-2
    MIBenum: 1000
    Source: the 2-octet Basic Multilingual Plane, aka Unicode
            this needs to specify network byte order: the standard
            does not specify (it is a 16-bit integer space)
    Alias: csUnicode

cp857
    Name: IBM857 [RFC1345,KXS2]
    MIBenum: 2047
    Source: IBM NLS RM Vol2 SE09-8002-01, March 1990
    Alias: cp857
    Alias: 857
    Alias: csIBM857
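
Example (not part of the original listing): a minimal sketch of how a program might use this mapping, for instance to emit a MIME charset label for text written with a given Tcl encoding. The names TCL_TO_IANA and iana_name are hypothetical, and only a small subset of the entries is copied in. Each entry uses the alias marked "(preferred MIME name)" where the listing gives one, otherwise the registered Name. Entries the listing marks with "?" are treated as unmapped, which is an assumption of this sketch.

```python
# Hypothetical lookup table: Tcl 8.3 encoding name -> IANA charset name.
# Values are copied from the listing above; this is a subset, not the
# full table.
TCL_TO_IANA = {
    "ascii": "US-ASCII",        # alias marked as preferred MIME name
    "iso8859-1": "ISO-8859-1",  # alias marked as preferred MIME name
    "euc-jp": "EUC-JP",         # alias marked as preferred MIME name
    "shiftjis": "Shift_JIS",
    "big5": "Big5",
    "koi8-r": "KOI8-R",
    "utf-8": "UTF-8",
    "cp437": "IBM437",          # registered Name (no preferred MIME name listed)
    "cp1252": "windows-1252",   # registered Name
    "macCentEuro": None,        # listed with "?", treated here as unmapped
}


def iana_name(tcl_name):
    """Return the IANA name for a Tcl encoding name, or None if unmapped."""
    return TCL_TO_IANA.get(tcl_name)


if __name__ == "__main__":
    for name in ("iso8859-1", "cp1252", "macCentEuro"):
        print(name, "->", iana_name(name))
```

A caller that receives None can fall back to a default charset or refuse to label the text; the listing itself does not say which is preferable.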