Character Sets and Encoding Forms listed by Language

[CJK]  [Home]

UTF-16, the 16-bit encoded "transformation format" of  Unicode does not fit this table, therefore I provide a link to it here.

Languages Character Sets 8-bit Encoding Forms 7-bit Encoding Forms
Multilingual Unicode UTF-8 MIME, UTF-7
EACC 4-byte EACC 3-byte EACC
Traditional Chinese Big5 Big5 MIME
CNS EUC-TW MIME
Simplified Chinese GB2312-80 EUC-CN MIME, HZ
GBK GBK MIME
Japanese JIS X 0208 EUC-JP MIME, ISO-2022-JP
Shift-JIS MIME
Korean KS C 5601-1987 (Wansung) EUC-KR MIME, ISO-2022-KR
Unified Hangul Code Unified Hangul Code MIME
KSSM Johab ?

© Gyula Zsigri, 2000-2002 [CJK]  [Home] Last updated:  July 11, 2002