CJK Character Sets and Encoding Forms

[CJK]  [Home]

Contents


Basic Notions

Contents Up

The most popular character sets and encoding forms

Languages Character Sets Encoding Forms
Multilingual Unicode UTF-8
UTF-16
Traditional Chinese Big5 Big5
Simplified Chinese GB EUC-CN
Japanese JIS EUC-JP
Shift-JIS
Korean KSC EUC-KR

Click here for a bigger table.

Contents Up

Test your Browser

If you try to read a document in the wrong decoding mode, you will see gibberish instead of intelligible text.  You can change the decoding mode of your browser by clicking on View : Character Coding, or View  Encoding.

These two bytes
 
α
should be displayed as alpha alpha in Multilingual UTF-8 decoding mode
qia4 qia4 in Traditional Chinese Big5
wei3 wei3 in Simplified Chinese GB2312
ryuu ryuu in Japanese EUC-JP
gwan gwan in Korean EUC-KR

Click here if your browser does not properly decode CJK.

Contents Up

Links

Contents Up

© Gyula Zsigri, 2000-2002 [CJK]  [Home] Last updated:  June 13, 2002