This page only gives you an example of how you can encode characters in UTF-16 whose Unicode values are higher than 0xFFFF. Read RFC 2781 for first-hand information.
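Say you want to encode the character whose Unicode value is 0x2A6D6 (U+2A6D6). Subtracting 0x10000 from 0x2A6D6 gives 0x1A6D6.
Convert 0x1A6D6 to a 20-digit binary number to get
00011010011011010110
Fill in the empty slots of the following template
110110xxxxxxxxxx 110111xxxxxxxxxx
with the twenty digits, the first ten in the first word and the last ten in the second:
1101100001101001 1101111011010110
and you will get this surrogate pair: 0xD869 0xDED6.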
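The steps above can be sketched in Python. This is only an illustration, not code taken from RFC 2781; the function name to_surrogate_pair is made up for this example.

```python
def to_surrogate_pair(code_point):
    """Split a code point above 0xFFFF into a UTF-16 surrogate pair.

    Hypothetical helper for illustration; the name is not from RFC 2781.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point must be in the range 0x10000..0x10FFFF")
    # Subtract 0x10000 to get a value that fits in 20 bits.
    offset = code_point - 0x10000
    # The upper ten bits fill the template 110110xxxxxxxxxx (base 0xD800).
    high = 0xD800 | (offset >> 10)
    # The lower ten bits fill the template 110111xxxxxxxxxx (base 0xDC00).
    low = 0xDC00 | (offset & 0x3FF)
    return high, low


high, low = to_surrogate_pair(0x2A6D6)
print(f"0x{high:04X} 0x{low:04X}")  # prints 0xD869 0xDED6
```

As a cross-check, Python's built-in codec agrees: chr(0x2A6D6).encode("utf-16-be").hex() returns "d869ded6".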
Paul Hoffman & François Yergeau. 2000. UTF-16, an encoding of ISO 10646. RFC 2781.
© 2000-2002 Gyula Zsigri | Last updated: December 22, 2002