About Unicode and UTF-8
UCS stand for Universal Character Set is defined in the ISO standard 10646, it contains character to represent practically konw languages.
Basic Multilingual Plane (BMP) places the first 65534 positions(from 0×0000 to 0x FFFD).
Combing Charactor is link accent of encoidng.And it is defined in UCS.
Unicode is started by a consortium, and at 1990 ISO and it found that two different unified character set is not the world needs. So they join the efforts and work together on creating a singal code table.
发表评论
| Trackback
