About Unicode and UTF-8

December 1, 2009 | Category: Uncategorized | Tags:

UCS stands for Universal Character Set, which is defined in ISO standard 10646. It contains the characters needed to represent practically all known languages.

The Basic Multilingual Plane (BMP) occupies the first 65534 positions (from 0x0000 to 0xFFFD).
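To make the range concrete, here is a minimal Python sketch that checks whether a character falls inside the BMP range quoted above. The names BMP_FIRST, BMP_LAST, and in_bmp are made up for this illustration and are not part of any standard or library.

```python
# Bounds taken from the range quoted in the text (assumed names, for illustration only).
BMP_FIRST = 0x0000
BMP_LAST = 0xFFFD


def in_bmp(ch: str) -> bool:
    """Return True if the single character ch lies in the quoted BMP range."""
    return BMP_FIRST <= ord(ch) <= BMP_LAST


# Number of positions in the range: 0xFFFD - 0x0000 + 1
bmp_size = BMP_LAST - BMP_FIRST + 1
print(bmp_size)

print(in_bmp("A"))           # Latin capital letter A, U+0041
print(in_bmp("\u4e2d"))      # a CJK ideograph, U+4E2D
print(in_bmp("\U0001F600"))  # U+1F600 lies beyond the BMP
```

The first two characters land inside the range, while U+1F600 needs a code point above 0xFFFD and so falls outside it.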

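A combining character is like an accent: it is not a full character by itself, but a mark that attaches to the base character before it. Combining characters are defined in UCS.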
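As a small illustration, the sketch below uses Python's standard unicodedata module to look at one combining character, U+0301 (COMBINING ACUTE ACCENT). The choice of "e" as the base letter and the variable names are assumptions made only for this example.

```python
import unicodedata

base = "e"
accent = "\u0301"  # COMBINING ACUTE ACCENT, chosen as an example

# A base letter followed by a combining character is two code points,
# even though it is displayed as a single accented letter.
combined = base + accent
print(len(combined))

# unicodedata.combining() returns the canonical combining class:
# 0 for ordinary characters, non-zero for combining marks.
print(unicodedata.combining(base))
print(unicodedata.combining(accent))

# The official character name also says that it is a combining mark.
print(unicodedata.name(accent))
```

The non-zero combining class is what tells software that the mark belongs to the character before it rather than standing alone.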

Unicode was started by a separate consortium. Around 1990, the consortium and ISO realized that two different unified character sets were not what the world needs, so they joined efforts and worked together on creating a single code table.
