1·No longer can the program assume that one byte is one character, so all data has to be decoded from UTF-8 and encoded back to UTF-8.
程序不再假设一个字节就是一个字符,因此所有的数据都需要从 UTF-8 进行解码,然后再重新编码成 UTF-8。
2·Each byte of a UTF-8 stream or sequence is unambiguous.
UTF - 8流或序列中的每个字节都是明确的。
3·When the database is UTF-8, the data can be manipulated in the database code page.
当数据库是UTF - 8数据库时,可以在数据库代码页中操纵数据。
4·For multilingual applications, usually we pick UTF-8 as the character set.
对于多语言应用程序,我们通常选取utf - 8作为字符集。
5·Note the specification of UTF-8 as the codeset for the database.
注意,该语句指定 UTF-8 作为这个数据库的编码集。
6·This is acceptable, but UTF-8 support MUST be possible.
这是允许的,但是必须能够支持 UTF-8。
7·Every new protocol, application, and document should use UTF-8.
每种新的协议、应用程序和文档都应该使用UTF - 8。
8·Often these are UTF-8 encoded characters, which has its own complications; see the UTF-8 discussion later in this article.
通常这些都是UTF - 8编码的字符,有其自身的复杂性;可以参考本文后面关于UTF - 8的讨论。
9·These additional characters, also called supplementary characters, can be represented in UTF-8 using 4 bytes.
这些额外字符也称为补充字符,可以在utf - 8中用4字节表示。
10·In theory, you can use other encodings than UTF-8, but UTF-8 creates less trouble than other systems.
理论上,可以使用除utf - 8外的其他编码,但比起其他系统,使用UTF - 8麻烦要少得多。