Re: wchar_t and printf not working
On Mon, 28 Mar 2005 11:36:10 +0400 Alexey Proskuryakov <ap-carbon@rambler.ru> wrote:
Each character may occupy between 1 and 6 bytes [1].
More precisely, between 1 and 4: <http://www.unicode.org/faq/utf_bom.html#30>.
At the risk of being pedantic, that FAQ entry is only talking about how to convert a UTF-16 character into a UTF-8 one. Because UTF-16 with surrogates can represent only 21 bits of the Unicode code space, 4 bytes are enough to encode any such character in UTF-8.

But UTF-8 as originally defined can encode the full 31-bit code space, which needs at most 6 bytes per character. Unless you're doing Klingon, you'll never actually see more than 4.

Mike

--
IRC - where men are men, women are men, and the boys are FBI agents.
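For anyone who wants to check the byte counts discussed above, here is a minimal sketch in C. The function names `utf8_len_31bit` and `utf8_len_unicode` are made up for illustration and are not part of any library. The first follows the original 31-bit UTF-8 layout (RFC 2279); the second assumes you only care about the range UTF-16 can reach, U+0000 through U+10FFFF. Neither rejects surrogate code points, which a real encoder probably should.

```c
#include <stdint.h>

/* Hypothetical helper: number of bytes needed to encode code point cp
 * under the original 31-bit UTF-8 layout (RFC 2279).
 * Returns 0 if cp does not fit in 31 bits. */
int utf8_len_31bit(uint32_t cp)
{
    if (cp < 0x80u)       return 1;  /* up to 7 bits  */
    if (cp < 0x800u)      return 2;  /* up to 11 bits */
    if (cp < 0x10000u)    return 3;  /* up to 16 bits */
    if (cp < 0x200000u)   return 4;  /* up to 21 bits */
    if (cp < 0x4000000u)  return 5;  /* up to 26 bits */
    if (cp < 0x80000000u) return 6;  /* up to 31 bits */
    return 0;                        /* out of range  */
}

/* Hypothetical helper: same count, but limited to the range that
 * UTF-16 with surrogates can represent (U+0000..U+10FFFF).
 * Returns 0 for anything above U+10FFFF.
 * Note: surrogate code points (U+D800..U+DFFF) are not rejected here. */
int utf8_len_unicode(uint32_t cp)
{
    return cp <= 0x10FFFFu ? utf8_len_31bit(cp) : 0;
}
```

With these, `utf8_len_unicode` never returns more than 4, while `utf8_len_31bit` returns 5 or 6 only for values above 0x1FFFFF, which is the distinction being drawn in this thread.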
Participants (1): Michael B Allen