Is Apple's UTF-8 decoder safe?
Is Apple's UTF-8 decoder safe?
- Subject: Is Apple's UTF-8 decoder safe?
- From: Malte Tancred <email@hidden>
- Date: Thu, 2 May 2002 11:17:29 +0200
Hi!
I found a FAQ about UTF-8 and Unicode written by Markus Kuhn.
http://www.cl.cam.ac.uk/~mgk25/unicode.html
Markus has compiled "a systematic collection of malformed and
overlong UTF-8 sequences" which can be found here:
http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt
When viewing this stress test file in eg TextEdit or Terminal
it appears that the UTF-8 decoding taking place isn't as "safe"
as it should be, according to Markus.
For example, all overlong representations of the ascii slash
(/) character are all rendered as a slash and are not replaced.
(test 4.1 in the file).
There are several other tests that fail as well.
Any comments?
Cheerio,
Malte
--
Malte Tancred
Computer programmer, Oops AB, Sweden
mailto:email@hidden
http://www.oops.se/
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives:
http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.