Re: [2.6 patch] UTF-8 fixes in comments
From: H. Peter Anvin
Date: Tue Apr 29 2008 - 15:34:25 EST
Alan Cox wrote:
>> Funny that you mention Windows. Windows has been using 16-bit unicode
>> for a long time without problems. It's a clean encoding. Like it or not.
>
> I would describe the UCS-2 situation as a disaster area - embedded nuls
> causing breakage, inability to represent the full unicode space and
> awkward programming interfaces.

Not to mention the fact that UCS-2 ran out of code points almost as soon
as they said "no more code points." The result was UTF-16, a hideous
abortion which took all the problems with wide encodings, combined them
with all the problems of multibyte encodings, and added a few new ones
for good measure.
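
To make the complaints above concrete, here is a small Python sketch. It
is only an illustration under stated assumptions: the helper name
utf16_units and the sample characters are hypothetical choices, not
anything from this thread or from kernel code. It shows the embedded NUL
bytes a 16-bit encoding puts into plain ASCII text, and the surrogate
pair UTF-16 needs for a code point that UCS-2 cannot represent.

```python
# Illustrative sketch only: the helper name and sample strings are assumptions.

def utf16_units(text):
    """Return the 16-bit code units of text encoded as UTF-16LE (no BOM)."""
    raw = text.encode("utf-16-le")
    return [int.from_bytes(raw[i:i + 2], "little") for i in range(0, len(raw), 2)]

# Problem 1: wide encodings embed NUL bytes in ordinary ASCII text, so
# byte-oriented C string handling stops at the first 0x00.
ascii_wide = "A".encode("utf-16-le")    # two bytes, the second is NUL
ascii_utf8 = "A".encode("utf-8")        # one byte, no NUL

# Problem 2: U+1D11E (MUSICAL SYMBOL G CLEF) is above U+FFFF, so a fixed
# 16-bit unit cannot hold it. UTF-16 spends two units (a surrogate pair),
# which makes it a variable-width encoding after all.
clef = "\U0001D11E"
clef_units = utf16_units(clef)

print(ascii_wide, ascii_utf8)
print([hex(u) for u in clef_units])
```

One character turning into two 16-bit units is exactly the multibyte
problem, on top of the NUL bytes that the wide encoding already had.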
-hpa