Pascal Goguey <pascal@xxxxxxxxxx> wrote: > Unicode spans actually (at least the last version I have seen a few > months ago, > 3,2??) between 0 and approximately 110000 (roughly 20+ bits). > The characters are under 2^20, and the extra 10000 are characters to > be > used > privately. They are unassigned, and any application can set up its > own > control > characters. That's basically the idea. There are other private areas > under 2^20 > and even under 2^16, this one I know: from E000 to F8FF. > For more info, there is a Unicode home page. > As for the 4 bytes, maybe you were talking about UTF32, not Unicode. Sorry, I didn't want to give the impression that all possible 4 byte values are actually assigned characters. I just wanted to point out that UTF-8 is not restricted to 4 bytes anymore, and that UCS-16 cannot represent all valid characters. Thanks for the info anyway! :-) Bye, Axel.