On Tue, Dec 07, 2004 at 01:45:57PM +1300, Jeff McClintock wrote: > Handling UTF-8 multibyte characters is a big inefficient pain. Some > characters are 8 bits, some are extended to multiple bytes. Some > strings have mixtures of single-byte and multi-byte chracters Any kind > of string manipulation is complicated by having to scan the string from > the beginning so as to avoid chopping a multi-byte character in two. > Novice programmers ignore the multi-byte characters and treat the string > as ASCII, leading to internationalization bugs. > I believe most programmers will find it easier to deal with > fixed-width strings using the ANSI wchar_t datatype. I'm no localization expert. So I'll defer on arguing this any further. I don't want to make string handling a pain in the ass, but obviously it will be for someone if it is not for someone else :) Anyone else? Maybe the requirement should be more vague, and punt the decision to the spec? ---------------------------------------------------------------------- Generalized Music Plugin Interface (GMPI) public discussion list Participation in this list is contingent upon your abiding by the following rules: Please stay on topic. You are responsible for your own words. Please respect your fellow subscribers. Please do not redistribute anyone else's words without their permission. Archive: //www.freelists.org/archives/gmpi Email gmpi-request@xxxxxxxxxxxxx w/ subject "unsubscribe" to unsubscribe