[openbeos] Re: StyledEdit news of various interest

> Andrew Bachmann wrote on Tue, 15 Jul 2003 23:01:54 -0700 PDT:
> > As part of my effort to move towards better support for encodings in 
> > general I tried to 
> > create an abstraction to manage encodings.  [...]
> 
> Sounds familiar.  MDR's BeMail has a table of Human readable character set 
> names and some 
> rules for encoding them into e-mail (7bit, 8bit, quoted printable etc).  I 
> also had to add a wrapper 
around the UTF-8 conversion functions to add US-ASCII and UTF-8 (UTF-8 to UTF-8 
conversion) character sets.
> 
> - Alex
> 

Yes this is part of the reason why I spent some time trying to do it "right".  
Also in this camp is 
Net+ and perhaps some instant messaging folks.  Anything that deals with text 
that may have 
come from a different OS.  Usually it's over the network but sometimes through 
the filesystem. 
(or both :-) )

IANA has an entry for ASCII although I didn't put it in the set of supported 
character sets.  As 
you noted, there isn't a convert_to_utf8(B_ASCII....) or vice versa.  The IANA 
doesn't have very 
much meta-information about character sets.  For example, it doesn't even list 
if they are multi-
byte.  It does list the MIME names for the character sets, which I thought 
would be useful for 
mail.  Plenty of encodings don't have MIME standard names though.

Also, for my own selfish reasons I want to add support for the Big-5 and EUC-CN 
encodings.  It's 
my understanding that the yellowtab tree has an expanded set of conversions 
available in 
UTF8.h  Should we try to find out what these are so our constants can share 
values?  [this seems 
more important since the encoding is stored as a number in an attribute on 
saved text files]

Andrew


Other related posts: