[Ilugc] Re: Character encoding issues
- From: masatran@xxxxxxxxxxxxxxxxxxx (Masatran, R. Deepak)
- Date: Thu Jul 3 23:41:38 2008
* Masatran, R. Deepak <masatran@xxxxxxxxxxxxxxxxxxx> 2008-07-03
I just did
tail /usr/share/dict/words
in Debian Etch, but I got some unreadable characters. The problem is not
with my terminal (xfce4-terminal) as it handles even Tamil script. The file
is probably encoded in ISO-8859-1. So how do I cat it? I don't want to
change my locale, which is:
export LANG='en_IN.UTF-8'
Let me ask a simple question: How to convert a file from ISO-8859-1 to UTF-8
on the command line? Is there some command like:
transcode --input-encoding ISO-8859-1 --output-encoding UTF-8 <
/usr/share/dict/words > dict.utf8
--
Masatran, R. Deepak <
http://research.iiit.ac.in/~masatran/>
Other related posts: