[liblouis-liblouisxml] Problems with de-de-g0.utb

  • From: Christian Egli <christian.egli@xxxxxxxx>
  • To: liblouis-liblouisxml@xxxxxxxxxxxxx
  • Date: Mon, 27 Oct 2008 17:44:43 +0100

Hi all

I'm trying to use liblouis to translate some german text to Braille. Now
if I use umlauts (i.e. ü) I get some unexpected results:

$ echo 'hübsch' | lou_translate -f de-de-g0.utb
h$"a'\x00bc'bsch

The table de-de.dis defines ü properly as \X00FC in the following line:

display \X00FC  234568 # 252ü Latin small letter u with diaeresis

The table de-de-g0.utb defines it again as follows:

uplow \x00DC\x00FC 1256                         # u with dieeresis

which seems correct to me.

If I put the word 'hübsch' (= 'pretty') into a utf8 encoded file
(umlaut-test) I get above result

$ lou_translate -f de-de-g0.utb < umlaut-test
h$"a'\x00bc'bsch

However if I encode the file with latin-1 I get the result I expect:

$ lou_translate -f de-de-g0.utb < umlaut-test2
h8bsch

How can I translate utf8 content?

Regards
Christian Egli
-- 
Christian Egli
Swiss Library for the Blind and Visually Impaired
Grubenstrasse 12, CH-8045 Zürich, Switzerland

For a description of the software and to download it go to
http://www.jjb-software.com

Other related posts:

  • » [liblouis-liblouisxml] Problems with de-de-g0.utb