[liblouis-liblouisxml] Re: [liblouis] r715 committed - the last batch of files converted to utf-8.

  • From: Mesar Hameed <mesar.hameed@xxxxxxxxx>
  • To: liblouis-liblouisxml@xxxxxxxxxxxxx
  • Date: Tue, 3 Jul 2012 16:33:18 +0100

Hi Vic,

On Tue 03/07/12,11:03, Vic Beckley wrote:
> I have found that in the last batch of files that you converted it seems
> that some of the single characters were converted into two-character
> representations.
> For example, on line 411 of the cy-cy-g1.utb table is the
> line for defining the trademark symbol. In the UTF-8 file from revision 715
> it looks like "sign a^" and then the dot pattern. Why would the single
> trademark symbol be converted into a two-character representation?

Yes, from latin1, to their equivilent utf8 representation.
What we are asking John to do is to allow compileTable.c to accept utf8 (which 
may be several bytes long for each character), rather than 
having to write the unicode codepoint as \xhhhh

As you found out in your other email these are currently broken, as i pointed 
out when i committed, they need either to be rewritten as \xhhhh 
or the opcodes need to accept utf8 operands.
You will not get valid output from these opcodes until they are fixed, by one 
of the above solutions.

> I'm sorry if my understanding is wrong and, if so, please correct me.

No, you are quite right, it also helps me to check my understanding when I have 
to explain :)

Thanks,
Mesar
For a description of the software, to download it and links to
project pages go to http://www.abilitiessoft.com

Other related posts: