[brailleblaster] Re: Translation of character entities

  • From: Keith Creasy <kcreasy@xxxxxxx>
  • To: "brailleblaster@xxxxxxxxxxxxx" <brailleblaster@xxxxxxxxxxxxx>
  • Date: Fri, 16 Mar 2012 10:26:23 -0400

OK, thanks guys. I think I got this one but now for another question.

in the char defs the char 0x000a (a new line char) is categorized as a space. 
It seems like liblouis is always translating this char to a space, in other 
words stripping out line feed chars and replacing them with a blank space. Is 
this indeed normal behavior? 

Where should I look to change it so that new-line chars are preserved?


Thanks.

Keith

-----Original Message-----
From: brailleblaster-bounce@xxxxxxxxxxxxx 
[mailto:brailleblaster-bounce@xxxxxxxxxxxxx] On Behalf Of Michael Whapples
Sent: Thursday, March 15, 2012 12:10 PM
To: brailleblaster@xxxxxxxxxxxxx
Subject: [brailleblaster] Re: Translation of character entities

I think the hard part is actually getting BrailleBlaster to offer a user 
friendly way of editing this stuff.

Michael Whapples

-----Original Message-----
From: John J. Boyer
Sent: Wednesday, March 14, 2012 1:03 PM
To: brailleblaster@xxxxxxxxxxxxx
Subject: [brailleblaster] Re: Translation of character entities

This has worked from the beginning. What you may have to do is define some of 
the characters in liblouis tables. The hard part of that will be deciding what 
dot patterns to use. There are some common Unicode characters for which I don't 
know the correct dot patterns after years of searching. Good luck with the 
uncommon ones.

John

On Wed, Mar 14, 2012 at 08:22:49AM -0400, Keith Creasy wrote:
> OK, that sounds good. Are you saying that it "will" do this at some 
> future date or that it can be done now? Is this something we may need to work 
> on?
>
> It isn't critical but I'm trying to develop a mental catalog of what 
> remains to be done.
>
> Keith
>
>
> -----Original Message-----
> From: brailleblaster-bounce@xxxxxxxxxxxxx
> [mailto:brailleblaster-bounce@xxxxxxxxxxxxx] On Behalf Of John J. 
> Boyer
> Sent: Tuesday, March 13, 2012 3:48 PM
> To: brailleblaster@xxxxxxxxxxxxx
> Subject: [brailleblaster] Re: Translation of character entities
>
> These entities will be converted to UTF-8 by the xml parser.
> liblouisutdml will then convert them to UTF-16 or UTF-32, depending on 
> how it was compiled. This is not important. liblouisutdml then passes 
> them  to liblouis for translation. The liblouis table used must 
> contain a character definition such as
>
> sign \x1234 16-23
>
> This is a made-up example. the word sign indicates what kind of 
> character it is. the \x gives the hex value the numbers in the third 
> column give the dot pattern. If liblouis does not have a definition 
> for a particular character it will place it in the output in the form 
> '\xhhhh' so the user can look it up and supply a definition.
>
> John
>
> On Tue, Mar 13, 2012 at 03:35:34PM -0400, Keith Creasy wrote:
> > Guys.
> >
> > What is the best mechanism for correcting the translation of special 
> > character entities, such as &#2220;, so the such items in the 
> > NIMAS/DTBook file come out right?
> >
> > I'm running files through and letting others here take a look and 
> > the results are pretty good so far.
> >
> > Thanks.
> >
> > Keith
> >
>
> --
> My websites:
> GodTouches Digital Ministry, Inc. http://www.godtouches.org 
> Abilitiessoft, Inc. http://www.abilitiessoft.com
> Location: Madison, Wisconsin, USA
>

--
John J. Boyer; President, Chief Software Developer Abilitiessoft, Inc.
http://www.abilitiessoft.com
Madison, Wisconsin USA
Developing software for people with disabilities




Other related posts: