[liblouis-liblouisxml] Re: 2 questions: 1. charsets for output, and 2. Python bindings for hyphenation

  • From: "John J. Boyer" <john.boyer@xxxxxxxxxxxxxxxxx>
  • To: liblouis-liblouisxml@xxxxxxxxxxxxx
  • Date: Fri, 12 Feb 2010 09:30:43 -0600

Instead of adding the display lines to your translation table 9it would 
be better to put them in a separate file, perhaps called unicode.dis and 
then use a table list with this file as the first member. I may put this 
file in the tables directory in svn. Page break, page end and linefill 
can all be set with configuration files. Look at canonical.cfg. Page 
break line end, etc. can be set with
\xhhhh Line fill will only accept an ASCII character at this point.

John

On Fri, Feb 12, 2010 at 11:37:30AM +0100, Bert Frees wrote:
> If you include following lines in your translation tables (and set 
> outputEncoding=UTF8), xml2brl will output Unicode braille. But some 
> characters are displayed the same as before. For example: the lineFill 
> symbol ('), the page break indicator 
> (-------------------------------------) and the page number prefixes of 
> continued pages (a, b, c, d, e, ...). Is there a solution for this?
> 
> 
> display \x2800 0
> display \x2801 1
> display \x2802 2
> display \x2803 12
> display \x2804 3
> display \x2805 13
> display \x2806 23
> display \x2807 123
> display \x2808 4
> display \x2809 14
> display \x280A 24
> display \x280B 124
> display \x280C 34
> display \x280D 134
> display \x280E 234
> display \x280F 1234
> display \x2810 5
> display \x2811 15
> display \x2812 25
> display \x2813 125
> display \x2814 35
> display \x2815 135
> display \x2816 235
> display \x2817 1235
> display \x2818 45
> display \x2819 145
> display \x281A 245
> display \x281B 1245
> display \x281C 345
> display \x281D 1345
> display \x281E 2345
> display \x281F 12345
> display \x2820 6
> display \x2821 16
> display \x2822 26
> display \x2823 126
> display \x2824 36
> display \x2825 136
> display \x2826 236
> display \x2827 1236
> display \x2828 46
> display \x2829 146
> display \x282A 246
> display \x282B 1246
> display \x282C 346
> display \x282D 1346
> display \x282E 2346
> display \x282F 12346
> display \x2830 56
> display \x2831 156
> display \x2832 256
> display \x2833 1256
> display \x2834 356
> display \x2835 1356
> display \x2836 2356
> display \x2837 12356
> display \x2838 456
> display \x2839 1456
> display \x283A 2456
> display \x283B 12456
> display \x283C 3456
> display \x283D 13456
> display \x283E 23456
> display \x283F 123456
> display \x2840 7
> display \x2841 17
> display \x2842 27
> display \x2843 127
> display \x2844 37
> display \x2845 137
> display \x2846 237
> display \x2847 1237
> display \x2848 47
> display \x2849 147
> display \x284A 247
> display \x284B 1247
> display \x284C 347
> display \x284D 1347
> display \x284E 2347
> display \x284F 12347
> display \x2850 57
> display \x2851 157
> display \x2852 257
> display \x2853 1257
> display \x2854 357
> display \x2855 1357
> display \x2856 2357
> display \x2857 12357
> display \x2858 457
> display \x2859 1457
> display \x285A 2457
> display \x285B 12457
> display \x285C 3457
> display \x285D 13457
> display \x285E 23457
> display \x285F 123457
> display \x2860 67
> display \x2861 167
> display \x2862 267
> display \x2863 1267
> display \x2864 367
> display \x2865 1367
> display \x2866 2367
> display \x2867 12367
> display \x2868 467
> display \x2869 1467
> display \x286A 2467
> display \x286B 12467
> display \x286C 3467
> display \x286D 13467
> display \x286E 23467
> display \x286F 123467
> display \x2870 567
> display \x2871 1567
> display \x2872 2567
> display \x2873 12567
> display \x2874 3567
> display \x2875 13567
> display \x2876 23567
> display \x2877 123567
> display \x2878 4567
> display \x2879 14567
> display \x287A 24567
> display \x287B 124567
> display \x287C 34567
> display \x287D 134567
> display \x287E 234567
> display \x287F 1234567
> display \x2880 8
> display \x2881 18
> display \x2882 28
> display \x2883 128
> display \x2884 38
> display \x2885 138
> display \x2886 238
> display \x2887 1238
> display \x2888 48
> display \x2889 148
> display \x288A 248
> display \x288B 1248
> display \x288C 348
> display \x288D 1348
> display \x288E 2348
> display \x288F 12348
> display \x2890 58
> display \x2891 158
> display \x2892 258
> display \x2893 1258
> display \x2894 358
> display \x2895 1358
> display \x2896 2358
> display \x2897 12358
> display \x2898 458
> display \x2899 1458
> display \x289A 2458
> display \x289B 12458
> display \x289C 3458
> display \x289D 13458
> display \x289E 23458
> display \x289F 123458
> display \x28A0 68
> display \x28A1 168
> display \x28A2 268
> display \x28A3 1268
> display \x28A4 368
> display \x28A5 1368
> display \x28A6 2368
> display \x28A7 12368
> display \x28A8 468
> display \x28A9 1468
> display \x28AA 2468
> display \x28AB 12468
> display \x28AC 3468
> display \x28AD 13468
> display \x28AE 23468
> display \x28AF 123468
> display \x28B0 568
> display \x28B1 1568
> display \x28B2 2568
> display \x28B3 12568
> display \x28B4 3568
> display \x28B5 13568
> display \x28B6 23568
> display \x28B7 123568
> display \x28B8 4568
> display \x28B9 14568
> display \x28BA 24568
> display \x28BB 124568
> display \x28BC 34568
> display \x28BD 134568
> display \x28BE 234568
> display \x28BF 1234568
> display \x28C0 78
> display \x28C1 178
> display \x28C2 278
> display \x28C3 1278
> display \x28C4 378
> display \x28C5 1378
> display \x28C6 2378
> display \x28C7 12378
> display \x28C8 478
> display \x28C9 1478
> display \x28CA 2478
> display \x28CB 12478
> display \x28CC 3478
> display \x28CD 13478
> display \x28CE 23478
> display \x28CF 123478
> display \x28D0 578
> display \x28D1 1578
> display \x28D2 2578
> display \x28D3 12578
> display \x28D4 3578
> display \x28D5 13578
> display \x28D6 23578
> display \x28D7 123578
> display \x28D8 4578
> display \x28D9 14578
> display \x28DA 24578
> display \x28DB 124578
> display \x28DC 34578
> display \x28DD 134578
> display \x28DE 234578
> display \x28DF 1234578
> display \x28E0 678
> display \x28E1 1678
> display \x28E2 2678
> display \x28E3 12678
> display \x28E4 3678
> display \x28E5 13678
> display \x28E6 23678
> display \x28E7 123678
> display \x28E8 4678
> display \x28E9 14678
> display \x28EA 24678
> display \x28EB 124678
> display \x28EC 34678
> display \x28ED 134678
> display \x28EE 234678
> display \x28EF 1234678
> display \x28F0 5678
> display \x28F1 15678
> display \x28F2 25678
> display \x28F3 125678
> display \x28F4 35678
> display \x28F5 135678
> display \x28F6 235678
> display \x28F7 1235678
> display \x28F8 45678
> display \x28F9 145678
> display \x28FA 245678
> display \x28FB 1245678
> display \x28FC 345678
> display \x28FD 1345678
> display \x28FE 2345678
> display \x28FF 12345678
> 
> 
> 
> On 10/02/2010 23:40, John J. Boyer wrote:
> >On the first question, the display opcode is intended to handle just
> >such differences in embosser (and braille display) character sets.
> >Thered are several files intended for this purpose in the tables
> >directory. They have the extension .dis . The most useful one is named
> >something like en-us-brf.dis and converts the lowercase letters usually
> >produced by liblouis to upp-er-case letters for devices that injsist on
> >them.
> >
> >Bindiings for lou_hyphenate would be nice, but the function only returns
> >a string indicating where words could be hypyhenated. To use it you need
> >a formatter like liblouisxml, which does handle text as well as xml.
> >
> >If you want actual Unicode dot patterns from liblouis youu can use the
> >dotsIO mode. /this is defined in liblouis.h . You should use the name.
> >It can be ored with other mode bits.
> >
> >John
> >
> >On Wed, Feb 10, 2010 at 11:12:14PM +0100, Samuel Thibault wrote:
> >   
> >>Leo, le Wed 10 Feb 2010 22:36:01 +0100, a ?crit :
> >>     
> >>>1. When translating texts with different tables for different languages, 
> >>>it
> >>>appears that I have to switch to specific char sets at my embosser to 
> >>>print
> >>>the results properly. I find this misfortunate. Shouldn't ideally all 
> >>>tables
> >>>work on the same charset (say, Braille unicode)
> >>>       
> >>Or ISO TR_11548-1, which just encodes the lower 8 bits of the unicode
> >>coding.
> >>
> >>     
> >>>to allow simple back-ends translating unicode to the respective char
> >>>set of the embosser?
> >>>       
> >>That's how I would see it work, yes (that's how brltty works).
> >>
> >>Samuel
> >>For a description of the software and to download it go to
> >>http://www.jjb-software.com
> >>     
> >   
> 
> For a description of the software and to download it go to
> http://www.jjb-software.com

-- 
John J. Boyer; President, Chief Software Developer
Abilitiessoft, Inc.
http://www.abilitiessoft.com
Madison, Wisconsin USA
Developing software for people with disabilities

For a description of the software and to download it go to
http://www.jjb-software.com

Other related posts: