[liblouis-liblouisxml] Translating PDF to Braille

  • From: Michael Whapples <mwhapples@xxxxxxx>
  • To: liblouis-liblouisxml@xxxxxxxxxxxxx
  • Date: Thu, 20 May 2010 16:54:22 +0100

Hello,
I have to be honest: although I am asking this, I am not optimistic that a great solution exists. How might one go about translating a PDF to Braille?

One approach I am considering is to use a tool to convert the file to some sort of XML (e.g. pdftohtml, which I think has an option for XML output). This might introduce errors: I haven't really found anything which extracts the text both well and reliably, so any suggestions of applications would be welcome. We should then be able to run liblouisxml on the XML, although this may require some configuration files.
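As a rough sketch of that two-step idea, the Python below builds and runs the two command lines. It assumes poppler's pdftohtml (with its -xml switch) and liblouisxml's xml2brl tool (with -f naming a configuration file) are installed and on the PATH. The function names, the .brl extension and the configuration file are illustrative only, and the assumption that pdftohtml appends ".xml" to the output base name should be checked against the installed version.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_path, work_dir, config_file=None):
    """Build the two command lines: PDF -> XML, then XML -> Braille."""
    pdf = Path(pdf_path)
    base = Path(work_dir) / pdf.stem      # output base name, no extension
    xml_file = str(base) + ".xml"
    brl_file = str(base) + ".brl"         # extension chosen for illustration

    # Assumption: pdftohtml -xml appends ".xml" to the base name it is given.
    to_xml = ["pdftohtml", "-xml", str(pdf), str(base)]

    # xml2brl is liblouisxml's command-line translator; -f selects a
    # configuration file (one suited to pdftohtml's XML would have to be written).
    to_brl = ["xml2brl"]
    if config_file is not None:
        to_brl += ["-f", str(config_file)]
    to_brl += [xml_file, brl_file]
    return to_xml, to_brl


def pdf_to_braille(pdf_path, work_dir, config_file=None):
    """Run both steps, stopping at the first command that fails."""
    for command in build_commands(pdf_path, work_dir, config_file):
        subprocess.run(command, check=True)
```

Keeping the command construction separate from the execution makes it easy to print the commands and try them by hand before automating anything.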

Unfortunately, I think all the PDFs I am dealing with are untagged and will probably need a lot of work to get anything sensible out of them (e.g. the source PDFs have margin notes, diagrams with embedded text labels, etc., which lead to lots of stray text appearing in the main text flow).
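One possible way to reduce the stray text, sketched below in Python, is to filter the intermediate XML by position before translation. It assumes the XML is in the form pdftohtml -xml produces, where each text element carries left and width attributes in page units. The sample document, the function name and the column limits are made up for illustration. This is only a heuristic: it would drop margin notes that sit outside the main column, but not diagram labels that fall inside it.

```python
import xml.etree.ElementTree as ET

# Made-up sample in the style of pdftohtml -xml output.
SAMPLE = """<pdf2xml>
<page number="1" height="1188" width="918">
<text top="100" left="120" width="400" height="16" font="0">First line of body text.</text>
<text top="102" left="700" width="150" height="12" font="1">Margin note</text>
<text top="120" left="120" width="400" height="16" font="0">Second <b>line</b>.</text>
</page>
</pdf2xml>"""


def body_lines(xml_string, column_left, column_right):
    """Return the text of elements lying wholly inside the main column."""
    root = ET.fromstring(xml_string)
    kept = []
    for element in root.iter("text"):
        left = int(element.get("left"))
        right = left + int(element.get("width"))
        if column_left <= left and right <= column_right:
            # itertext() also collects text nested in inline tags such as <b>.
            kept.append("".join(element.itertext()))
    return kept


if __name__ == "__main__":
    for line in body_lines(SAMPLE, 100, 600):
        print(line)
```

The column limits would have to be found by inspecting each document, since untagged PDFs give no structural hint about which text belongs to the main flow.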

If anyone has any views or suggestions on this, or even knows of a working solution, I would be grateful.

Thank you

Michael Whapples
For a description of the software and to download it go to
http://www.jjb-software.com
