Re: PDF generated by LaTeX

  • From: Don Marang <donald.marang@xxxxxxxxx>
  • To: programmingblind@xxxxxxxxxxxxx
  • Date: Sat, 06 Nov 2010 13:14:31 -0400

There are utilities in Ubuntu, such as pandoc, that convert between
LaTeX, ePub, HTML, and other markup and markdown formats.  Pandoc is
cross-platform and is available free for Windows as well.  Do you have
access to the LaTeX source of the documents, or just the PDF?  There
are many ways to take structured text in formats such as Word and
produce a structured PDF.  However, I have not seen any utility that
retains the structure when extracting text from a PDF.  By design, PDF
was meant to be a one-way format.  If anyone has found a way to do this
without resorting to OCR, I would be interested.
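
If the LaTeX source does turn out to be available, here is a rough
sketch of driving pandoc from a Python script.  The helper names
(build_pandoc_command, convert) and the file names are made up for
illustration, and it assumes pandoc is installed and on the PATH.  The
-f, -t, -s, and -o options are pandoc's input format, output format,
standalone, and output file options.

```python
import shutil
import subprocess


def build_pandoc_command(src, dst, to_format="html"):
    """Return the pandoc argument list for converting a LaTeX file.

    src and dst are hypothetical file paths chosen by the caller.
    """
    return ["pandoc", src, "-f", "latex", "-t", to_format, "-s", "-o", dst]


def convert(src, dst, to_format="html"):
    """Run pandoc if it is installed; return True only on success."""
    if shutil.which("pandoc") is None:
        return False  # pandoc was not found on the PATH
    result = subprocess.run(build_pandoc_command(src, dst, to_format))
    return result.returncode == 0
```

For example, convert("paper.tex", "paper.html") would try to write a
standalone HTML page, which a screen reader generally handles better
than text pulled out of a PDF.  Complex macros or unusual packages may
not convert cleanly, so treat the result as a starting point.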

Don Marang
There is just so much stuff in the world that, to me, is devoid of any
real substance, value, and content that I just try to make sure that I
am working on things that matter. Dean Kamen

On 11/6/2010 10:22 AM, QuentinC wrote:
> Do I really need to go through OCR? Is it the only good solution?
> I have OmniPage 11... but I find it a bit strange, since LaTeX is
> basically text, there should be a shorter way.
> And OCR makes many errors...
>
> Or do you know of a delatexizer program? It may be useful...
>
>
> __________
> View the list's information and change your settings at
> //www.freelists.org/list/programmingblind
>