blind_html [Fwd: PDF2TXT 3.3 released]

  • From: Nimer <nimerjaber1@xxxxxxxxx>
  • To: blind_html@xxxxxxxxxxxxx
  • Date: Sat, 04 Apr 2009 09:43:04 -0600

"every time I say something they find hard to hear
they chalk it up to my anger
and never to their own fear"
Ani Difranco: I'm Not A Pretty Girl 1995

Nimer M. Jaber

The information transmitted is intended only for the person or entity to which 
is addressed and may contain confidential and/or privileged material. Any 
retransmission, dissemination or other use of, or taking of any action in 
upon this information by persons or entities other than the intended recipient 
prohibited. If you received this in error, please contact the sender via reply 
e-mail, and delete the
material from any computer.

(720) (251-4530)

-------- Original Message --------
Subject:        PDF2TXT 3.3 released
Date:   Sat, 4 Apr 2009 03:54:56 -0500 (EST)
From:   Jamal Mazrui <empower@xxxxxxxxx>
Reply-To:       programmingblind@xxxxxxxxxxxxx
To: GUISpeak@xxxxxxxxxxxxx, ProgrammingBlind@xxxxxxxxxxxxx, Program-L@xxxxxxxxxxxxx, UAccess-L@xxxxxxxxxxxxxxx

PDF2TXT now uses an updated version of the QuickPDF library, which is
commercially available at

Several years worth of fixes and enhancements are incorporated in this
library version compared to the previous one used.  Although the source
code to this and other tools used by PDF2TXT is not available, its own
source code in the PowerBASIC language, PDF2TXT.bas, is now installed
along with the executable.

As before, the Image Format checkbox (Alt+F) is available for optical
character recognition (OCR) -- using Google Tesseract technology -- on
image-based PDFs that elude text extraction methods.  Due to technical
issues, there is not a simple way of aborting an OCR process that has
already started.  This is now possible, however, via a work-around of
launching another copy of PDF2TXT.  It clears the deck during its startup
phase.  The Quit button (Alt+Q) may then be invoked to close either copy
of the program (though no harm results from both being loaded).

As Before the Grab URL button (Alt+G) gets the address of the current web
page in Internet Explorer and sets it as the PDF source.  This now works
with versions of Internet Explorer above 6.0.  The feature makes it easy
to download and convert all PDFs linked to a web page.

An additional hotkey is introduced:  F11 for Elevate Version (like the
EdSharp and FileDir programs).  This checks whether a newer version of
PDF2TXT is available, and offers to install it.  The command makes future
updates to the program particularly convenient to obtain.

As before, the Extra HTML checkbox (Alt+X) uses a different conversion
technology to produce a .htm conversion in addition to the .txt one.  To
further increase conversion options via this checkbox, it now also causes
another technology to be used for producing the .txt file, using the
PDFToText.exe utility that is also seperately available at
Thus PDF2TXT now incorporates three different .txt conversion methods, a
.htm method, and an OCR one -- all of which are possible in a batch mode
that processes every PDF in a directory.  The program has become the most
capable, free converter of PDFs available on Windows!


View the list's information and change your settings at

To unsubscribe, please send a blank email to
with unsubscribe in the subject line.
To access the archives, please visit:


Other related posts:

  • » blind_html [Fwd: PDF2TXT 3.3 released] - Nimer