Re: pdf to txt

  • From: "RaZorbacK" <lists_en@xxxxxxxxxxxxxx>
  • To: jfw@xxxxxxxxxxxxx
  • Date: Thu, 16 Mar 2006 21:55:13 +0100

hello Eileen,

You wrote:

>...not find the window that comes up with the trial version to be
>accessible.  With sighted help, I got it set up and it does do a
good
>job at converting the .pdf to .txt.  
>>
hmm i've just installed it, and it's definetely not accessible and
I'm not even sure that scripts are possible for this app since the
menu is represented as an image .jpg file. So, through windows, it
seems impossible for us without sighted help.
But here are the good news, it's usable through command line and it
works perfectly!
quoting the help file:
" Command line usage:

PDF2TXT <input PDF file> [output TXT file] [-logfile] [-open]
[-space] [-html] [-format] [-silent] [-blankline] [-summary] [-zoom
<num>] [-?] [-h]
 <input PDF file> : Open an existing PDF file to convert.
 [output TXT file] : Write to TEXT file, the default is same
filename of input PDF file.
 [-first <page number>]: Specify the first page number.
 [-last <page number>]: Specify the last page number.
 [-logfile] : Write log to "C:\pdf2txt.log" file.
 [-open] : Auto open the text file after it be created.
 [-space] : Auto insert spaces into text file.
 [-html] : Output to a HTML file, not a text file.
 [-format] : Keep the page layout in the generated TXT file.
 [-silent] : Disable error and warning messages.
 [-blankline] : Auto delete blank line in the generated TXT file.
 [-summary] : Get PDF document summary.
 [-zoom <num>] : Set zoom ratio, the range is from 50 to 200.
 [-?] : Help.
 [-h] : Help.
 For example:
 C:\>PDF2TXT C:\input.pdf
 C:\>PDF2TXT C:\input.pdf -first 10 -last 12
 C:\>PDF2TXT C:\input.pdf C:\output.txt
 C:\>PDF2TXT C:\input.pdf -open -silent -logfile -zoom 150
 C:\>PDF2TXT C:\input.pdf C:\output.txt -open -silent
 C:\>PDF2TXT C:\*.pdf
 C:\>PDF2TXT C:\*.pdf C:\*.txt
 C:\>PDF2TXT C:\test\*.pdf C:\test\*.txt

 Please notice, if your input pdf's filename or output text's
filename contain space(s), you must use quotation mark to include
them, for example,
 C:\>PDF2TXT "C:\123 456.pdf" "C:\my test\aaa.txt""
I've tried without any problem except that the trial version converts
only the first page of your file and can be run only 50 times. Ok, I
agree, it's not so straightforward but it makes the trick quite well
and this huge adobe reader is not required.
For my ocasionnal pdf converting, I prefer using my OCR program.
I'm not a command line guru but feel free to drop a line if needed.
HTH,


--
To post a message to the list, send it to jfw@xxxxxxxxxxxxx
To unsubscribe from this mailing list, send a message to 
jfw-request@xxxxxxxxxxxxx with the word unsubscribe in the subject line.
Archives located at: //www.freelists.org/archives/jfw

If you have any concerns about the list, post received from the list, or the 
way the list is being run, do not post them to the list. Rather contact the 
list owner at jfw-admins@xxxxxxxxxxxxxx

Other related posts: