Re: another pdf to text problem

  • From: "Bob J." <rjustice004@xxxxxxxxxx>
  • To: <programmingblind@xxxxxxxxxxxxx>
  • Date: Thu, 2 Oct 2008 17:31:06 -0700

Bob,

My experience with this conversion process is that the data is all there,
Just not always in the same sequence as in the original document.  I almost
always find it necessary to edit the .txt file to make it correspond to the
original document.  At least, the .txt file can be accessed and I often
embellish the original text to suit my own requirements for readability.

Best wishes,

Bob


----- Original Message ----- 
From: "programming" <rproglock@xxxxxxx>
To: <programmingblind@xxxxxxxxxxxxx>
Sent: Thursday, October 02, 2008 1:51 PM
Subject: Re: another pdf to text problem


Hi,

Thanks for the help in getting the pdf file converted. Your advice worked.
However, when the conversion was completed, the txt file was not right.

Anyway, thanks for the help.

Bob

  ----- Original Message ----- 
  From: InthaneElf
  To: programmingblind@xxxxxxxxxxxxx
  Sent: Wednesday, October 01, 2008 10:33 PM
  Subject: Re: another pdf to text problem


  ah, then you might wish to hit f11 to get the latest copy of PDF2TXT, and
it will be in the main window, just tab across it and there will be a check
box which says "image format"  check it and run the translation, if it can
handle it it will, but keep in mind that this is going to be a slow process,
if it sits there a while, let it sit, for 15 pages, I'd say wait an hour, if
it has not finished off the work by then, and is not counting pages to you
when you sit in the window for a while, then it choked on it, I have 4
scanned books here that it can't translate, there too big for it *sigh*

  of course there between 200 to 400 pages each so...

  you shouldn't have much trouble with your 15 page document, I was up to
page 40 something when P2T froze on me both times I tried it with one of
those books.

  good luck,
  inthane
  proprietor, The Grab Bag,
  for blind computer users and programmers
  http://grabbag.alacorncomputer.com
  Owner: Alacorn Computer Enterprises
  "own the might and majesty of a Alacorn!"
  www.alacorncomputer.com
  Owner: Agemtree
  "merchants in fine facetted and cabochon gemstones"
  www.agemtree.com
  operator: Fruit Basket Demo Sight, where you can find a similar project
done in several programming languages, along with its source code, so you
can decide what language is right for you
  http://fruitbasketdemo.alacorncomputer.com

    ----- Original Message ----- 
    From: programming
    To: programmingblind@xxxxxxxxxxxxx
    Sent: Wednesday, October 01, 2008 7:50 PM
    Subject: Re: another pdf to text problem


    Hi,

    Could you please tell me how to find the checkbox where I set to OCR?

    This might be a stupid question but I can't find it.

    Thanks for your help.

    Bob

      ----- Original Message ----- 
      From: InthaneElf
      To: programmingblind@xxxxxxxxxxxxx
      Sent: Wednesday, October 01, 2008 8:44 PM
      Subject: Re: another pdf to text problem


      did you check the checkbox for using the OCR function in PDF2TXT? and
then try scanning it, it sounds like this is the age old problem of a
scanned image used to create the .PDF, instead of a text document, and will
require OCR to read it if it's possible to do at all.

      inthane
      proprietor, The Grab Bag,
      for blind computer users and programmers
      http://grabbag.alacorncomputer.com
      Owner: Alacorn Computer Enterprises
      "own the might and majesty of a Alacorn!"
      www.alacorncomputer.com
      Owner: Agemtree
      "merchants in fine facetted and cabochon gemstones"
      www.agemtree.com
      operator: Fruit Basket Demo Sight, where you can find a similar
project done in several programming languages, along with its source code,
so you can decide what language is right for you
      http://fruitbasketdemo.alacorncomputer.com

        ----- Original Message ----- 
        From: programming
        To: programmingblind@xxxxxxxxxxxxx
        Sent: Wednesday, October 01, 2008 2:41 PM
        Subject: another pdf to text problem


        Hi list,

        When I open the listed pdf file into PDF-TO -TEXT, I get the
following message:

        "
        Cannot convert August 2008 Beacon.pdf

        File name=C:\PDF2TXT\PDF\August 2008 Beacon.pdf

        File size=1945487

        Author=Panasonic Communications Co.,LTD.

        Title=Network Scan Data

        Subject=MFP Image Format

        Creator=HPDFlib

        Producer=HPDFlib 1.01(MFP)

        PDF version=1.2

        Page count=15

        Number of form fields=0

        User Password=No

        Master Password=No

        Printing=Fully Allowed

        Changing the Document=Allowed

        Content Copying or Extraction=Allowed

        Authoring Comments and Form Fields=Allowed

        Form Field Fill-in or Signing=Allowed

        Content Accessibility Enabled=Allowed

        Document Assembly=Allowed

        Encryption Level=Blank"




        Is there any way to read this pdf file?

        Jamel, would it be OK for me to send you the file so you can work
with it? If so, what is your email address?


        Thanks for any help you can give me as the file is one of my
churches newsletters.

        Bob

__________
View the list's information and change your settings at 
//www.freelists.org/list/programmingblind

Other related posts: