[bksvol-discuss] Re: paperback tip submitted

  • From: "Pratik patel" <pratikp1@xxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Mon, 3 Jul 2006 16:14:22 -0400

Scott,
 
Scott, this suggestion is meant for all paper-bound editions.  thank you for
pointing out my term flip-flops.  I will go in and correct the tip.
 
Regards,
 
Pratik
 
 
 


  _____  

From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Scott Blanks
Sent: Monday, July 03, 2006 4:07 PM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: paperback tip submitted


Pratik,
 
While reading your tip, I noticed that you used the term "trade paperback".
As I understand it, this refers to a paperbound book which generally is the
same size as the hardcover edition. Strangely, I've never scanned a single
trade paperback--all of my scans have been of mass market editions, books
which are smaller, and very often contain a lower grade of paper. Do you
have any alternate suggestions for mass market books? Or was this suggestion
meant to refer to all paperbound editions?
 
Scott
 
 

----- Original Message ----- 
From: Pratik patel <mailto:pratikp1@xxxxxxxxx>  
To: bksvol-discuss@xxxxxxxxxxxxx 
Sent: Monday, July 03, 2006 12:23 PM
Subject: [bksvol-discuss] paperback tip submitted

Hello all,
 
Following E's suggestion, I've submitted the following paperback settings
tip to jake's site.  HTH
 
Pratik
 

Scanning Paperback Books With Kurzweil 1000

 

If you've ever scanned paperback books-also frequently known as Trade Paper
books, you will know that the print quality on such books can range from
mediocre to excellent.  Sometimes, the quality may even differ within books
as you move from page to page.  The question becomes how you can scan these
paperback books so that your optical character recognition (OCR) results are
consistent, requiring less effort to edit and saving you time.

 

The section below outlines some scanning tips that you can use with Kurzweil
1000 (V. 9 and above) to make sure that you get the best results.

 

There is one thing that you should understand before following these steps:
While a few users have confirmed the findings that are outlined below, the
results often vary based on your scanner and the drivers provided by your
scanner manufacturer.

 

Please also note that you are about to make changes to the default settings
provided by Kurzweil 1000, you should become very comfortable with
Kurzweil's settings.  They provide an extraordinary control over your
scanning, recognition, and reading environment.  The excellent manual
created by Kurzweil Educational Systems (Help/Open the Manual) will enable
you to gain a thorough understanding of each setting, its corresponding
values, and its effects on the environment.

 

Before you begin:

 

Before you begin making changes to your settings, it is always a good idea
to make sure that you have a backup copy of your original settings.  This
will allow you to go back and restore settings if something goes wrong.  Use
the following steps to create a  backup for your settings.

 

1.       If Kurzweil 1000 is not already running, run the program by
pressing CTRL+ALT+k

2.       Go to the "Folder" menu by pressing ALT+l

3.       Use the down arrow key once to get to "New" or press letter n

4.       Kurzweil 1000 will ask you to enter the name of the new folder you
wish to create.  Type "settings" or "backup settings," depending on your
preference.

5.       Press enter.  Your folder will be created under "c:\documents and
settings\user\my documents\Kurzweil educational systems" where "user" is
your login name.  The newly-created folder is located on the same level as
the "general" folder

6.       Once you've completed creating the folder for your backup settings,
go to the "settings" menu by pressing ALT+t

7.       Use your up and down arrow keys to go to the "backup settings" menu
item or press letter b

8.       Kurzweil 1000 will ask you to choose the folder where your backup
settings should be saved.  Get to the list of folders that Kurzweil 1000
presents by pressing the tab and the shift key.  Use the right and left
arrow keys to open and close the tree items.  Use the up and the down arrow
keys to move to previous and next items.  Get to the folder your created to
hold your backup settings and press enter.

 

Now You're ready to make changes to your settings.  There are two series of
steps you'll take and compare results to find the best OCR settings.  Make
sure you have your paperback book handy.

 

1.       If Kurzweil 1000 is not already running, run the program by
pressing CTRL+ALT+k

2.       Go to the "scan" menu by pressing ALT+s

3.       Use your up and down arrow keys to reach the "optimize scan" item
then press enter or press letter s.  You will be in the "optimize scan"
dialog box

4.       You will land on the threshold optimization item.  By default, this
choice is set to "optimize."  If not, press the space bar key until Kurzweil
1000 says "optimize."

5.       Press the tab key to get to "scanner brightness" and press the
space bar key until Kurzweil 1000 says "optimize"

6.       Press the tab key once again to get to "minimum brightness level."
Leave this option to "20," which is automatically entered for you.  Change
it to "20" if the number is something else.

7.       Press the tab key once again to get to "maximum brightness level."
Leave this option to "80," which is automatically entered for you.  Change
it to "80" if the number is something else

8.       Press the tab key to get to "resolution" and press the space bar
key until Kurzweil 1000 says "optimize"

9.       Press the tab key to get to "speckle removal" and press the space
bar key until Kurzweil 1000 says "optimize"

10.   Press the tab key once again to get to "text quality" and press the
space bar key until Kurzweil 1000 says "do not optimize"

11.   Press the tab key to get to "recognition engine" and press the space
bar key until Kurzweil 1000 says "optimize"

12.   Press the tab key until you get to the "OK" button.  Do not press the
space bar key or enter key yet

13.   Take your paperback book and open it to a random page

14.   Place the book on the scanner with only one page on the scanner
platen.  Make sure that both pages are not on the scanner.  You may need to
make sure that you press down on the book spine in order to allow the
scanner to see the entire page

15.   Returning to your keyboard, press the space bar key to start the
optimization process.  You will need to hold the book in position for
several minutes for the optimization process to complete.  The time may vary
depending on your scanner model.  Kurzweil 1000 will announce when the
optimization process is complete.  You can relax and lay your book aside for
a few moments

16.   Press the tab key several times to review the optimization settings
and press enter to accept the settings as suggested by Kurzweil 1000

17.   Go to the "settings" menu by pressing ALT+t

18.   Use the down arrow key until you get to "recognition" item and press
enter or press letter c

19.   you will land on "column identification."  If Kurzweil 1000 says "on,"
press space bar to turn off automatic column recognition

20.   Press the tab key then press the space bar key until Kurzweil 1000
announces "two pages will be recognized per scan."

21.   Once again press the tab key and the space bar key until Kurzweil 1000
announces "recognition of light text on a dark background is disabled."
Unless you plan to scan the book's dust jacket or the front cover, you will
not need to return and turn this feature on for paperbacks

22.   Press the tab key to hear the "Speckle removal" status.  For now you
should leave this setting as is.  If you recall, this was one of the
settings that was optimized by the "optimize scan" tool earlier

23.   Press the tab key several times until you reach the "partial columns"
settings.  Press the space bar key until Kurzweil 1000 announces "ignore"

24.   Press tab to get to "suspicious regions" and press space bar until
Kurzweil 1000 announces "ignored." (K1000 version 10 only)

25.   Press tab to get to "blank pages" and press space bar until Kurzweil
1000 announces "kept"

26.   Leave the recognition engine and the language setting to their
default.  Press enter to accept these changes

27.   Go to the "settings" menu by pressing ALT+t

28.   Press the up or down arrow keys to get to the "save settings" option
and press enter or press letter v

29.   In the resulting dialog box, type "current" and press enter.  The
settings that you have just modified are now saved in a settings file named
"current."  This will give you a chance to retrieve them in the future if
you find that these settings produce the best results

30.   Pick up the paperback book once again and go to a random page.
Preferably, this page should not be the page that was used to create the
optimized scan results earlier

31.   Place the book on your scanner.  This time, you can scan both pages at
once.  Ensure that you hold the spine down to give the scanner a chance to
view all the contents

32.   Press the scan key (F9) to start scanning the page

33.   Once the scan has complete, turn to the next page and continue
scanning, all the time making sure the hold the spine down

34.   Continue this process until you have scanned 5 times.  You should have
scanned a total of ten pages as you were scanning two pages at once

35.   Wait to have the recognition catch up-until you have ten pages of text

36.   Go to the "tools" menu (ALT+o)

37.   Press the up arrow key to get to "rank spelling" and press enter or
press letter I.  Rank spelling will tell you the percentage of spelling
errors it encountered and will also tell you what percent of the
document-the ten pages you scanned-is without errors

38.   Note down the spelling rate percentage that is given by Kurzweil 1000
and press escape to exit the dialog

39.   Close the file by pressing F4, answering no to the save dialog

 

The second set of steps will go through the process of comparing recognition
results based on a series of settings that have been shown to work well with
paperback books most of the time.

 

1.       You should still be in Kurzweil 1000.  The "current" settings saved
earlier should still be loaded.  Go to the "settings" menu by pressing ALT+t

2.       Navigate to the "scanning" item by using the arrow keys and
pressing enter or by pressing letter s

3.       In the resulting dialog box, press tab twice to get to the
"threshold" setting

4.       Press space bar repeatedly until you hear "grayscale"

5.       Press tab three times to get to "scanner resolution."  Press the
space bar key until Kurzweil 1000 announces "400 dots per inch"

6.       Press enter to accept these changes and return to Kurzweil 1000

7.       Return to the "settings" menu by pressing ALT+t

8.       Arrow down to the "recognition" option and press enter or press
letter C

9.       Press the tab key four times until you get to the "speckle removal"
item.  Press space bar until Kurzweil 1000 announces "disabled"

10.   Press the tab key once again to get to "text quality" and press space
bar until you hear "normal"

11.   Press tab five times to get to "recognition engine" setting.  Press
space repeatedly to get to "FineReader Engine" 7 or 7.1.  The FineReader
engine versions will differ in Kurzweil 1000 V. 10 and V. 9

12.   Press enter to accept the changes you made

13.   Return to the "settings" menu by pressing ALT+t

14.   Press the up or down arrow keys to get to the "save settings" option
and press enter or press letter v

15.   In the resulting dialog box, type "paperback" and press enter.  The
settings that you have just modified are now saved in a settings file named
"paperback."  This will give you a chance to retrieve them in the future if
you find that these settings produce the best results

16.   Pick up the paperback book once again and go to the page where you
began your first scan.  Preferably, this page should not be the page that
was used to create the optimized scan results earlier.  You will be scanning
the same ten pages you scanned the first time to compare your results

17.   Place the book on your scanner.  You can scan both pages at once.
Ensure that you hold the spine down to give the scanner a chance to view all
the contents

18.   Press the scan key (F9) to start scanning the page

19.   Once the scan has completed, turn to the next page and continue
scanning, all the time making sure the hold the spine down

20.   Continue this process until you have scanned 5 times.  You should have
scanned a total of ten pages as you were scanning two pages at once

21.   Wait to have the recognition catch up-until you have ten pages of text

22.   Go to the "tools" menu (ALT+o)

23.   Press the up arrow key to get to "rank spelling" and press enter or
press letter I.  Rank spelling will tell you the percentage of spelling
errors it encountered and will also tell you what percent of the
document-the ten pages you scanned-is without errors

24.   Note down the spelling rate percentage that is given by Kurzweil 1000
and press escape to exit the dialog

25.   Close the file by pressing F4, answering no to the save dialog

 

You will most probably find that your scan results for the second scan are
better than scans from the optimized settings.  However you should go
through these steps each time you start scanning a new paperback to ensure
that you receive the best results.  Now that you already have the
"paperback" settings saved, you need not go through the steps of defining
these settings.  You can go to the "settings" menu and "load settings" for
as many times as you need.

 

You may find the steps listed above a bit intimidating.  The process of
going through these steps is not so work-intensive as it appears.

 

You can address any questions to me at pratikp1 + k1k at gmail dot com

 

Respectfully submitted by Pratik Patel

 

 

 



  _____  




No virus found in this incoming message.
Checked by AVG Free Edition.
Version: 7.1.394 / Virus Database: 268.9.8/380 - Release Date: 6/30/2006


Other related posts: