[bksvol-discuss] Re: Some Newbie Questions

  • From: "Larry Lumpkin" <llumpkin@xxxxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Sat, 7 Nov 2015 18:54:47 -0600

Hi William. I’ve been scanning since 2001 or 2002. I personally don’t do much
checking of my scans. I do spot check pages during my scan to see if there are
any major problems that might make me abort the scan. I use k1000 and try to
get the best scan I can but leave correction up to the proofer. I set my
confidence in Kurzweil to 99.5 and if the confidence goes much lower than that
(such as 99.1 or 99.2) I abort the scan.

From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of William Korn
(Redacted sender "willythekorn" for DMARC)
Sent: Saturday, November 07, 2015 6:29 PM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Some Newbie Questions

I'm about halfway through scanning my first book to submit to Bookshare. I've
studied such instructions as there are on the website (and even tried to link
to the Sanncer FAQ, but that link seems to be dead.)

Although I understand that the scanner is not to proofread the book, I took the
liberty of proofing the first 40 pages, more to see how well the scanner and
the OCR software were functioning than anything else. It's functioning pretty
well, I guess. I found about a dozen "scannos" (which I also took the liberty
to correct). Most of them were either incorrect letters produced from two
other letters, or problems with the software interpreting the "1" in a page
number as an "I", others were words broken across a line in the book, but not
in the resulting .RTF file (yet still including the dash)..

In the guidelines the scanner is asked to review the .RTF file for "minimal
scannos", which leads to three questions:

1) How is "minimal" defined? Are the scannos I found above "minimal"?

2) Should I correct those scannos before uploading?

3) How is reviewing for scannos different than proofreading? They weren't
obvious things like junk characters. I would not have found them unless I read
the .RTF file.

My other major question is, how do proofers proofread the book without having
the book itself in hand? This particular edition of the book was printed in
1973 (the original copyright was 1934) and is long out of print. Subsequent
editions have been printed since then by other publishers.

William Korn

Other related posts: