[bksvol-discuss] Re: validating questions

  • From: "Christina" <greensleeves1@xxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Mon, 15 Dec 2008 00:21:09 -0500

Hello, Chanelle.  Welcome to the Bookshare volunteer community.
First off, I'm thrilled that you are a validator who reads through each book. Mayrie already said this pretty well so I'll just leave it at that.

While it is true that we aren't supposed to change the text of the book, it is permissible to correct OCR errors. This includes adding or changing punctuation marks, or deleting them if they are obviously junk characters. For example, random semicolons sometimes appear in my scans, and I delete as many as I can find before submitting the books. You can also remove headers and footers and add page numbers if they are missing.
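
For anyone comfortable with a little scripting, two of the spacing problems raised in the original message (extra spaces between words, and a space stranded in front of a closing quotation mark) could be tidied roughly as sketched below. This is only an illustration, not a Bookshare tool: the function name tidy_ocr_spacing is made up for the example, it assumes the book is plain text with straight double quotes, and the rules are heuristics, so the result still needs to be read through by hand.

```python
import re

def tidy_ocr_spacing(text: str) -> str:
    """Heuristic cleanup of two OCR spacing problems (hypothetical helper)."""
    # Collapse runs of two or more spaces between words into a single space.
    # Leading indentation is left alone because a non-space must come first.
    text = re.sub(r"(?<=\S) {2,}(?=\S)", " ", text)
    # Remove spaces in front of a straight quote that is itself followed by
    # whitespace or the end of the text; such a quote is assumed to be a
    # closing quote that the scanner pushed away from its word.
    text = re.sub(r'(?<=\S) +"(?=\s|$)', '"', text)
    return text

sample = '"Come in, " she  said.'
cleaned = tidy_ocr_spacing(sample)
print(cleaned)
```

An opening quote such as the one in He said, "Hello." is followed by a letter rather than a space, so the second rule leaves it alone. Stray semicolons and missing periods are not handled here, since telling junk from real punctuation needs a human reader.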

Another way to get hold of the Step One comments about the book is to "release" the book on Step Two, then go back to Step One and download it again. It's a bit roundabout, but it only takes a couple of minutes.

Hope this helps.

----- Original Message -----
From: "Chanelle Hill" <chanellemh@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Sunday, December 14, 2008 10:57 PM
Subject: [bksvol-discuss] validating questions

Dear Bookshare Volunteer List,
I am a new volunteer and recently downloaded two books to validate. I
have decided to read the books cover to cover and want to fix some
errors that appear to be made by the scanner. The volunteer manual
states that adding to the text of a book is not permitted; however, in
one of the books I am validating, there are several instances where
the end of a sentence is not punctuated by a period even though the
following sentence begins with a capital letter. Could this be a
scanning error? In addition, there are often spaces between the end of
a word and the close quotation mark so the end quote looks like an
open quote. Is this a common scan error as well? In addition, some
words have apostrophes around them which should probably be
quotation marks. Sometimes dashes are inserted at inappropriate places
within a word and sometimes they are not printed when a word or name
should be hyphenated. There are often too many spaces between words at
certain points in a book. I am not sure if this is a scanning error or
if the spaces are intended for formatting. I have read sections 2, 3,
and 4 of the volunteer manual and the common scanning errors list and
found no reference to the things I have encountered. I am using a
combination of a BrailleNote classic and Microsoft Word on a Windows XP
computer for my editing.
The volunteer manual states that consistent running headers and
footers will be removed. I am validating a daily meditation book that
has the date for each day's reflection printed a line below the page
break. Would the automated tool remove this information since the
first word (the month) appears for 30-31 pages before switching to the
next month?
In the second book I am validating, the title of the book and the
current chapter are printed in the header on alternating pages.
Sometimes the text is a line below the page break and sometimes it is
not. Would I remove these headers leaving the page numbers intact?
Finally, I forgot to copy down the information on the books'
information pages before downloading. I need to contact the book
submitter to ask a question. Is there any way to access the book
information page again? I am validating Hope for Today by Al-Anon
Family Group Headquarters and A Portrait of Jane Austen by David
Cecil. In the Hope for Today book, page numbering begins at page 10,
pages 11 and 12 are not numbered though pagination is intact, and the
page numbering and pagination continue from there. The beginning
pages of the book are paginated but not numbered. I am wondering if
the pages in the book are numbered before page 10.
I apologize for this long email full of questions. Any answers you can
give would be greatly appreciated. If this list is not the place for
questions of this nature, please direct me to the appropriate source.
Thank you.

