[bksvol-discuss] Re: 550 books in the download queue

  • From: Guido Corona <guidoc@xxxxxxxxxx>
  • To: bksvol-discuss@xxxxxxxxxxxxx
  • Date: Wed, 11 Aug 2004 10:27:38 -0500

Furthermore,  you can find any missing or duplicate pages in a 2000 page 
books in only 11 lookups.  Use binary searching.  The complexity of the 
algorythm is only log N, where N is the number of pages in the book.

G.


Guido D. Corona
IBM Accessibility Center,  Austin Tx.
IBM Research,
Phone:  (512) 838-9735
Email: guidoc@xxxxxxxxxxx

Visit my weekly Accessibility WebLog at:
http://www-3.ibm.com/able/weblog/corona_weblog.html





"Kenneth A. Cross" <crossk@xxxxxxxxxxxx> 
Sent by: bksvol-discuss-bounce@xxxxxxxxxxxxx
08/10/2004 05:38 PM
Please respond to
bksvol-discuss


To
<bksvol-discuss@xxxxxxxxxxxxx>
cc

Subject
[bksvol-discuss] Re: 550 books in the download queue






In almost every book I have scanned, which is about a thousand, I have
rarely found the pagination quite that exact.  First, pictures are 
inserted,
sometimes on un-numbered pages.  Also, some pages, such as charts and
graphs, are either un-numbered of just plain missing.  Suppose I get a
two-thousand page book from a library and scan it perfectly, but miss one
index page.  Do you really want that book trashed?  We are not talking 
about
simple issues here.  We are talking about a group of people, I mean us, 
who
have spent most of our lives without immediate access to current books,
however bad the form.  Let's not give that up in search of perfection.
----- Original Message ----- 
From: "E." <thoth93@xxxxxxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Tuesday, August 10, 2004 4:18 PM
Subject: [bksvol-discuss] Re: 550 books in the download queue


> Search for page numbers as in 79 followed by 80 etc. until you go 
through
> the book's page numbers.  It sounds like a lot of work but really does 
not
> take much time.
>
> E.
> At 03:55 PM 8/10/2004, you wrote:
> >I've hesitated in offering my services in validating due to my concern
> >(perhaps unwarranted) that I couldn't do the maticulous job some of yu
> >obviously do based on the books I routinely read from BookShare.
> >
> >At this point, I've not purchased any of the fancy ocr packages such as
> >k100 or Open Book which would make reviewing submissions a relative
> >breeze.  In reviwing the outlined tasks involved in validating, the
> >stickler would be the one concerning making certain that the text is
> >incomplete form and no pages are mmissing.  Other than reading the 
entire
> >book, how would one go about this task without a program such as K100
> >which seems more and more impressive every time you folks discuss it?
> >
> >
>
>
>
>



Other related posts: