Furthermore, you can find any missing or duplicate pages in a 2000 page books in only 11 lookups. Use binary searching. The complexity of the algorythm is only log N, where N is the number of pages in the book. G. Guido D. Corona IBM Accessibility Center, Austin Tx. IBM Research, Phone: (512) 838-9735 Email: guidoc@xxxxxxxxxxx Visit my weekly Accessibility WebLog at: http://www-3.ibm.com/able/weblog/corona_weblog.html "Kenneth A. Cross" <crossk@xxxxxxxxxxxx> Sent by: bksvol-discuss-bounce@xxxxxxxxxxxxx 08/10/2004 05:38 PM Please respond to bksvol-discuss To <bksvol-discuss@xxxxxxxxxxxxx> cc Subject [bksvol-discuss] Re: 550 books in the download queue In almost every book I have scanned, which is about a thousand, I have rarely found the pagination quite that exact. First, pictures are inserted, sometimes on un-numbered pages. Also, some pages, such as charts and graphs, are either un-numbered of just plain missing. Suppose I get a two-thousand page book from a library and scan it perfectly, but miss one index page. Do you really want that book trashed? We are not talking about simple issues here. We are talking about a group of people, I mean us, who have spent most of our lives without immediate access to current books, however bad the form. Let's not give that up in search of perfection. ----- Original Message ----- From: "E." <thoth93@xxxxxxxxxxxxx> To: <bksvol-discuss@xxxxxxxxxxxxx> Sent: Tuesday, August 10, 2004 4:18 PM Subject: [bksvol-discuss] Re: 550 books in the download queue > Search for page numbers as in 79 followed by 80 etc. until you go through > the book's page numbers. It sounds like a lot of work but really does not > take much time. > > E. > At 03:55 PM 8/10/2004, you wrote: > >I've hesitated in offering my services in validating due to my concern > >(perhaps unwarranted) that I couldn't do the maticulous job some of yu > >obviously do based on the books I routinely read from BookShare. > > > >At this point, I've not purchased any of the fancy ocr packages such as > >k100 or Open Book which would make reviewing submissions a relative > >breeze. In reviwing the outlined tasks involved in validating, the > >stickler would be the one concerning making certain that the text is > >incomplete form and no pages are mmissing. Other than reading the entire > >book, how would one go about this task without a program such as K100 > >which seems more and more impressive every time you folks discuss it? > > > > > > > >