[bksvol-discuss] Proofing the impossible proof

  • From: "Aaron Cannon" <cannona@xxxxxxxxxxxxxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Fri, 8 Aug 2008 08:42:59 -0500

Hi all.

It has bothered me for some time that Bookshare doesn't have the important
reference books that are central to any good library.  We have no
dictionary, no thesaurus, no Chicago Manual of Style, and so on.  This is
primarily because these books are huge and don't scan well.  So what
are our options?

The best option, and one that I hope Bookshare is working on, is getting the
text directly from the publisher.  Failing that, however, we may still be able
to scan and proof these books.  I would propose borrowing a concept
developed by Distributed Proofreaders, a group working with Project
Gutenberg.  Basically, volunteers visit a web page which shows an image of a
single page from a book, and a textbox with the OCR output for that same
page.  The proofer then corrects the output and submits it back to the
server, all within their web browser.  It seems to me that this concept
would work just as well for us.
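
To make the idea concrete, here is a rough sketch of the bookkeeping such a
site might do behind the web page.  This is not how Distributed Proofreaders
is actually built; the names Page, ProofQueue, next_page, submit and assemble
are all made up for illustration, and everything is kept in memory rather
than in a real database or web server.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    number: int
    image_path: str                       # scanned image shown to the proofer
    ocr_text: str                         # raw OCR output shown in the textbox
    corrected_text: Optional[str] = None  # filled in when a proofer submits


class ProofQueue:
    """Hypothetical in-memory queue of pages for one book."""

    def __init__(self, pages):
        self.pages = {p.number: p for p in pages}
        self.checked_out = set()          # pages currently being proofed

    def next_page(self):
        """Hand out the lowest-numbered page nobody has proofed or taken."""
        for number in sorted(self.pages):
            page = self.pages[number]
            if page.corrected_text is None and number not in self.checked_out:
                self.checked_out.add(number)
                return page
        return None                       # nothing left to hand out

    def submit(self, number, text):
        """Store the proofer's corrected text for a page."""
        self.pages[number].corrected_text = text
        self.checked_out.discard(number)

    def assemble(self):
        """Join the corrected pages, in page order, into one text."""
        if any(p.corrected_text is None for p in self.pages.values()):
            raise ValueError("some pages have not been proofed yet")
        return "\n".join(
            self.pages[n].corrected_text for n in sorted(self.pages)
        )
```

The point of the design is that each volunteer only ever deals with one page
at a time, so a huge book becomes many small jobs that can be done in any
order and stitched back together at the end.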

Thoughts?  Has this been considered and rejected for reasons of which I am
unaware?

Of course, the other big hurdle would be to ensure that these books are
added to the collection with a more advanced navigation structure in the
DAISY XML than is typical; otherwise they may end up being quite unusable.
What's the use of having a dictionary that is impossible to navigate?

Thanks.

Aaron



--
Skype: cannona
MSN/Windows Messenger: cannona@xxxxxxxxxxx (don't send email to the hotmail
address.)
