[bksvol-discuss] Re: Filter for Duplicates?

  • From: "E." <thoth93@xxxxxxxxxxxxx>
  • To: bksvol-discuss@xxxxxxxxxxxxx
  • Date: Sun, 06 Sep 2009 00:15:37 -0400

Different editions need to be kept, of course, as in the Benford example you mention. But there are duplicates on the system which are the same book without differences.


My point here is bookshare has no way of even finding out about books which might or might not be different and which ought to be checked for just such variation. So we are going to have to be vigilant until they get better software. Don't regular libraries have such software? Just wondering.

E.



At 08:10 PM 9/5/2009, you wrote:
As long as we remember to distinguish between duplicates of the same edition of a book, and copies of two different editions of the same book on the system. I think Bookshare should keep the latter.

I don't know whether it was an oversight or deliberate, but there are currently two copies of Gregory Benford's In the Ocean of Night on the system, but they are different editions. One is a copy that Carrie scanned and I proofread and was added last December. It is a 2004 edition, and has slightly different text from the PQ version that was added just a little over a month ago. Benford changed the dates in the newer edition of some of the events from the original text, and made some other revisions. The PQ version is an edition with the original 1977 text.

Also, Carrie's and my version has an excerpt of another book, and page numbers and page breaks. While I haven't checked it, I would bet that the PQ version has few--if any--page numbers or page breaks.

I hope that both those copies remain on the system.

Evan

----- Original Message -----
From: "Soronel Haetir" <soronel.haetir@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Saturday, September 05, 2009 7:24 PM
Subject: [bksvol-discuss] Re: Filter for Duplicates?


If storage space is a problem then bookshare has more issues than
duplicate books in the collection.

On 9/5/09, Mike <mlsestak@xxxxxxxxxxxxx> wrote:
With one recent exception, the duplicates I have found were different in
quality rating (not PQ vs excellent, but excellent vs fair or good).  I
submitted a book quality report on them and support deleted the lower
quality one.  The exception was a PQ book that the publisher apparently
sent in twice with different author order.  Actually, there was one
other case where support said neither met current bookshare standards,
so they had the outsourcers rescan it.

I can think of two very good reasons to remove such duplicates.  A
reader might come across the poorer quality one and think bookshare
isn't so hot, while if they'd found the excellent one first they would
think bookshare is great.  Also, they take up space on bookshare's system.

Misha

E. wrote:
There probably is a strong reason for bookshare not to have two
different copies of a book with each copy validated by a separate set
of volunteers. For one thing, we all probably want to know which is
the best copy and use that one.
Otherwise, we might as well have dozens of copies of a book on the
system and that certainly seems a bit strange to me at least.

E.

At 04:12 PM 9/5/2009, you wrote:
I have found a few of those duplicates myself. I just let them be. Is
there any strong reason for Bookshare not to have duplicates?


"The end may justify the means as long as there is something that
justifies the end." Leon Trotsky

The Militant: http://www.themilitant.com
Pathfinder Press: http://www.pathfinderpress.com
Granma International: http://granma.cu/ingles/index.html

Subj: [bksvol-discuss] Re: Filter for Duplicates?
Date: 9/5/2009 3:16:27 PM Eastern Daylight Time
From: thoth93@xxxxxxxxxxxxx
Reply-to: bksvol-discuss@xxxxxxxxxxxxx
To: bksvol-discuss@xxxxxxxxxxxxx

Engineering tells us they do not even have a way of checking for
duplicate books in the system right now, let alone ones which are
coming in.

So, there certainly are duplicate copies of the same book in the
system now. I have found a few myself, distinguished by slightly
different spellings of an author's name or of the title of a book. And
no, I am not talking about duplicates where one is a PQ copy and the
other is a copy submitted by a volunteer. I mean two books each
submitted by a volunteer.

E.

At 03:07 PM 9/5/2009, you wrote:
>Hi,
>
>Since I don't read every post, I may have missed this if someone
>else mentioned it, but here goes.
>
>Have the Bookshare engineers considered having a filter built into
>the site so that if we submit our books and they have elements in
>common with books already in the collection (or even in process, if
>that's possible) an information box could come up informing us of
>this?  Whether we'd want it to appear only if two or more elements
>were the same, such as title and author or title and ISBN, or
>whether we'd like the info to come up even if just the author were
>the same, I don't know.  We could look at the list of similar titles
>and if we realized we were duplicating we could stop the process.
>Obviously we would all prefer to catch the fact that we had a
>duplicate book before we even started scanning, but if we missed
>that fact somehow this would be a checkpoint.  If we wished to
>continue for any reason we could do so and if necessary explain why
>in our comments.
>
>If this has all been mulled over already, I really do apologize.
>
>Ilene
>To unsubscribe from this list send a blank Email to
>bksvol-discuss-request@xxxxxxxxxxxxx
>put the word 'unsubscribe' by itself in the subject line.  To get a
>list of available commands, put the word 'help' by itself in the
>subject line.
>
>
>__________ Information from ESET NOD32 Antivirus, version of virus
>signature database 4398 (20090905) __________
>
>The message was checked by ESET NOD32 Antivirus.
>
>http://www.eset.com
>
>
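
For what it is worth, the checkpoint Ilene describes can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how the Bookshare site works: the Book record, the normalization rules, and the min_shared threshold are all invented for the example. It compares title, author, and ISBN after tidying up case, punctuation, and word order, then reports any existing book that shares at least two of those elements with the new submission.

```python
from __future__ import annotations

import re
import unicodedata
from dataclasses import dataclass


@dataclass(frozen=True)
class Book:
    """Hypothetical record; the real catalogue fields are not known here."""
    title: str
    author: str
    isbn: str = ""


def normalize(text: str) -> str:
    """Lowercase, drop accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", ascii_only.lower())
    return " ".join(cleaned.split())


def author_key(author: str) -> str:
    """Sort the name words so 'Benford, Gregory' equals 'Gregory Benford'."""
    return " ".join(sorted(normalize(author).split()))


def isbn_key(isbn: str) -> str:
    """Keep only digits and the X check character."""
    return re.sub(r"[^0-9Xx]", "", isbn).upper()


def shared_elements(a: Book, b: Book) -> set[str]:
    """Return which of title, author, and ISBN the two records have in common."""
    shared = set()
    title_a = normalize(a.title)
    if title_a and title_a == normalize(b.title):
        shared.add("title")
    author_a = author_key(a.author)
    if author_a and author_a == author_key(b.author):
        shared.add("author")
    isbn_a = isbn_key(a.isbn)
    if isbn_a and isbn_a == isbn_key(b.isbn):
        shared.add("isbn")
    return shared


def possible_duplicates(submission: Book, collection: list[Book],
                        min_shared: int = 2) -> list[tuple[Book, set[str]]]:
    """List existing books sharing at least min_shared elements with the submission."""
    hits = []
    for book in collection:
        shared = shared_elements(submission, book)
        if len(shared) >= min_shared:
            hits.append((book, shared))
    return hits


if __name__ == "__main__":
    collection = [Book("In the Ocean of Night", "Gregory Benford")]
    submission = Book("in the ocean of night.", "Benford, Gregory")
    for book, shared in possible_duplicates(submission, collection):
        print("Possible duplicate:", book.title, "shares", sorted(shared))
```

A check like this only flags candidates. Two editions of the same book, as in the Benford case, would still match on title and author, so the submitter would look at the list and decide whether to continue and explain why in the comments. Setting min_shared to 1 would give the looser "same author only" behaviour.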


--
Soronel Haetir
soronel.haetir@xxxxxxxxx