[bksvol-discuss] Re: Filter for Duplicates?

  • From: Rogerbailey81@xxxxxxx
  • To: bksvol-discuss@xxxxxxxxxxxxx
  • Date: Sat, 5 Sep 2009 20:06:13 EDT

I saw at least one duplicate (I now forget the title) that had two different
copyright dates, one about thirty years earlier than the other. I suppose
that those were two entirely different editions. I did not download and read
either of them, but there could have been significant differences. The key
phrase here is "could have been": they also could have been virtually
identical. In a case like this, would Bookshare consider them to be duplicates
and remove one?

                                                                          
"The end may justify the means as long as there is something that justifies
the end." Leon Trotsky

The Militant: http://www.themilitant.com
Pathfinder Press: http://www.pathfinderpress.com
Granma International: http://granma.cu/ingles/index.html

Subj: [bksvol-discuss] Re: Filter for Duplicates?
Date: 9/5/2009 5:34:21 PM Eastern Daylight Time
From: mlsestak@xxxxxxxxxxxxx
Reply-to: bksvol-discuss@xxxxxxxxxxxxx
To: bksvol-discuss@xxxxxxxxxxxxx

With one recent exception, the duplicates I have found differed in
quality rating (not PQ vs. excellent, but excellent vs. fair or good).  I
submitted a book quality report on them, and support deleted the
lower-quality one.  The exception was a PQ book that the publisher apparently
sent in twice with the authors in a different order.  Actually, there was one
other case where support said neither copy met current Bookshare standards,
so they had the outsourcers rescan it.

I can think of two very good reasons to remove such duplicates.  A
reader might come across the poorer-quality one and think Bookshare
isn't so hot, while if they'd found the excellent one first they would
think Bookshare is great.  Also, duplicates take up space on Bookshare's system.

Misha

E. wrote:
> There probably is a strong reason for Bookshare not to have two 
> different copies of a book with each copy validated by a separate set 
> of volunteers. For one thing, we all probably want to know which is 
> the best copy and use that one.
> Otherwise, we might as well have dozens of copies of a book on the 
> system and that certainly seems a bit strange to me at least.
>
> E.
>
> At 04:12 PM 9/5/2009, you wrote:
>> I have found a few of those duplicates myself. I just let them be. Is 
>> there any strong reason for Bookshare not to have duplicates?
>>
>> Subj: [bksvol-discuss] Re: Filter for Duplicates?
>> Date: 9/5/2009 3:16:27 PM Eastern Daylight Time
>> From: thoth93@xxxxxxxxxxxxx
>> Reply-to: bksvol-discuss@xxxxxxxxxxxxx
>> To: bksvol-discuss@xxxxxxxxxxxxx
>>
>> Engineering tells us they do not even have a way of checking for
>> duplicate books in the system right now, let alone ones which are
>> coming in.
>>
>> So, there certainly are duplicate copies of the same book in the
>> system now. I have found a few myself, distinguished by slightly
>> different spellings of an author's name or of the title of a book. And
>> no, I am not talking about duplicates where one is a PQ copy and the
>> other is a copy submitted by a volunteer. I mean two books each
>> submitted by a volunteer.
>>
>> E.
>>
>> At 03:07 PM 9/5/2009, you wrote:
>> >Hi,
>> >
>> >Since I don't read every post, I may have missed this if someone
>> >else mentioned it, but here goes.
>> >
>> >Have the Bookshare engineers considered having a filter built into
>> >the site so that if we submit our books and they have elements in
>> >common with books already in the collection (or even in process, if
>> >that's possible) an information box could come up informing us of
>> >this?  Whether we'd want it to appear only if two or more elements
>> >were the same, such as title and author or title and ISBN, or
>> >whether we'd like the info to come up even if just the author were
>> >the same, I don't know.  We could look at the list of similar titles
>> >and if we realized we were duplicating we could stop the process.
>> >Obviously we would all prefer to catch the fact that we had a
>> >duplicate book before we even started scanning, but if we missed
>> >that fact somehow this would be a checkpoint.  If we wished to
>> >continue for any reason we could do so and if necessary explain why
>> >in our comments.
>> >
>> >If this has all been mulled over already, I really do apologize.
>> >
>> >Ilene
>> >To unsubscribe from this list send a blank Email to
>> >bksvol-discuss-request@xxxxxxxxxxxxx
>> >put the word 'unsubscribe' by itself in the subject line.  To get a
>> >list of available commands, put the word 'help' by itself in the
>> >subject line.
>> >

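The checkpoint Ilene describes in the quoted message (warn the submitter when
a new book shares two or more elements, such as title and author or title and
ISBN, with a book already in the collection) could be sketched roughly as
follows. This is only an illustration, not anything Bookshare is known to run:
the Book record, the function names, and the 0.9 similarity threshold are all
assumptions made up for the example. Fuzzy comparison is used because the
duplicates E. mentions differed by slight spelling variations in the author's
name or the title.

```python
import re
import unicodedata
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Book:
    """Hypothetical catalog record; the field names are assumptions."""
    title: str
    author: str
    isbn: str = ""


def normalize(text: str) -> str:
    """Lowercase, drop accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", ascii_only.lower())
    return " ".join(cleaned.split())


def normalize_isbn(isbn: str) -> str:
    """Keep only digits and the X check character."""
    return re.sub(r"[^0-9Xx]", "", isbn).upper()


def similar(a: str, b: str, threshold: float = 0.9) -> bool:
    """True when two normalized strings are nearly identical.

    The 0.9 threshold is an arbitrary starting point, not a tuned value.
    """
    a, b = normalize(a), normalize(b)
    if not a or not b:
        return False
    return SequenceMatcher(None, a, b).ratio() >= threshold


def matching_fields(submission: Book, existing: Book) -> set:
    """Return which of title, author, and ISBN the two records share."""
    matches = set()
    if similar(submission.title, existing.title):
        matches.add("title")
    if similar(submission.author, existing.author):
        matches.add("author")
    sub_isbn = normalize_isbn(submission.isbn)
    if sub_isbn and sub_isbn == normalize_isbn(existing.isbn):
        matches.add("isbn")
    return matches


def possible_duplicates(submission, collection, min_matches=2):
    """List (book, shared_fields) pairs worth showing to the submitter."""
    hits = []
    for book in collection:
        fields = matching_fields(submission, book)
        if len(fields) >= min_matches:
            hits.append((book, fields))
    return hits
```

With min_matches=2 the warning appears only when, say, title and author both
match; setting it to 1 would also list books that share just the author. In
either case the submitter sees the list and decides whether to continue, as
Ilene suggests, rather than the system rejecting anything automatically.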
